From the manual:

"Multimedia files have to be loaded from virtual memory before playing, so there can be a few milliseconds delay at the beginning or every loop repetition, depending on the hardware and the filter software. For avoiding this audible delay on audio files, use snd_loop for looping music in .wav or .ogg format."

In short:
I never used videos, but media_play with sounds lacked always very much.