Maintaining synchronization of streaming audio and video using internet protocol
Abstract
A playback system in a digital cinema network synchronizes the presentation of visual and aural content by deriving timing information for packets of information that are conveyed in video and audio data streams, examining the timing information to determine if any misalignment between the two data streams is likely to be perceptible and, if the misalignment is deemed to be perceptible, introducing delays into one or both data streams to correct the misalignment. If the audio data stream precedes the video data stream, the audio data stream is delayed by an integer number of audio sample periods. If the video data stream precedes the audio data stream, the video data stream is delayed by an integer number of video frames and the audio data stream is delayed by an integer number of audio sample periods.
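The correction described above can be sketched in a few lines. This is only an illustrative sketch, not the patented implementation: the function name `plan_correction`, the `Correction` type, the 48 kHz sample rate, the 24 frames-per-second rate, and both threshold values are assumptions chosen for the example, and the perceptual model is reduced to a simple pair of asymmetric thresholds. The inputs are the times at which an audio packet and its corresponding video packet would be presented if nothing were corrected.

```python
import math
from dataclasses import dataclass


@dataclass
class Correction:
    video_delay_frames: int    # whole video frames to delay the video stream
    audio_delay_samples: int   # whole audio sample periods to delay the audio stream


def plan_correction(audio_time, video_time,
                    sample_rate=48000, frame_rate=24.0,
                    audio_early_threshold=0.015,   # assumed value, seconds
                    audio_late_threshold=0.045):   # assumed value, seconds
    """Return the delays needed to realign one audio/video packet pair.

    audio_time and video_time are presentation times in seconds for content
    that should be presented simultaneously.
    """
    offset = video_time - audio_time  # > 0 means the audio precedes the video

    if offset > audio_early_threshold:
        # Audio is early: delay audio only, by an integer number of samples.
        return Correction(0, round(offset * sample_rate))

    if -offset > audio_late_threshold:
        # Video is early: delay video by whole frames (rounding up so the
        # video never stays ahead), then delay audio by the leftover interval.
        lag = -offset
        frames = math.ceil(lag * frame_rate)
        residual = frames / frame_rate - lag
        return Correction(frames, round(residual * sample_rate))

    # Misalignment is treated as imperceptible: leave both streams alone.
    return Correction(0, 0)


# Audio arrives 100 ms after the video at 24 fps and 48 kHz:
# the video is held for 3 frames (125 ms), the audio for the remaining 25 ms.
print(plan_correction(audio_time=0.100, video_time=0.000))
```

Rounding the video delay up to a whole frame and absorbing the remainder in the audio path is one way to honour the integer-frame and integer-sample constraints stated in the abstract; a real perceptual model would replace the two fixed thresholds.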
255 Citations
14 Claims
1. A method for processing audio content and video content of an audio/video program, wherein the method comprises:
receiving a sequence of audio packets conveying first control information and the audio content and obtaining from the first control information audio time information for a respective audio packet in the sequence of audio packets;
receiving a sequence of video packets conveying second control information and the video content and obtaining from the second control information video time information for a respective video packet in the sequence of video packets;
determining from the audio time information and the video time information a relative time alignment of the respective audio packet with the respective video packet and applying a perceptual model to the relative time alignment to obtain a measure of perceived misalignment between the respective audio packet and the respective video packet;
if the measure of perceived misalignment indicates the respective audio packet precedes the desired time alignment with the respective video packet by an interval that exceeds a first threshold, delaying all or part of the respective audio packet by a first amount such that according to the perceptual model a perceived misalignment between the delayed respective audio packet and the respective video packet is reduced; and
if the measure of perceived misalignment indicates the respective audio packet follows the desired time alignment with the respective audio packet by an interval that exceeds a second threshold, delaying the respective video packet by a second amount and delaying all or part of the respective audio packet by a third amount such that according to the perceptual model a perceived misalignment between the delayed respective audio packet and the delayed respective video packet is reduced.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A medium conveying a program of instructions that is executable by a device to perform a method for processing audio content and video content of an audio/video program, wherein the method comprises:
receiving a sequence of audio packets conveying first control information and the audio content and obtaining from the first control information audio time information for a respective audio packet in the sequence of audio packets;
receiving a sequence of video packets conveying second control information and the video content and obtaining from the second control information video time information for a respective video packet in the sequence of video packets;
determining from the audio time information and the video time information a relative time alignment of the respective audio packet with the respective video packet and applying a perceptual model to the relative time alignment to obtain a measure of perceived misalignment between the respective audio packet and the respective video packet;
if the measure of perceived misalignment indicates the respective audio packet precedes the desired time alignment with the respective video packet by an interval that exceeds a first threshold, delaying all or part of the respective audio packet by a first amount such that according to the perceptual model a perceived misalignment between the delayed respective audio packet and the respective video packet is reduced; and
if the measure of perceived misalignment indicates the respective audio packet follows the desired time alignment with the respective audio packet by an interval that exceeds a second threshold, delaying the respective video packet by a second amount and delaying all or part of the respective audio packet by a third amount such that according to the perceptual model a perceived misalignment between the delayed respective audio packet and the delayed respective video packet is reduced.
Dependent claims: 9, 10, 11, 12, 13, 14
Specification