AUDIO VIDEO OFFSET DETECTOR
First Claim
1. A method of measuring a temporal offset between video content and audio content in a system for transmitting video, the video being subject to transmission over a media channel, the method comprising the steps of:
- prior to transmission of the video over the media channel;
obtaining reference audio fingerprint data from the audio content and reference video fingerprint data from the video content, wherein each of the reference audio fingerprint data and the reference video fingerprint data includes associated time stamp data; and
subsequent to the transmission of the video over the media channel;
obtaining target audio fingerprint data from the audio content and target video fingerprint data from the video content, wherein each of the target audio fingerprint data and the target video fingerprint data includes associated time stamp data; and
determining a temporal offset between the video content and the audio content based at least on the time stamp data associated with the reference video fingerprint data, the time stamp data associated with the target video fingerprint data, the time stamp data associated with the reference audio fingerprint data, and the time stamp data associated with the target audio fingerprint data.
5 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods of measuring a temporal offset between audio content and video content that employ audio fingerprints from an audio signal in the audio content, and video fingerprints from video frames in the video content. The systems obtain reference audio and video fingerprints prior to transmission of video over a media channel, and obtain target audio and video fingerprints subsequent to transmission of the video over the media channel. Each fingerprint has an associated time stamp. Using the reference and target audio fingerprints and their associated time stamps, the systems determine an audio time stamp offset. Using the reference and target video fingerprints and their associated time stamps, the systems determine a video time stamp offset. Using the audio and video time stamp offsets, the systems determine a temporal offset between the video content and the audio content introduced by the media channel.
-
Citations
32 Claims
-
1. A method of measuring a temporal offset between video content and audio content in a system for transmitting video, the video being subject to transmission over a media channel, the method comprising the steps of:
-
prior to transmission of the video over the media channel; obtaining reference audio fingerprint data from the audio content and reference video fingerprint data from the video content, wherein each of the reference audio fingerprint data and the reference video fingerprint data includes associated time stamp data; and subsequent to the transmission of the video over the media channel; obtaining target audio fingerprint data from the audio content and target video fingerprint data from the video content, wherein each of the target audio fingerprint data and the target video fingerprint data includes associated time stamp data; and determining a temporal offset between the video content and the audio content based at least on the time stamp data associated with the reference video fingerprint data, the time stamp data associated with the target video fingerprint data, the time stamp data associated with the reference audio fingerprint data, and the time stamp data associated with the target audio fingerprint data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
-
31. A system for measuring a temporal offset between video content and audio content, the video content and the audio content being included in video that is subject to transmission over a media channel, the system comprising:
-
a reference audio fingerprint extractor operative, prior to the transmission of the video over the media channel, to obtain reference audio fingerprint data from the audio content; a reference video fingerprint extractor operative, prior to the transmission of the video over the media channel, to obtain reference video fingerprint data from the video content, wherein each of the reference audio fingerprint data and the reference video fingerprint data includes associated time stamp data; a target audio fingerprint extractor operative, subsequent to the transmission of the video over the media channel, to obtain target audio fingerprint data from the audio content; a target video fingerprint extractor operative, subsequent to the transmission of the video over the media channel, to obtain target video fingerprint data from the video content, wherein each of the target audio fingerprint data and the target video fingerprint data includes associated time stamp data, wherein the reference audio fingerprint data includes one or more reference audio fingerprints characteristic of the audio content prior to the transmission of the video over the media channel, and wherein the target audio fingerprint data includes at least one target audio fingerprint characteristic of the audio content subsequent to the transmission of the video over the media channel; an audio fingerprint matcher operative (a) to perform fingerprint matching of the target audio fingerprint against one or more of the reference audio fingerprints to obtain one or more reference audio fingerprints that at least substantially match the target audio fingerprint, and, (b) through use of at least the time stamp data associated with the target audio fingerprint data and the time stamp data associated with the reference audio fingerprint data, to obtain one or more audio time stamp offsets indicative of one or more temporal offsets between the target audio fingerprint and one or more of the reference audio fingerprints that at least substantially match the target audio fingerprint, wherein the reference video fingerprint data includes one or more reference video fingerprints characteristic of the video content prior to the transmission of the video over the media channel, and wherein the target video fingerprint data includes at least one target video fingerprint characteristic of the video content subsequent to the transmission of the video over the media channel; a video fingerprint matcher operative (a) to perform fingerprint matching of the target video fingerprint against one or more of the reference video fingerprints to obtain one or more reference video fingerprints that at least substantially match the target video fingerprint, and, (b) through use of at least the time stamp data associated with the target video fingerprint data and the time stamp data associated with the reference video fingerprint data, to obtain one or more video time stamp offsets indicative of one or more temporal offsets between the target video fingerprint and one or more of the reference video fingerprints that at least substantially match the target video fingerprint; and an offset estimator operative, through use of at least (a) one or more of the audio time stamp offsets and (b) one or more of the video time stamp offsets, to obtain a value indicative of the temporal offset between the video content and audio content.
-
-
32. A system for measuring a temporal offset between video content and audio content, the video content and the audio content being included in video that is subject to transmission over a media channel, the system comprising:
-
a reference fingerprint database operative, prior to the transmission of the video over the media channel, to store reference audio fingerprint data from the audio content and reference video fingerprint data from the video content, wherein each of the reference audio fingerprint data and the reference video fingerprint data includes associated time stamp data; a target audio fingerprint extractor operative, subsequent to the transmission of the video over the media channel, to obtain target audio fingerprint data from the audio content; a target video fingerprint extractor operative, subsequent to the transmission of the video over the media channel, to obtain target video fingerprint data from the video content, wherein each of the target audio fingerprint data and the target video fingerprint data includes associated time stamp data, wherein the reference audio fingerprint data includes one or more reference audio fingerprints characteristic of the audio content prior to the transmission of the video over the media channel, and wherein the target audio fingerprint data includes at least one target audio fingerprint characteristic of the audio content subsequent to the transmission of the video over the media channel; an audio fingerprint matcher operative (a) to perform fingerprint matching of the target audio fingerprint against one or more of the reference audio fingerprints to obtain one or more reference audio fingerprints that at least substantially match the target audio fingerprint, and, (b) through use of at least the time stamp data associated with the target audio fingerprint data and the time stamp data associated with the reference audio fingerprint data, to obtain one or more audio time stamp offsets indicative of one or more temporal offsets between the target audio fingerprint and one or more of the reference audio fingerprints that at least substantially match the target audio fingerprint, wherein the reference video fingerprint data includes one or more reference video fingerprints characteristic of the video content prior to the transmission of the video over the media channel, and wherein the target video fingerprint data includes at least one target video fingerprint characteristic of the video content subsequent to the transmission of the video over the media channel; a video fingerprint matcher operative (a) to perform fingerprint matching of the target video fingerprint against one or more of the reference video fingerprints to obtain one or more reference video fingerprints that at least substantially match the target video fingerprint, and, (b) through use of at least the time stamp data associated with the target video fingerprint data and the time stamp data associated with the reference video fingerprint data, to obtain one or more video time stamp offsets indicative of one or more temporal offsets between the target video fingerprint and one or more of the reference video fingerprints that at least substantially match the target video fingerprint; and an offset estimator operative, through use of at least (a) one or more of the audio time stamp offsets and (b) one or more of the video time stamp offsets, to obtain a value indicative of the temporal offset between the video content and audio content.
-
Specification