Variable rate video playback with synchronized audio
First Claim
1. For use with an audiovisual display system in which an original set of audio data and a related original set of video data can be used to generate an audiovisual display at a normal display rate, a method for enabling the apparent display rate of the audiovisual display to be varied from the normal display rate, the method comprising the steps of:
- defining a correspondence between the original set of audio data and the original set of video data such that the original set of audio data is synchronized with the original set of video data;
determining a target display rate or rates for the audiovisual display including evaluating the audio and/or video data to automatically determine the value of the target display rate including;
analyzing the original set of audio data including ascertaining the stress with which spoken portions of the audio data are uttered; and
calculating the target display rate based upon the analysis of the original set of audio data including calculating the target display rate or rates based upon the relative stresses of the spoken portions of the audio data;
creating a blended audio sample, wherein the blended audio sample is associated with a portion of a first audio sample and a portion of an adjacent second audio sample;
creating a modified set of audio data, based upon the target display rate or rates and an evaluation of the content of the original set of audio data, that corresponds to the original set of audio data; and
creating a modified set of video data, wherein the modified set of video data is associated with the blended audio sample, and wherein the modified set of video data is based upon the modified set of audio data, the correspondence between the modified set of audio data and the original set of audio data, and the correspondence between the original set of audio data and the original set of video data such that the modified set of video data is synchronized with the modified set of audio data.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention enables the apparent display rate of an audiovisual display to be varied. The invention can modify an original set of audio data in accordance with a target display rate, then modify a related original set of video data to conform to the modifications made to the audio data set, such that the modified audio and video data sets are synchronized. When the modified audio and video data sets so produced are used to generate an audiovisual display, the audiovisual display has an apparent display rate that approximates the target display rate. The target display rate can be faster or slower than a normal display rate at which an audiovisual display system generates an audiovisual display from the original sets of audio and video data. The target display rate can be established solely by a user instruction, by analysis of the audiovisual data, or by modification of a user-specified nominal target display rate based upon analysis of the audiovisual data. Preferably, the method for modifying the original audio data set is one that produces a modified audio data set that can be used to generate an audio display having little or no distortion.
45 Citations
14 Claims
-
1. For use with an audiovisual display system in which an original set of audio data and a related original set of video data can be used to generate an audiovisual display at a normal display rate, a method for enabling the apparent display rate of the audiovisual display to be varied from the normal display rate, the method comprising the steps of:
-
defining a correspondence between the original set of audio data and the original set of video data such that the original set of audio data is synchronized with the original set of video data;
determining a target display rate or rates for the audiovisual display including evaluating the audio and/or video data to automatically determine the value of the target display rate including;
analyzing the original set of audio data including ascertaining the stress with which spoken portions of the audio data are uttered; and
calculating the target display rate based upon the analysis of the original set of audio data including calculating the target display rate or rates based upon the relative stresses of the spoken portions of the audio data;
creating a blended audio sample, wherein the blended audio sample is associated with a portion of a first audio sample and a portion of an adjacent second audio sample;
creating a modified set of audio data, based upon the target display rate or rates and an evaluation of the content of the original set of audio data, that corresponds to the original set of audio data; and
creating a modified set of video data, wherein the modified set of video data is associated with the blended audio sample, and wherein the modified set of video data is based upon the modified set of audio data, the correspondence between the modified set of audio data and the original set of audio data, and the correspondence between the original set of audio data and the original set of video data such that the modified set of video data is synchronized with the modified set of audio data. - View Dependent Claims (2)
-
-
3. For use with an audiovisual display system in which an original set of audio data and a related original set of video data can be used to generate an audiovisual display at a normal display rate, a method for enabling the apparent display rate of the audiovisual display to be varied from the normal display rate, the method comprising the steps of:
-
defining a correspondence between the original set of audio data and the original set of video data such that the original set of audio data is synchronized with the original set of video data;
determining a target display rate or rates for the audiovisual display, including evaluating the audio and/or video data to automatically determine the value of the target display rate, including;
analyzing the original set of audio data, including ascertaining the speaking rate at which spoken portions of the audio data are uttered; and
calculating the target display rate based upon the analysis of the original set of audio data, including calculating the target display rate or rates based upon the relative speeds of the spoken portions of the audio data;
creating a blended audio sample, wherein the blended audio sample is associated with a portion of a first audio sample and a portion of an adjacent second audio sample;
creating a modified set of audio data, based upon the target display rate or rates and an evaluation of the content of the original set of audio data, that corresponds to the original set of audio data; and
creating a modified set of video data, wherein the modified set of video data is associated with the blended audio sample, and wherein the modified set of video data is based upon the modified set of audio data, the correspondence between the modified set of audio data and the original set of audio data, and the correspondence between the original set of audio data and the original set of video data such that the modified set of video data is synchronized with the modified set of audio data. - View Dependent Claims (4)
-
-
5. For use with an audiovisual display system in which an original set of audio data and a related original set of video data can be used to generate an audiovisual display at a normal display rate, a method for enabling the apparent display rate of the audiovisual display to be varied from the normal display rate, the method comprising the steps of:
-
defining a correspondence between the original set of audio data and the original set of video data such that the original set of audio data is synchronized with the original set of video data;
determining a target display rate or rates for the audiovisual display, including evaluating the audio and/or video data to automatically determine the value of the target display rate, including;
analyzing the original set of audio data, including;
ascertaining the stress with which spoken portions of the audio data are uttered;
ascertaining the speaking rate at which spoken portions of the audio data are uttered; and
combining corresponding stresses and speaking rates to produce audio tension values for the spoken portions; and
calculating the target display rate based upon the analysis of the original set of audio data, including calculating the target display rate or rates based upon the audio tension values of the spoken portions of the audio data;
creating a blended audio sample, wherein the blended audio sample is associated with a portion of a first audio sample and a portion of an adjacent second audio sample;
creating a modified set of audio data, based upon the target display rate or rates and an evaluation of the content of the original set of audio data, that corresponds to the original set of audio data; and
creating a modified set of video data, wherein the modified set of video data is associated with the blended audio sample, and wherein the modified set of video data is based upon the modified set of audio data, the correspondence between the modified set of audio data and the original set of audio data, and the correspondence between the original set of audio data and the original set of video data such that the modified set of video data is synchronized with the modified set of audio data. - View Dependent Claims (6)
-
-
7. For use with an audiovisual display system in which an original set of audio data and a related original set of video data can be used to generate an audiovisual display at a normal display rate, a method for enabling the apparent display rate of the audiovisual display to be varied from the normal display rate, the method comprising the steps of:
-
defining a correspondence between the original set of audio data and the original set of video data such that the original set of audio data is synchronized with the original set of video data;
determining a target display rate or rates for the audiovisual display;
creating a blended audio sample, wherein the blended audio sample is associated with a portion of a first audio sample and a portion of an adjacent second audio sample;
creating a modified set of audio data, based upon the target display rate or rates and an evaluation of the content of the original set of audio data, that corresponds to the original set of audio data, including;
(i) dividing the original set of audio data into a plurality of segments, each segment representing a contiguous portion of the original set of audio data that occurs during a specified duration of time, each segment being adjacent to one or two other segments such that there are no gaps between segments and adjacent segments do not overlap;
(ii) selecting a first segment;
(iii) selecting a second segment, the second segment temporally adjacent to the first segment;
(iv) overlapping an end portion of the first segment with an end portion of the second segment that is adjacent to the first segment, the end portion of the first segment including first segment overlap data and the end portion of the second segment including second segment overlap data;
(v) identifying as part of the modified set of audio data the audio data from the first segment that is not part of the first segment overlap data; and
(vi) blending corresponding first segment overlap data and second segment overlap data; and
(vii) determining whether there are additional segments in the original set of audio data that have not been overlapped with an adjacent segment, wherein;
if there are additional segments, the following steps are further performed;
(viii) combining the blended overlap data with the audio data from the second segment that is not part of the second segment overlap data;
(ix) selecting the combined data as a new first segment; and
(x) selecting a new second segment that is temporally adjacent to the new first segment and that has not previously been selected as a segment; and
(xi) repeating steps (i) through (vii); and
if there are not additional segments, the following step is further performed;
(xii) identifying as part of the modified set of audio data the blended data and the audio data from the second segment that is not part of the second segment overlap data;
and creating a modified set of video data, wherein the modified set of video data is associated with the blended audio sample, and wherein the modified set of video data is based upon the modified set of audio data, the correspondence between the modified set of audio data and the original set of audio data, and the correspondence between the original set of audio data and the original set of video data such that the modified set of video data is synchronized with the modified set of audio data.
-
-
8. A computer program product for use with an audiovisual display system in which an original set of audio data and a related original set of video data can be used to generate an audiovisual display at a normal display rate, for enabling the apparent display rate of the audiovisual display to be varied from the normal display rate, the computer program product being embodied in a computer readable medium and comprising computer instructions for:
-
defining a correspondence between the original set of audio data and the original set of video data such that the original set of audio data is synchronized with the original set of video data;
determining a target display rate or rates for the audiovisual display including evaluating the audio and/or video data to automatically determine the value of the target display rate including;
analyzing the original set of audio data including ascertaining the stress with which spoken portions of the audio data are uttered; and
calculating the target display rate based upon the analysis of the original set of audio data including calculating the target display rate or rates based upon the relative stresses of the spoken portions of the audio data;
creating a blended audio sample, wherein the blended audio sample is associated with a portion of a first audio sample and a portion of an adjacent second audio sample;
creating a modified set of audio data, based upon the target display rate or rates and an evaluation of the content of the original set of audio data, that corresponds to the original set of audio data; and
creating a modified set of video data, wherein the modified set of video data is associated with the blended audio sample, and wherein the modified set of video data is based upon the modified set of audio data, the correspondence between the modified set of audio data and the original set of audio data, and the correspondence between the original set of audio data and the original set of video data such that the modified set of video data is synchronized with the modified set of audio data. - View Dependent Claims (9)
-
-
10. A computer program product for use with an audiovisual display system in which an original set of audio data and a related original set of video data can be used to generate an audiovisual display at a normal display rate, for enabling the apparent display rate of the audiovisual display to be varied from the normal display rate, the computer program product being embodied in a computer readable medium and comprising computer instructions for:
-
defining a correspondence between the original set of audio data and the original set of video data such that the original set of audio data is synchronized with the original set of video data;
determining a target display rate or rates for the audiovisual display, including evaluating the audio and/or video data to automatically determine the value of the target display rate, including;
analyzing the original set of audio data, including ascertaining the speaking rate at which spoken portions of the audio data are uttered; and
calculating the target display rate based upon the analysis of the original set of audio data, including calculating the target display rate or rates based upon the relative speeds of the spoken portions of the audio data;
creating a blended audio sample, wherein the blended audio sample is associated with a portion of a first audio sample and a portion of an adjacent second audio sample;
creating a modified set of audio data, based upon the target display rate or rates and an evaluation of the content of the original set of audio data, that corresponds to the original set of audio data; and
creating a modified set of video data, wherein the modified set of video data is associated with the blended audio sample, and wherein the modified set of video data is based upon the modified set of audio data, the correspondence between the modified set of audio data and the original set of audio data, and the correspondence between the original set of audio data and the original set of video data such that the modified set of video data is synchronized with the modified set of audio data. - View Dependent Claims (11)
-
-
12. A computer program product for use with an audiovisual display system in which an original set of audio data and a related original set of video data can be used to generate an audiovisual display at a normal display rate, for enabling the apparent display rate of the audiovisual display to be varied from the normal display rate, the computer program product being embodied in a computer readable medium and comprising computer instructions for:
-
defining a correspondence between the original set of audio data and the original set of video data such that the original set of audio data is synchronized with the original set of video data;
determining a target display rate or rates for the audiovisual display, including evaluating the audio and/or video data to automatically determine the value of the target display rate, including;
analyzing the original set of audio data, including;
ascertaining the stress with which spoken portions of the audio data are uttered;
ascertaining the speaking rate at which spoken portions of the audio data are uttered; and
combining corresponding stresses and speaking rates to produce audio tension values for the spoken portions; and
calculating the target display rate based upon the analysis of the original set of audio data, including calculating the target display rate or rates based upon the audio tension values of the spoken portions of the audio data;
creating a blended audio sample, wherein the blended audio sample is associated with a portion of a first audio sample and a portion of an adjacent second audio sample;
creating a modified set of audio data, based upon the target display rate or rates and an evaluation of the content of the original set of audio data, that corresponds to the original set of audio data; and
creating a modified set of video data, wherein the modified set of video data is associated with the blended audio sample, and wherein the modified set of video data is based upon the modified set of audio data, the correspondence between the modified set of audio data and the original set of audio data, and the correspondence between the original set of audio data and the original set of video data such that the modified set of video data is synchronized with the modified set of audio data. - View Dependent Claims (13)
-
-
14. A computer program product for use with an audiovisual display system in which an original set of audio data and a related original set of video data can be used to generate an audiovisual display at a normal display rate, for enabling the apparent display rate of the audiovisual display to be varied from the normal display rate, the computer program product being embodied in a computer readable medium and comprising computer instructions for:
-
defining a correspondence between the original set of audio data and the original set of video data such that the original set of audio data is synchronized with the original set of video data;
determining a target display rate or rates for the audiovisual display;
creating a blended audio sample, wherein the blended audio sample is associated with a portion of a first audio sample and a portion of an adjacent second audio sample;
creating a modified set of audio data, based upon the target display rate or rates and an evaluation of the content of the original set of audio data, that corresponds to the original set of audio data, including;
(i) dividing the original set of audio data into a plurality of segments, each segment representing a contiguous portion of the original set of audio data that occurs during a specified duration of time, each segment being adjacent to one or two other segments such that there are no gaps between segments and adjacent segments do not overlap;
(ii) selecting a first segment;
(iii) selecting a second segment, the second segment temporally adjacent to the first segment;
(iv) overlapping an end portion of the first segment with an end portion of the second segment that is adjacent to the first segment, the end portion of the first segment including first segment overlap data and the end portion of the second segment including second segment overlap data;
(v) identifying as part of the modified set of audio data the audio data from the first segment that is not part of the first segment overlap data; and
(vi) blending corresponding first segment overlap data and second segment overlap data; and
(vii) determining whether there are additional segments in the original set of audio data that have not been overlapped with an adjacent segment, wherein;
if there are additional segments, the following steps are further performed;
(viii) combining the blended overlap data with the audio data from the second segment that is not part of the second segment overlap data;
(ix) selecting the combined data as a new first segment; and
(x) selecting a new second segment that is temporally adjacent to the new first segment and that has not previously been selected as a segment; and
(xi) repeating steps (i) through (vii); and
if there are not additional segments, the following step is further performed;
(xii) identifying as part of the modified set of audio data the blended data and the audio data from the second segment that is not part of the second segment overlap data;
and creating a modified set of video data, wherein the modified set of video data is associated with the blended audio sample, and wherein the modified set of video data is based upon the modified set of audio data, the correspondence between the modified set of audio data and the original set of audio data, and the correspondence between the original set of audio data and the original set of video data such that the modified set of video data is synchronized with the modified set of audio data.
-
Specification