Real-time control of playback rates in presentations
First Claim
1. An apparatus containing a data structure representing a presentation, the data structure comprising:
- a first audio channel representing an audio portion of the presentation after time scaling by a first time scale factor, wherein the first audio channel comprises a plurality of frames;
a second audio channel representing the audio portion after time scaling by a second time scale factor that differs from the first time scale factor, wherein the second audio channel comprises a plurality of frames that are in one-to-one correspondence with the plurality of frames in the first audio channel, and corresponding frames in the first and second audio channels represent the same time interval of the presentation;
wherein each frame in the first audio channel is separately compressed using a first compression method; and
wherein the data structure further comprises a third audio channel representing the audio portion of the presentation after time scaling by the first time scale factor,wherein each frame in the third audio channel is separately compressed using a second compression method.
1 Assignment
0 Petitions
Accused Products
Abstract
Media encoding, transmission, and playback processes and structures employ a multi-channel architecture with different audio channels corresponding to different playback rates for a presentation to be transmitted over a network. Audio frames in the various audio channels all correspond to the same amount of time in the original presentation and have frame indexes that identify in the different audio channels the frames corresponding to the same time interval in the presentation. A user can make a real-time change in playback rate causing selection of a channel corresponding to the new playback rate and a frame required for prompt and smooth transition in the playback rate of the presentation. The architecture can additionally provide channels for graphics data such as image data that are displayed according to the index of the audio, and different audio channels with the same playback rate but different compression schemes for use according to available bandwidth on the network.
-
Citations
10 Claims
-
1. An apparatus containing a data structure representing a presentation, the data structure comprising:
-
a first audio channel representing an audio portion of the presentation after time scaling by a first time scale factor, wherein the first audio channel comprises a plurality of frames; a second audio channel representing the audio portion after time scaling by a second time scale factor that differs from the first time scale factor, wherein the second audio channel comprises a plurality of frames that are in one-to-one correspondence with the plurality of frames in the first audio channel, and corresponding frames in the first and second audio channels represent the same time interval of the presentation; wherein each frame in the first audio channel is separately compressed using a first compression method; and wherein the data structure further comprises a third audio channel representing the audio portion of the presentation after time scaling by the first time scale factor, wherein each frame in the third audio channel is separately compressed using a second compression method. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method for encoding audio data, comprising:
-
performing a plurality of time scaling processes on the audio data to generate a plurality of time-scaled audio data sets, each time-scaled audio data set having a different time scale factor; partitioning each time-scaled audio data set into a plurality of frames, wherein all frames resulting from the partitioning correspond to the same amount of time in the audio data; separately compressing each frame to produce compressed frames; and collecting the compressed frames into a plurality of audio channels that form a data structure, each audio channel having a corresponding one of the different time scale factors; wherein separately compressing each frame comprises applying a plurality of different compression processes to generate a plurality of compressed frames from each frame. - View Dependent Claims (10)
-
Specification