Real-time control of playback rates in presentations
First Claim
1. An apparatus containing a data structure representing a presentation, the data structure comprising:
- a first audio channel representing an audio portion of the presentation after time scaling by a first time scale factor; and
a second audio channel representing the audio portion after time scaling by a second time scale factor that differs from the first time scale factor.
1 Assignment
0 Petitions
Accused Products
Abstract
Media encoding, transmission, and playback processes and structures employ a multi-channel architecture with different audio channels corresponding to different playback rates for a presentation to be transmitted over a network. Audio frames in the various audio channels all correspond to the same amount of time in the original presentation and have frame indexes that identify in the different audio channels the frames corresponding to the same time interval in the presentation. A user can make a real-time change in playback rate causing selection of a channel corresponding to the new playback rate and a frame required for prompt and smooth transition in the playback rate of the presentation. The architecture can additionally provide channels for graphics data such as image data that are displayed according to the index of the audio, and different audio channels with the same playback rate but different compression schemes for use according to available bandwidth on the network.
247 Citations
36 Claims
-
1. An apparatus containing a data structure representing a presentation, the data structure comprising:
-
a first audio channel representing an audio portion of the presentation after time scaling by a first time scale factor; and
a second audio channel representing the audio portion after time scaling by a second time scale factor that differs from the first time scale factor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 13, 15, 16, 17, 18, 20, 21, 22, 23, 25, 27, 28, 29)
-
-
12. An apparatus containing a data structure representing an audio presentation, the data structure comprising a plurality of audio channels representing the audio presentation after time scaling, wherein:
-
each audio channel has a corresponding time scale factor and includes a plurality of audio frames; and
each audio frame has a frame index that uniquely distinguishes the audio frame from other audio frames in the same channel and identifies the audio frame as corresponding to specific audio frames in other audio channels.
-
-
14. A method for encoding audio data, comprising:
-
performing a plurality of time scaling processes on the audio data to generate a plurality of time-scaled audio data sets, each time-scaled audio data set having a different time scale factor; and
generating a data structure containing a plurality of audio channels respectively corresponding to the plurality of time scaling processes, wherein content of each of the audio channels is derived from the time-scaled audio data set resulting from performing the corresponding time scaling process on the audio data.
-
-
19. A method for playing a presentation, comprising:
-
loading a first frame from a source into a player via a network, the first frame representing a first portion of the presentation after scaling by a first time-scaling factor, wherein the first audio frame has a first channel index value that identifies the first audio frame as being scaled by the first time scaling factor;
playing the first portion of the presentation based on data from the first audio frame;
receiving a request to change playing from the first time scaling factor to a second time scaling factor;
requesting from the source a second audio frame that has a second channel index value that identifies the second frame as being scaled by the second time-scaling factor; and
playing the second frame after the first to provide a real-time change in the time-scale of the presentation.
-
-
24. A method for playing an audio presentation on a receiver that is connected via a network to a source having a multi-channel data structure representing the audio presentation, the method comprising:
-
determining available bandwidth on the network;
selecting a first channel of the multi-channel data structure from a plurality of channels that represent the audio presentation after time-scaling by a desired time-scaling factor, wherein the first channel contains data that is compressed using a compression process that provides highest audio quality at the available bandwidth;
receiving a first frame from the first channel; and
playing the first frame.
-
-
26. A method for controlling display of web pages, comprising:
-
assigning a series of web pages to respective index values of audio data that represent an audio portion of a presentation;
playing audio generated from the audio data; and
displaying each web page in response to the playing reaching in the audio data an index value assigned to the web page.
-
-
30. A method for authoring a presentation for playback on a computing system, comprising:
-
assigning time index values to audio data for the presentation;
assigning a range of the time index values to each image represented by graphics data for the presentation; and
constructing a file containing the audio data and the graphics data, wherein the file has a format indicating display of each image occurs during playing of the audio data that has assigned time index values in the range assigned to the image. - View Dependent Claims (31, 32, 33, 34, 35, 36)
-
Specification