Terminal for composing and presenting MPEG-4 video programs
Abstract
A method and apparatus for composing and presenting multimedia programs using the MPEG-4 standard at a multimedia terminal (100). A composition engine (120) maintains and updates a scene graph (124) of the current objects, including their relative position in a scene and their characteristics, and provides a corresponding list of objects (126) to be displayed to a presentation engine (150). In response, the presentation engine begins to retrieve the corresponding decoded object data that is stored in respective composition buffers (176, ..., 186). The presentation engine assembles the decoded objects to provide a scene for presentation on output devices such as a video monitor (240) and speakers (242), or for storage. A terminal manager (110) receives user commands and causes the composition engine to update the scene graph and list of objects accordingly. The terminal manager also forwards the information contained in the object descriptors to a scene decoder (122) at the composition engine. Preferably, the composition and the presentation of the content are controlled using separate control threads to allow the presentation engine to retrieve and process the decoded object data while the composition engine is recovering additional scene description information and/or object descriptors.
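The separate control threads mentioned in the abstract can be illustrated with a minimal sketch. Everything named here is a hypothetical stand-in chosen for illustration (`composition_buffers`, `composition_thread`, `presentation_thread`, `run_demo`, and the queue-based hand-off are assumptions, not details taken from the patent): one thread keeps updating a scene graph and publishing the list of objects to display, while a second thread picks up each list and retrieves the matching decoded data.

```python
import queue
import threading

# Hypothetical stand-in for the composition buffers that hold decoded objects.
composition_buffers = {
    "video_1": "decoded video frame",
    "audio_1": "decoded audio frame",
}


def composition_thread(scene_updates, display_lists):
    """Maintain a scene graph and publish the list of objects to display."""
    scene_graph = {}
    for update in scene_updates:
        scene_graph.update(update)              # apply new scene description data
        display_lists.put(sorted(scene_graph))  # publish a snapshot of object ids
    display_lists.put(None)                     # sentinel: no more updates


def presentation_thread(display_lists, rendered):
    """Fetch decoded data for each listed object and assemble a scene."""
    while True:
        object_ids = display_lists.get()
        if object_ids is None:
            break
        rendered.append([composition_buffers[oid] for oid in object_ids])


def run_demo():
    """Run both threads over two illustrative scene updates."""
    display_lists = queue.Queue()
    rendered = []
    updates = [{"video_1": {"x": 0, "y": 0}}, {"audio_1": {"volume": 1.0}}]
    composer = threading.Thread(
        target=composition_thread, args=(updates, display_lists))
    presenter = threading.Thread(
        target=presentation_thread, args=(display_lists, rendered))
    composer.start()
    presenter.start()
    composer.join()
    presenter.join()
    return rendered
```

The queue decouples the two threads, so the presenter can process one list while the composer is already working on the next scene update. This is one plausible arrangement under the stated assumptions, not necessarily the mechanism the specification describes.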
18 Claims
1. A terminal for receiving and processing a multimedia data bitstream, comprising:
a terminal manager;
a composition engine;
a plurality of content decoders; and
a presentation engine;
wherein:
said content decoders recover and decode multimedia objects from respective elementary streams of the bitstream;
said multimedia objects comprising at least one of video objects and audio objects for presentation in a multimedia scene;
said composition engine recovers scene description information from the bitstream that defines specific ones of the recovered multimedia objects that are to be provided in the multimedia scene, and characteristics of the recovered multimedia objects in the multimedia scene;
said terminal manager recovers object descriptor information from the bitstream that associates said recovered multimedia objects with respective ones of said elementary streams, and provides the recovered object descriptor information to said composition engine;
said composition engine is responsive to said recovered object descriptor information provided thereto and said recovered scene description information for creating a list of said specific ones of the recovered multimedia objects that are to be displayed in said multimedia scene; and
said presentation engine obtains said list from said composition engine, and, in response thereto, retrieves the corresponding decoded multimedia objects from said content decoders to provide data corresponding to the multimedia scene to an output device.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A terminal for receiving and processing a multimedia data bitstream, comprising:
decoding means for recovering and decoding multimedia objects from respective elementary streams of the bitstream;
said multimedia objects comprising at least one of video objects and audio objects for presentation in a multimedia scene;
composing means for recovering scene description information from the bitstream that defines specific ones of the recovered multimedia objects that are to be provided in the multimedia scene, and characteristics of the recovered multimedia objects in the multimedia scene;
managing means for recovering object descriptor information from the bitstream that associates said recovered multimedia objects with respective ones of said elementary streams, and providing the recovered object descriptor information to said composing means;
said composing means being responsive to said recovered object descriptor information provided thereto and said recovered scene description information for creating a list of said specific ones of the recovered multimedia objects that are to be displayed in said multimedia scene; and
presenting means for obtaining said list from said composing means, and, in response thereto, retrieving the corresponding decoded multimedia objects from said decoding means to provide data corresponding to the multimedia scene to an output device.
14. A method for receiving and processing a multimedia data bitstream at a terminal, comprising the steps of:
recovering and decoding multimedia objects from respective elementary streams of the bitstream at respective content decoders;
said multimedia objects comprising at least one of video and audio objects for presentation in a multimedia scene;
recovering scene description information from the bitstream that defines specific ones of the recovered multimedia objects that are to be provided in the multimedia scene, and characteristics of the recovered multimedia objects in the multimedia scene;
recovering object descriptor information from the bitstream that associates said recovered multimedia objects with respective ones of said elementary streams;
creating a list of said specific ones of the recovered multimedia objects that are to be displayed in said multimedia scene in response to said recovered object descriptor information and said recovered scene description information; and
retrieving the corresponding decoded multimedia objects in response to the list to provide data corresponding to the multimedia scene to an output device.
Dependent claims: 15, 16, 17, 18
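As a rough illustration of how the steps of the method claim relate to one another, the following sketch models object descriptors as a mapping from objects to elementary streams and builds the display list from them. All class, field, and function names (`ObjectDescriptor`, `SceneNode`, `decode_streams`, `create_display_list`, `present`) are assumptions made for this example, and decoding is faked with a string; nothing here reflects actual MPEG-4 syntax or the patent's implementation.

```python
from dataclasses import dataclass


@dataclass
class ObjectDescriptor:
    object_id: int   # identifier referenced by the scene description
    stream_id: int   # elementary stream that carries the object's data


@dataclass
class SceneNode:
    object_id: int
    position: tuple  # one example of a "characteristic" of the object


def decode_streams(elementary_streams):
    """Recover and decode one object per elementary stream (decoding is faked)."""
    return {sid: f"decoded({payload})"
            for sid, payload in elementary_streams.items()}


def create_display_list(scene_nodes, descriptors):
    """List the scene's objects, each paired with its elementary stream."""
    stream_of = {d.object_id: d.stream_id for d in descriptors}
    return [(n.object_id, stream_of[n.object_id], n.position)
            for n in scene_nodes if n.object_id in stream_of]


def present(display_list, decoded):
    """Retrieve the decoded data for each listed object."""
    return [{"object": oid, "position": pos, "data": decoded[sid]}
            for oid, sid, pos in display_list]


# Illustrative input: two streams are decoded, but the scene uses only object 1.
streams = {101: "video-es", 102: "audio-es"}
descriptors = [ObjectDescriptor(1, 101), ObjectDescriptor(2, 102)]
scene = [SceneNode(1, (0, 0))]

scene_out = present(create_display_list(scene, descriptors),
                    decode_streams(streams))
```

In this sketch the scene description selects which objects appear and the object descriptors only resolve where their data comes from, which mirrors the division of roles recited in the claim.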
Specification