System and tools for enhanced 3D audio authoring and rendering
Abstract
Improved tools for authoring and rendering audio reproduction data are provided. Some such authoring tools allow audio reproduction data to be generalized for a wide variety of reproduction environments. Audio reproduction data may be authored by creating metadata for audio objects. The metadata may be created with reference to speaker zones. During the rendering process, the audio reproduction data may be reproduced according to the reproduction speaker layout of a particular reproduction environment.
20 Claims
1. A method, comprising:
receiving audio reproduction data comprising one or more audio objects and metadata associated with each of the one or more audio objects;
receiving reproduction environment data comprising an indication of a number of reproduction speakers in the reproduction environment and an indication of the location of each reproduction speaker within the reproduction environment; and
rendering the audio objects into one or more speaker feed signals by applying an amplitude panning process to each audio object, wherein the amplitude panning process is based, at least in part, on the metadata associated with each audio object and the location of each reproduction speaker within the reproduction environment, and wherein each speaker feed signal corresponds to at least one of the reproduction speakers within the reproduction environment;
wherein the metadata associated with each audio object includes audio object coordinates indicating the intended reproduction position of the audio object within the reproduction environment and a snap flag indicating whether the amplitude panning process should render the audio object into a single speaker feed signal or apply panning rules to render the audio object into a plurality of speaker feed signals.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. An apparatus, comprising:
an interface system; and
a logic system configured for:
receiving, via the interface system, audio reproduction data comprising one or more audio objects and metadata associated with each of the one or more audio objects;
receiving, via the interface system, reproduction environment data comprising an indication of a number of reproduction speakers in the reproduction environment and an indication of the location of each reproduction speaker within the reproduction environment; and
rendering the audio objects into one or more speaker feed signals by applying an amplitude panning process to each audio object, wherein the amplitude panning process is based, at least in part, on the metadata associated with each audio object and the location of each reproduction speaker within the reproduction environment, and wherein each speaker feed signal corresponds to at least one of the reproduction speakers within the reproduction environment;
wherein the metadata associated with each audio object includes audio object coordinates indicating the intended reproduction position of the audio object within the reproduction environment and a snap flag indicating whether the amplitude panning process should render the audio object into a single speaker feed signal or apply panning rules to render the audio object into a plurality of speaker feed signals.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18, 19
20. A non-transitory medium having software stored thereon, the software including instructions for performing the following operations:
receiving audio reproduction data comprising one or more audio objects and metadata associated with each of the one or more audio objects;
receiving reproduction environment data comprising an indication of a number of reproduction speakers in the reproduction environment and an indication of the location of each reproduction speaker within the reproduction environment; and
rendering the audio objects into one or more speaker feed signals by applying an amplitude panning process to each audio object, wherein the amplitude panning process is based, at least in part, on the metadata associated with each audio object and the location of each reproduction speaker within the reproduction environment, and wherein each speaker feed signal corresponds to at least one of the reproduction speakers within the reproduction environment;
wherein the metadata associated with each audio object includes audio object coordinates indicating the intended reproduction position of the audio object within the reproduction environment and a snap flag indicating whether the amplitude panning process should render the audio object into a single speaker feed signal or apply panning rules to render the audio object into a plurality of speaker feed signals.
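The rendering step recited in the independent claims can be illustrated with a minimal Python sketch. It is not the patented implementation, and several parts are assumptions made only for illustration: the names `AudioObject`, `panning_gains` and `render` are hypothetical, a snapped object is routed to the nearest speaker (the claims only require a single speaker feed), and inverse-distance weights normalized to constant power stand in for the unspecified "panning rules".

```python
import math
from dataclasses import dataclass


@dataclass
class AudioObject:
    samples: list    # mono signal for this object
    position: tuple  # (x, y, z) intended reproduction position
    snap: bool       # snap flag carried in the object's metadata


def panning_gains(obj, speakers):
    """Return one gain per speaker for a single audio object."""
    dists = [math.dist(obj.position, spk) for spk in speakers]
    if obj.snap:
        # Snap flag set: send the object to a single speaker feed.
        # Choosing the nearest speaker is an assumption of this sketch.
        nearest = dists.index(min(dists))
        return [1.0 if i == nearest else 0.0 for i in range(len(speakers))]
    # Snap flag clear: spread the object over several speaker feeds.
    # Inverse-distance weights are a placeholder panning rule.
    weights = [1.0 / (d + 1e-9) for d in dists]
    norm = math.sqrt(sum(w * w for w in weights))  # constant-power scaling
    return [w / norm for w in weights]


def render(objects, speakers):
    """Mix all objects into one feed signal per reproduction speaker."""
    length = max(len(o.samples) for o in objects)
    feeds = [[0.0] * length for _ in speakers]
    for obj in objects:
        for feed, gain in zip(feeds, panning_gains(obj, speakers)):
            for t, sample in enumerate(obj.samples):
                feed[t] += gain * sample
    return feeds
```

Here `speakers` is a list of (x, y, z) positions taken from the reproduction environment data, so the same objects and metadata can be rendered to different speaker layouts by changing only that list.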
Specification