System and tools for enhanced 3D audio authoring and rendering
First Claim
Patent Images
1. A method, comprising:
- receiving audio reproduction data comprising one or more audio objects and metadata associated with each of the one or more audio objects;
receiving reproduction environment data comprising an indication of a number of reproduction speakers in the reproduction environment and an indication of the location of each reproduction speaker within the reproduction environment; and
rendering the audio objects into one or more speaker feed signals by applying an amplitude panning process to each audio object, wherein the amplitude panning process is based, at least in part, on the metadata associated with each audio object and the location of each reproduction speaker within the reproduction environment, and wherein each speaker feed signal corresponds to at least one of the reproduction speakers within the reproduction environment;
wherein the metadata associated with each audio object includes audio object coordinates indicating the intended reproduction position of the audio object within the reproduction environment and metadata indicating audio object spreads in two or more of three dimensions, wherein the audio object spreads are the same in the two or more dimensions, and wherein the rendering involves controlling the audio object spreads in the two or more dimensions in response to the metadata.
1 Assignment
0 Petitions
Accused Products
Abstract
Improved tools for authoring and rendering audio reproduction data are provided. Some such authoring tools allow audio reproduction data to be generalized for a wide variety of reproduction environments. Audio reproduction data may be authored by creating metadata for audio objects. The metadata may be created with reference to speaker zones. During the rendering process, the audio reproduction data may be reproduced according to the reproduction speaker layout of a particular reproduction environment.
-
Citations
3 Claims
-
1. A method, comprising:
-
receiving audio reproduction data comprising one or more audio objects and metadata associated with each of the one or more audio objects; receiving reproduction environment data comprising an indication of a number of reproduction speakers in the reproduction environment and an indication of the location of each reproduction speaker within the reproduction environment; and rendering the audio objects into one or more speaker feed signals by applying an amplitude panning process to each audio object, wherein the amplitude panning process is based, at least in part, on the metadata associated with each audio object and the location of each reproduction speaker within the reproduction environment, and wherein each speaker feed signal corresponds to at least one of the reproduction speakers within the reproduction environment; wherein the metadata associated with each audio object includes audio object coordinates indicating the intended reproduction position of the audio object within the reproduction environment and metadata indicating audio object spreads in two or more of three dimensions, wherein the audio object spreads are the same in the two or more dimensions, and wherein the rendering involves controlling the audio object spreads in the two or more dimensions in response to the metadata.
-
-
2. An apparatus, comprising:
-
an interface system; and a logic system configured for; receiving, via the interface system, audio reproduction data comprising one or more audio objects and metadata associated with each of the one or more audio objects; receiving, via the interface system, reproduction environment data comprising an indication of a number of reproduction speakers in the reproduction environment and an indication of the location of each reproduction speaker within the reproduction environment; and rendering the audio objects into one or more speaker feed signals by applying an amplitude panning process to each audio object, wherein the amplitude panning process is based, at least in part, on the metadata associated with each audio object and the location of each reproduction speaker within the reproduction environment, and wherein each speaker feed signal corresponds to at least one of the reproduction speakers within the reproduction environment; wherein the metadata associated with each audio object includes audio object coordinates indicating the intended reproduction position of the audio object within the reproduction environment and metadata indicating audio object spreads in two or more of three dimensions, wherein the audio object spreads are the same in the two or more dimensions, and wherein the rendering involves controlling the audio object spreads in the two or more dimensions in response to the metadata.
-
-
3. A non-transitory medium comprising a sequence of instructions, wherein the instructions, when executed by an audio signal processing device, cause the audio signal processing device to perform a method, comprising:
-
receiving audio reproduction data comprising one or more audio objects and metadata associated with each of the one or more audio objects; receiving reproduction environment data comprising an indication of a number of reproduction speakers in the reproduction environment and an indication of the location of each reproduction speaker within the reproduction environment; and rendering the audio objects into one or more speaker feed signals by applying an amplitude panning process to each audio object, wherein the amplitude panning process is based, at least in part, on the metadata associated with each audio object and the location of each reproduction speaker within the reproduction environment, and wherein each speaker feed signal corresponds to at least one of the reproduction speakers within the reproduction environment; wherein the metadata associated with each audio object includes audio object coordinates indicating the intended reproduction position of the audio object within the reproduction environment and metadata indicating audio object spreads in two or more of three dimensions, wherein the audio object spreads are the same in the two or more dimensions, and wherein the rendering involves controlling the audio object spreads in the two or more dimensions in response to the metadata.
-
Specification