System and tools for enhanced 3D audio authoring and rendering

US 10,609,506 B2
Filed: 01/23/2019
Issued: 03/31/2020
Est. Priority Date: 07/01/2011
Status: Active Grant

Abstract

Improved tools for authoring and rendering audio reproduction data are provided. Some such authoring tools allow audio reproduction data to be generalized for a wide variety of reproduction environments. Audio reproduction data may be authored by creating metadata for audio objects. The metadata may be created with reference to speaker zones. During the rendering process, the audio reproduction data may be reproduced according to the reproduction speaker layout of a particular reproduction environment.

3 Claims

1. A method, comprising:
  receiving audio reproduction data comprising one or more audio objects and metadata associated with each of the one or more audio objects;
  
  receiving reproduction environment data comprising an indication of a number of reproduction speakers in the reproduction environment and an indication of the location of each reproduction speaker within the reproduction environment; and
  
  rendering the audio objects into one or more speaker feed signals by applying an amplitude panning process to each audio object, wherein the amplitude panning process is based, at least in part, on the metadata associated with each audio object and the location of each reproduction speaker within the reproduction environment, and wherein each speaker feed signal corresponds to at least one of the reproduction speakers within the reproduction environment;
  
  wherein the metadata associated with each audio object includes audio object coordinates indicating the intended reproduction position of the audio object within the reproduction environment and metadata indicating audio object spreads in two or more of three dimensions, wherein the audio object spreads are the same in the two or more dimensions, and wherein the rendering involves controlling the audio object spreads in the two or more dimensions in response to the metadata.

2. An apparatus, comprising:
  an interface system; and
  
  a logic system configured for:
  
  receiving, via the interface system, audio reproduction data comprising one or more audio objects and metadata associated with each of the one or more audio objects;
  
  receiving, via the interface system, reproduction environment data comprising an indication of a number of reproduction speakers in the reproduction environment and an indication of the location of each reproduction speaker within the reproduction environment; and
  
  rendering the audio objects into one or more speaker feed signals by applying an amplitude panning process to each audio object, wherein the amplitude panning process is based, at least in part, on the metadata associated with each audio object and the location of each reproduction speaker within the reproduction environment, and wherein each speaker feed signal corresponds to at least one of the reproduction speakers within the reproduction environment;
  
  wherein the metadata associated with each audio object includes audio object coordinates indicating the intended reproduction position of the audio object within the reproduction environment and metadata indicating audio object spreads in two or more of three dimensions, wherein the audio object spreads are the same in the two or more dimensions, and wherein the rendering involves controlling the audio object spreads in the two or more dimensions in response to the metadata.

3. A non-transitory medium comprising a sequence of instructions, wherein the instructions, when executed by an audio signal processing device, cause the audio signal processing device to perform a method, comprising:
  receiving audio reproduction data comprising one or more audio objects and metadata associated with each of the one or more audio objects;
  
  receiving reproduction environment data comprising an indication of a number of reproduction speakers in the reproduction environment and an indication of the location of each reproduction speaker within the reproduction environment; and
  
  rendering the audio objects into one or more speaker feed signals by applying an amplitude panning process to each audio object, wherein the amplitude panning process is based, at least in part, on the metadata associated with each audio object and the location of each reproduction speaker within the reproduction environment, and wherein each speaker feed signal corresponds to at least one of the reproduction speakers within the reproduction environment;
  
  wherein the metadata associated with each audio object includes audio object coordinates indicating the intended reproduction position of the audio object within the reproduction environment and metadata indicating audio object spreads in two or more of three dimensions, wherein the audio object spreads are the same in the two or more dimensions, and wherein the rendering involves controlling the audio object spreads in the two or more dimensions in response to the metadata.
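
The three steps recited in the claims (receive audio objects with position and spread metadata, receive the speaker count and locations, render speaker feeds by amplitude panning) can be illustrated with a minimal sketch. This is not the patented method or any Dolby implementation: the Gaussian distance weighting, the `BASE_WIDTH` constant, and the names `AudioObject`, `pan_gains`, and `render` are all assumptions chosen for illustration. A single `spread` value stands in for spreads that are the same in two or more dimensions.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]

# Assumed panning width when spread is zero (hypothetical tuning constant).
BASE_WIDTH = 0.5


@dataclass
class AudioObject:
    position: Point            # intended reproduction position (x, y, z)
    spread: float              # one spread value shared by all dimensions
    samples: Sequence[float]   # mono audio samples for this object


def pan_gains(obj: AudioObject, speakers: Sequence[Point]) -> List[float]:
    """Return one gain per speaker, normalized so the squared gains sum to 1."""
    if not speakers:
        return []
    # A larger spread widens the weighting, so more speakers receive signal.
    width = BASE_WIDTH + max(obj.spread, 0.0)
    weights = [
        math.exp(-math.dist(obj.position, s) ** 2 / (2.0 * width ** 2))
        for s in speakers
    ]
    norm = math.sqrt(sum(w * w for w in weights))
    if norm == 0.0:
        # All weights underflowed: fall back to equal power on every speaker.
        return [1.0 / math.sqrt(len(speakers))] * len(speakers)
    return [w / norm for w in weights]


def render(objects: Sequence[AudioObject],
           speakers: Sequence[Point]) -> List[List[float]]:
    """Mix every object into one feed signal per reproduction speaker."""
    length = max((len(o.samples) for o in objects), default=0)
    feeds = [[0.0] * length for _ in speakers]
    for obj in objects:
        gains = pan_gains(obj, speakers)
        for feed, gain in zip(feeds, gains):
            for i, x in enumerate(obj.samples):
                feed[i] += gain * x
    return feeds
```

In this sketch the same audio objects can be passed to `render` with a different `speakers` list, which mirrors the idea that the metadata is authored once and rendered to whatever speaker layout the reproduction environment reports.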

Current Assignee: Dolby Laboratories Licensing Corporation (Dolby Laboratories Incorporated)
Original Assignee: Dolby Laboratories Licensing Corporation (Dolby Laboratories Incorporated)
Inventors: Tsingos, Nicolas R.; Robinson, Charles Q.; Scharpf, Jurgen W.
Primary Examiner(s): King, Simon

Application Number: US16/254,778
Publication Number: US 20190158974A1
Time in Patent Office: 433 Days
CPC Class Codes

H04R 5/02   Spatial or constructional a...

H04S 2400/01   Multi-channel, i.e. more th...

H04S 2400/11   Positioning of individual s...

H04S 3/00   Systems employing more than...

H04S 3/008   in which the audio signals ...

H04S 5/00   Pseudo-stereo systems, e.g....

H04S 7/307   Frequency adjustment, e.g. ...

H04S 7/308   Electronic adaptation depen...

H04S 7/40   Visual indication of stereo...
