Methods and systems for selecting layers of encoded audio signals for teleconferencing

US 9,858,936 B2
Filed: 09/11/2013
Issued: 01/02/2018
Est. Priority Date: 09/21/2012
Status: Active Grant

Abstract

Some embodiments are methods for selecting at least one layer of a spatially layered, encoded audio signal. Typical embodiments are teleconferencing methods in which at least one of a set of nodes (endpoints, each of which is a telephone system, and optionally also a server) is configured to perform audio coding in response to soundfield audio data to generate spatially layered encoded audio including any of a number of different subsets of a set of layers, the set of layers including at least one monophonic layer, at least one soundfield layer, and optionally also at least one metadata layer comprising metadata indicative of at least one processing operation to be performed on the encoded audio. Other aspects are systems configured (e.g., programmed) to perform any embodiment of the method, and computer readable media which store code for implementing any embodiment of the method or steps thereof.
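
The layer structure described in the abstract can be pictured with a small data model. The following is only an illustrative sketch, not the patent's encoding format: the names `LayerKind`, `Layer`, `LayeredFrame`, and `has_audio` are hypothetical, and payloads are opaque bytes standing in for real coded audio.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Set, Tuple


class LayerKind(Enum):
    """Kinds of layer named in the abstract (identifiers are illustrative)."""
    MONO = "mono"
    SOUNDFIELD = "soundfield"
    METADATA = "metadata"


@dataclass(frozen=True)
class Layer:
    kind: LayerKind
    payload: bytes  # opaque stand-in for coded audio or metadata


@dataclass(frozen=True)
class LayeredFrame:
    """One unit of spatially layered encoded audio: any subset of the layers."""
    layers: Tuple[Layer, ...]

    def kinds(self) -> Set[LayerKind]:
        return {layer.kind for layer in self.layers}

    def has_audio(self) -> bool:
        # Assumption: a useful frame carries a monophonic or a soundfield layer.
        return bool(self.kinds() & {LayerKind.MONO, LayerKind.SOUNDFIELD})
```

For example, a frame built from a monophonic layer plus a metadata layer reports both kinds from `kinds()` and returns `True` from `has_audio()`, while a metadata-only frame returns `False`.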

76 Citations

15 Claims

1. A teleconferencing method in which nodes perform audio coding to generate spatially layered encoded audio, the nodes include endpoints, and at least some of the spatially layered encoded audio is transmitted from one of the nodes to at least another one of the nodes, wherein the nodes include a first node which is configured to generate spatially layered encoded audio in response to soundfield audio data, said encoded audio including any of a number of different subsets of a set of layers, said set of layers including at least one monophonic layer and at least one soundfield layer, said method including steps of:
- (a) in the first node, determining a first subset of the set of layers by performing at least one of perceptually-driven layer selection or endpoint-driven layer selection, said first subset including at least one of said monophonic layer or said soundfield layer, wherein said endpoint-driven layer selection includes at least one independent decision by at least one of the endpoints based on at least one analyzed characteristic of said at least one of the endpoints or of audio content captured by said at least one of the endpoints, and wherein said perceptually-driven layer selection is not based on any downstream capability consideration; and
  
- (b) in said first node, generating first spatially layered encoded audio, wherein the first spatially layered encoded audio includes the first subset of the set of layers determined in step (a), and wherein the first spatially layered encoded audio does not include any layer of said set of layers which is not included in said first subset of the set of layers determined in step (a).
- Dependent claims (2, 3, 4, 5, 6, 7):
  - 2. The method of claim 1, wherein each of the endpoints is a telephone system, and step (a) is performed in one of the endpoints.
  - 3. The method of claim 1, wherein the nodes include at least one server, and step (a) is performed in the server.
  - 4. The method of claim 1, wherein the set of layers also includes at least one metadata layer comprising metadata indicative of at least one processing operation to be performed on the encoded audio, and wherein the first subset determined in step (a) includes at least one said metadata layer.
  - 5. The method of claim 1, wherein the nodes include at least one monophonic endpoint, at least one soundfield endpoint, and at least one server, step (b) is performed in one said soundfield endpoint, and said method also includes a step of transmitting the first spatially layered encoded audio to at least one of the server and one said monophonic endpoint.
  - 6. The method of claim 1, wherein step (a) includes selecting said first subset of the set of layers from a spatially layered encoded audio signal, but not selecting any layer of the spatially layered encoded audio signal which is not included in said first subset.
  - 7. The method of claim 6, wherein the nodes include at least one monophonic endpoint, at least one soundfield endpoint, and at least one server, and step (a) includes selecting said first subset of the set of layers, but not any layer of the spatially layered encoded audio signal which is not included in said first subset, for processing in one said monophonic endpoint.
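
Steps (a) and (b) of claim 1 can be sketched in code. This is a minimal illustration under stated assumptions, not the claimed implementation: the selection heuristics (a microphone count for endpoint-driven selection, a `spatial_spread` score for perceptually-driven selection), the 0.5 threshold, the rule of intersecting the two selections when both run, and every function name are hypothetical.

```python
FULL_SET = ("mono", "soundfield", "metadata")  # illustrative layer names


def endpoint_driven_selection(mic_count):
    """Independent decision by an endpoint from an analyzed characteristic of
    itself (hypothetical rule: one microphone cannot capture a soundfield)."""
    return {"mono"} if mic_count < 2 else {"mono", "soundfield"}


def perceptually_driven_selection(spatial_spread):
    """Decision from the captured content only; no downstream capability is
    consulted (hypothetical rule: threshold on a 0..1 spread score)."""
    return {"mono", "soundfield"} if spatial_spread >= 0.5 else {"mono"}


def determine_first_subset(mic_count=None, spatial_spread=None):
    """Step (a): run at least one of the two selection modes."""
    candidates = []
    if mic_count is not None:
        candidates.append(endpoint_driven_selection(mic_count))
    if spatial_spread is not None:
        candidates.append(perceptually_driven_selection(spatial_spread))
    if not candidates:
        raise ValueError("at least one selection mode must be performed")
    # Assumption: when both modes run, keep only layers both agree on.
    return set.intersection(*candidates)


def generate_encoded_audio(layer_payloads, subset):
    """Step (b): emit exactly the selected layers and nothing else."""
    missing = subset - set(layer_payloads)
    if missing:
        raise KeyError(f"selected layers not available: {sorted(missing)}")
    return {name: layer_payloads[name] for name in FULL_SET if name in subset}
```

With these assumed rules, a three-microphone endpoint capturing content with a spread score of 0.2 selects only the monophonic layer, and `generate_encoded_audio` then drops the soundfield and metadata payloads from its output.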

8. A teleconferencing system, including:
- nodes configured to perform audio coding to generate spatially layered encoded audio, wherein the nodes include endpoints, and each of the nodes is coupled to at least one other one of the nodes and configured to transmit at least some of the spatially layered encoded audio to said at least one other one of the nodes, and wherein the nodes include a first node configured to generate spatially layered encoded audio in response to soundfield audio data, said encoded audio including any of a number of different subsets of a set of layers, said set of layers including at least one monophonic layer and at least one soundfield layer, and wherein the first node is configured to determine a first subset of the set of layers by performing at least one of perceptually-driven layer selection or endpoint-driven layer selection, said first subset including at least one of said monophonic layer or said soundfield layer, wherein said endpoint-driven layer selection includes at least one independent decision by at least one of the endpoints based on at least one analyzed characteristic of said at least one of the endpoints or of audio content captured by said at least one of the endpoints, and wherein said perceptually-driven layer selection is not based on any downstream capability consideration, and wherein said first node is configured to generate first spatially layered encoded audio, wherein the first spatially layered encoded audio includes the first subset of the set of layers determined by said first node, and wherein the first spatially layered encoded audio does not include any layer of said set of layers which is not included in said first subset of the set of layers determined by said first node.
- Dependent claims (9, 10, 11, 12, 13, 14, 15):
  - 9. The system of claim 8, wherein each of the endpoints is a telephone system, and the first node is one of the endpoints.
  - 10. The system of claim 8, wherein the nodes include endpoints and at least one server, and the first node is the server.
  - 11. The system of claim 8, wherein the set of layers also includes at least one metadata layer comprising metadata indicative of at least one processing operation to be performed on the encoded audio, and the first subset determined by the first node includes at least one said metadata layer.
  - 12. The system of claim 8, wherein the nodes include at least one monophonic endpoint, at least one soundfield endpoint, and at least one server, the first node is one said soundfield endpoint, and the first node is coupled and configured to transmit the first spatially layered encoded audio to at least one of the server and one said monophonic endpoint.
  - 13. The system of claim 8, wherein the first node is configured to select said first subset of the set of layers from a spatially layered encoded audio signal, without selecting any layer of the spatially layered encoded audio signal which is not included in said first subset.
  - 14. The system of claim 13, wherein the nodes include at least one monophonic endpoint, at least one soundfield endpoint, and at least one server, and the first node is configured to select said first subset of the set of layers, but not any layer of the spatially layered encoded audio signal which is not included in said first subset, for processing in one said monophonic endpoint.
  - 15. The system of claim 13, wherein the nodes include at least one monophonic endpoint, at least one soundfield endpoint, and at least one server, and the first node is the server.
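
The arrangement of claims 13 to 15, in which a server selects layers from an already encoded signal for a monophonic endpoint, might look like the sketch below. It is a hedged illustration only: the endpoint kinds, the dictionary representation of a frame, and the function names `select_for_endpoint` and `fan_out` are assumptions, and the rule that a soundfield endpoint receives every layer is chosen here for simplicity rather than taken from the claims.

```python
def select_for_endpoint(frame, endpoint_kind):
    """Pick the layers of an encoded frame to pass on to one endpoint.

    frame: dict mapping layer name to payload bytes (illustrative format).
    endpoint_kind: "monophonic" or "soundfield" (illustrative labels).
    """
    if endpoint_kind == "monophonic":
        wanted = {"mono"}
    elif endpoint_kind == "soundfield":
        wanted = set(frame)  # assumption: forward every available layer
    else:
        raise ValueError(f"unknown endpoint kind: {endpoint_kind!r}")
    # Layers outside the selected subset are never copied into the output.
    return {name: data for name, data in frame.items() if name in wanted}


def fan_out(frame, endpoints):
    """Server-side sketch: build one layer subset per connected endpoint."""
    return {
        endpoint_id: select_for_endpoint(frame, kind)
        for endpoint_id, kind in endpoints.items()
    }
```

Given a frame holding a monophonic and a soundfield layer, `fan_out` hands a monophonic endpoint only the monophonic payload and hands a soundfield endpoint both, so the server never needs to decode and re-encode the audio to serve mixed endpoints.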

Current Assignee: Dolby Laboratories Licensing Corporation (Dolby Laboratories Incorporated)
Original Assignee: Dolby International AB (Dolby Laboratories Incorporated); Dolby Laboratories Licensing Corporation (Dolby Laboratories Incorporated)
Inventors: Cartwright, Richard James; Dickins, Glenn
Primary Examiner(s): ALBERTALLI, BRIAN LOUIS
Application Number: US14/421,419
Publication Number: US 20150221319A1
Time in Patent Office: 1,574 Days
CPC Class Codes

G10L 19/008   Multichannel audio signal c...

G10L 19/012   Comfort noise or silence co...

G10L 19/02   using spectral analysis, e....

G10L 19/0208   Subband vocoders

G10L 19/032   Quantisation or dequantisat...

G10L 19/22   Mode decision, i.e. based o...

G10L 19/24   Variable rate codecs, e.g. ...

G10L 21/02   Speech enhancement, e.g. no...

G10L 21/0208   Noise filtering

G10L 21/0216   characterised by the method...

H04M 3/56   Arrangements for connecting...
