Methods and systems for selecting layers of encoded audio signals for teleconferencing
Abstract
Some embodiments are methods for selecting at least one layer of a spatially layered, encoded audio signal. Typical embodiments are teleconferencing methods in which at least one of a set of nodes (endpoints, each of which is a telephone system, and optionally also a server) is configured to perform audio coding in response to soundfield audio data to generate spatially layered encoded audio including any of a number of different subsets of a set of layers, the set of layers including at least one monophonic layer, at least one soundfield layer, and optionally also at least one metadata layer comprising metadata indicative of at least one processing operation to be performed on the encoded audio. Other aspects are systems configured (e.g., programmed) to perform any embodiment of the method, and computer readable media which store code for implementing any embodiment of the method or steps thereof.
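As an illustration only, the layered structure described in the abstract might be modeled as a set of named layers from which any subset can be kept. The sketch below is not taken from the patent: the names `LayerKind`, `EncodedFrame`, and `strip_layers` are hypothetical, and the byte payloads are placeholders for real encoded data.

```python
from dataclasses import dataclass
from enum import Enum


class LayerKind(Enum):
    # The three layer types named in the abstract (identifiers are hypothetical).
    MONO = "monophonic"
    SOUNDFIELD = "soundfield"
    METADATA = "metadata"


@dataclass(frozen=True)
class EncodedFrame:
    # Maps each layer present in this frame to its opaque encoded payload.
    layers: dict


def strip_layers(frame, keep):
    """Return a new frame containing only the layers listed in `keep`."""
    return EncodedFrame({k: v for k, v in frame.layers.items() if k in keep})


# A frame carrying the full set of layers (payloads are placeholders).
full = EncodedFrame({
    LayerKind.MONO: b"m",
    LayerKind.SOUNDFIELD: b"s",
    LayerKind.METADATA: b"d",
})

# One of the possible subsets: the monophonic layer alone.
mono_only = strip_layers(full, {LayerKind.MONO})
print(sorted(kind.value for kind in mono_only.layers))
```

Representing a frame as a mapping from layer kind to payload makes "any of a number of different subsets" a simple key filter, which is one plausible reading of the abstract rather than the patent's actual bitstream format.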
15 Claims
1. A teleconferencing method in which nodes perform audio coding to generate spatially layered encoded audio, the nodes include endpoints, and at least some of the spatially layered encoded audio is transmitted from one of the nodes to at least another one of the nodes, wherein the nodes include a first node which is configured to generate spatially layered encoded audio in response to soundfield audio data, said encoded audio including any of a number of different subsets of a set of layers, said set of layers including at least one monophonic layer and at least one soundfield layer, said method including steps of:
(a) in the first node, determining a first subset of the set of layers by performing at least one of perceptually-driven layer selection or endpoint-driven layer selection, said first subset including at least one of said monophonic layer or said soundfield layer, wherein said endpoint-driven layer selection includes at least one independent decision by at least one of the endpoints based on at least one analyzed characteristic of said at least one of the endpoints or of audio content captured by said at least one of the endpoints, and wherein said perceptually-driven layer selection is not based on any downstream capability consideration; and
(b) in said first node, generating first spatially layered encoded audio, wherein the first spatially layered encoded audio includes the first subset of the set of layers determined in step (a), and wherein the first spatially layered encoded audio does not include any layer of said set of layers which is not included in said first subset of the set of layers determined in step (a).
Dependent claims: 2, 3, 4, 5, 6, 7.
8. A teleconferencing system, including:
nodes configured to perform audio coding to generate spatially layered encoded audio, wherein the nodes include endpoints, and each of the nodes is coupled to at least one other one of the nodes and configured to transmit at least some of the spatially layered encoded audio to said at least one other one of the nodes, and wherein the nodes include a first node configured to generate spatially layered encoded audio in response to soundfield audio data, said encoded audio including any of a number of different subsets of a set of layers, said set of layers including at least one monophonic layer and at least one soundfield layer, and wherein the first node is configured to determine a first subset of the set of layers by performing at least one of perceptually-driven layer selection or endpoint-driven layer selection, said first subset including at least one of said monophonic layer or said soundfield layer, wherein said endpoint-driven layer selection includes at least one independent decision by at least one of the endpoints based on at least one analyzed characteristic of said at least one of the endpoints or of audio content captured by said at least one of the endpoints, and wherein said perceptually-driven layer selection is not based on any downstream capability consideration, and wherein said first node is configured to generate first spatially layered encoded audio, wherein the first spatially layered encoded audio includes the first subset of the set of layers determined by said first node, and wherein the first spatially layered encoded audio does not include any layer of said set of layers which is not included in said first subset of the set of layers determined by said first node.
Dependent claims: 9, 10, 11, 12, 13, 14, 15.
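The selection and encoding steps recited in the independent claims can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the decision rules, the thresholds, the inputs `spatial_diversity` and `capture_snr_db`, and the choice to combine both selections by set intersection are all hypothetical. The only properties taken from the claims are that the perceptual rule uses no downstream capability information, that the endpoint rule depends on a characteristic the endpoint analyzes itself, and that the encoded output contains exactly the selected layers.

```python
from enum import Enum


class Layer(Enum):
    MONO = "mono"
    SOUNDFIELD = "soundfield"


def perceptual_selection(spatial_diversity, threshold=0.5):
    """Hypothetical perceptual rule: send the soundfield layer only when the
    captured scene has enough spatial content to matter to a listener.
    No downstream capability information is consulted."""
    if spatial_diversity >= threshold:
        return {Layer.MONO, Layer.SOUNDFIELD}
    return {Layer.MONO}


def endpoint_selection(capture_snr_db, min_snr_db=15.0):
    """Hypothetical endpoint rule: the endpoint decides on its own, from an
    analyzed characteristic of its captured audio (here an assumed SNR
    estimate), whether a soundfield layer is worth sending."""
    if capture_snr_db >= min_snr_db:
        return {Layer.MONO, Layer.SOUNDFIELD}
    return {Layer.MONO}


def select_layers(spatial_diversity, capture_snr_db):
    # Step (a): determine the subset. Intersecting the two selections is an
    # assumption; the claims require only "at least one of" the two methods.
    return perceptual_selection(spatial_diversity) & endpoint_selection(capture_snr_db)


def encode(payloads, subset):
    # Step (b): the output includes every selected layer and no other layer.
    return {layer: payloads[layer] for layer in subset}


# Placeholder payloads standing in for real encoded layer data.
payloads = {Layer.MONO: b"m", Layer.SOUNDFIELD: b"s"}
subset = select_layers(spatial_diversity=0.9, capture_snr_db=5.0)
frame = encode(payloads, subset)
print(sorted(layer.value for layer in frame))
```

Because both hypothetical rules always keep the monophonic layer, their intersection is never empty, which satisfies the requirement that the subset include at least one of the monophonic or soundfield layers.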
Specification