Coding method and apparatus for multiple channels of audio information representing three-dimensional sound fields
First Claim
1. An apparatus for encoding audio information for a plurality of audio channels comprising:
- one or more input terminals receiving subband signals for the plurality of audio channels,a coarse-level quantizer coupled to the one or more input terminals, wherein the coarse-level quantizer derives coarse measures of subband signal contents,a subband-allocation estimator coupled to the coarse-level quantizer, wherein the subband-allocation estimator derives an estimated number of bits to quantize the subband signals in response to the coarse measure of subband signal contents,a steering controller coupled to the one or more input terminals and to the subband-allocation estimator, wherein the steering controller generates composite representations of subband signals in respective frequency subbands for a plurality of the audio channels in response to the estimated number of bits, and generates steering control information representing levels of the subband signals represented by the composite representations,an adaptive-bit allocator coupled to the coarse-level quantizer and to the steering controller, wherein the adaptive-bit allocator allocates bits to the composite representations and to subband signals not represented by the composite representations in response to the coarse measure of subband contents and the steering control information, anda quantizer coupled to the one or more input terminals and to the steering controller, wherein the quantizer quantizes the composite representations and quantizes subband signals not represented by the composite representations.
2 Assignments
0 Petitions
Accused Products
Abstract
In an encoder, multiple channels of audio information representing multidimensional sound fields are split into subband signals and the subband signals in one or more subbands are combined to form composite signals. The composite signals, the subband signals not combined into a composite signal and information describing the spectral levels of subband signals combined into composite signals are assembled into an encoded output signal. The spectral level information conveys either the amplitude or power of the combined subband signals or the apparent direction of the sound field represented by the combined subband signals. In digital implementations, adaptive bit allocation may be used to reduce the informational requirements of the encoded signal.
486 Citations
26 Claims
-
1. An apparatus for encoding audio information for a plurality of audio channels comprising:
-
one or more input terminals receiving subband signals for the plurality of audio channels, a coarse-level quantizer coupled to the one or more input terminals, wherein the coarse-level quantizer derives coarse measures of subband signal contents, a subband-allocation estimator coupled to the coarse-level quantizer, wherein the subband-allocation estimator derives an estimated number of bits to quantize the subband signals in response to the coarse measure of subband signal contents, a steering controller coupled to the one or more input terminals and to the subband-allocation estimator, wherein the steering controller generates composite representations of subband signals in respective frequency subbands for a plurality of the audio channels in response to the estimated number of bits, and generates steering control information representing levels of the subband signals represented by the composite representations, an adaptive-bit allocator coupled to the coarse-level quantizer and to the steering controller, wherein the adaptive-bit allocator allocates bits to the composite representations and to subband signals not represented by the composite representations in response to the coarse measure of subband contents and the steering control information, and a quantizer coupled to the one or more input terminals and to the steering controller, wherein the quantizer quantizes the composite representations and quantizes subband signals not represented by the composite representations. - View Dependent Claims (2, 3, 4, 5)
-
-
6. An apparatus for decoding input information representing a plurality of audio channels comprising:
-
one or more input terminals receiving the input information, wherein the input information comprises coarse measures of frequency subband content, quantized subband signals for a respective frequency subband for the plurality of audio channels, a quantized composite representation of subband signals in another respective frequency subband for the plurality of audio channels, and steering control information, an adaptive bit-allocation calculator coupled to the one or more input terminals, wherein the adaptive bit-allocation calculator determines numbers of bits to allocate to the quantized subband signals and the quantized composite representation of subband signals in response to the steering control information and the coarse measures of frequency subband content, a dequantizer coupled to the one or more input terminals and to the adaptive bit-allocation calculator, wherein the dequantizer dequantizes the quantized subband signals and the quantized composite representation of subband signals according to the numbers of bits determined by the adaptive bit-allocation calculator to generate dequantized subband signals and a dequantized composite representation of subband signals, respectively, and an inverse steering controller coupled to the dequantizer and to the one or more input terminals, wherein the inverse steering controller generates a plurality of subband signals in response to the dequantized composite representation and the steering control information. - View Dependent Claims (7, 8, 9)
-
-
10. An apparatus for encoding audio information, wherein the apparatus comprises:
-
means for generating a discrete representation of subband signals in each respective frequency subband in a first set of frequency subbands for three or more channels of audio information representing a three-dimensional sound field, means for generating an aggregate representation of subband signals in each respective frequency subband in a second set of frequency subbands for the three or more of channels of audio information, wherein the aggregate representation for a respective frequency subband includes a composite audio-information signal and a steering control signal, and wherein the composite audio-information signal and the steering signal correspond to levels of the subband signals in the respective frequency subband, and means for encoding the discrete representations of subband signals and the aggregate representations of subband signals. - View Dependent Claims (11, 12)
-
-
13. An apparatus for decoding input information, wherein the apparatus comprises:
-
means for receiving and decoding the input information to obtain discrete representations of subband signals in each respective frequency subband in a first set of frequency subbands for three or more channels of audio information representing a three-dimensional soundfield, and to obtain aggregate representations of subband signals in each respective frequency subband in a second set of frequency subbands for the three or more channels of audio information, means for generating discrete subband signals in the second set of frequency subbands for the three or more channels of audio information from the aggregate representations, wherein the aggregate representation for a respective frequency subband includes a composite audio-information signal and a steering control signal, and wherein the composite audio-information signal and the steering signal correspond to levels of subband signals in the respective frequency subband for three or more audio channels, and means for generating three or more channels of audio information representing a three dimensional sound field from discrete subband signals for the first set of frequency subbands and from discrete subband signals for the second set of frequency subbands. - View Dependent Claims (14)
-
-
15. A method for encoding audio information, the method comprising:
-
receiving input signals conveying three or more channels of audio information representing a three-dimensional sound field, applying one or more filters or one or more transforms to the input signals to generate a plurality of subband signals for each of the channels of audio information, combining subband signals to generate composite signals, wherein a respective composite signal is generated by combining subband signals in a respective frequency subband and at least one composite signal is generated by combining subband signals for at least three of the channels of audio information, generating a steering control signal for each respective composite signal representing levels of the subband signals that are combined to generate that respective composite signal, allocating a number of bits to and quantizing the composite signals and the subband signals not combined into composite signals to generate encoded information, and assembling the encoded information and information representing the steering control signals into an output signal. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
-
22. A method for decoding an input signal conveying an encoded representation of audio information, the method comprising:
-
disassembling an input signal to obtain encoded information and steering control signals, allocating a number of bits to and dequantizing the encoded information to obtain composite signals and subband signals, generating subband signals from the composite signals, wherein subband signals are generated for three or more audio channels in a respective frequency subband from a respective composite signal for that respective frequency subband according to a respective steering control signal that represents levels of subband signals that were combined to generate that respective composite signal, and applying one or more synthesis filters or one or more inverse transforms to the subband signals obtained from the input signal and generated from composite signals to generate three or more channels of audio information representing a three dimensional sound field. - View Dependent Claims (23, 24, 25, 26)
-
Specification