Audio coding systems and methods using spectral component coupling and spectral component regeneration
First Claim
1. A method for encoding a plurality of input audio signals, wherein the method comprises:
- receiving the plurality of input audio signals and obtaining therefrom one or more baseband signals and one or more residual signals, wherein spectral components of a baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and spectral components in an associated residual signal represent spectral components of the respective input audio signal in a second set of frequency subbands that are not represented by the baseband signal;
obtaining energy measures of at least some spectral components of one or more synthesized signals to be generated during decoding, wherein the one or more synthesized signals have spectral components within the second set of frequency subbands;
obtaining energy measures of at least some spectral components of each residual signal;
obtaining from the plurality of input audio signals a coupled-channel signal having spectral components representing a composite of spectral components of two or more of the input audio signals in a third sec of frequency subbands;
obtaining energy measures of at least some spectral components of the coupled-channel signal;
obtaining energy measures of at least some of the spectral components of the two or more input audio signals represented by the coupled-channel signal in the third set of frequency subbands;
calculating coupling scale factors by obtaining square roots of ratios of the energy measures of spectral components in the two or more input audio signals to the energy measures of spectral energy in the coupled-channel signal, square roots of ratios of the energy measures of spectral energy in the coupled-channel signal to the energy measures of spectral components in the two or more input audio signals, ratios of square roots of the energy measures of spectral components in the two or more input audio signals to square roots of the energy measures of spectral energy in the coupled-channel signal, or ratios of square roots of the energy measures of spectral energy in the coupled-channel signal to square roots of the energy measures of spectral components in the two or more input audio signals;
calculating scale factors by obtaining square roots of ratios of the energy measures of spectral components in the residual signals to the energy measures of spectral components in the one or more synthesized signals, square roots of ratios of the energy measures of spectral components in the one or more synthesized signals to the energy measures of spectral components in the residual signals, ratios of square roots of the energy measures of spectral components in the residual signals to square roots of the energy measures of spectral components in the one or more synthesized signals, or ratios of square roots of the energy measures of spectral components in the one or more synthesized signals to square roots of the energy measures of spectral components in the residual signals; and
assembling signal information and scaling information into an encoded signal, wherein the signal information represents the spectral components in the one or more baseband signals and the spectral components in the coupled-channel signal, and wherein the scaling information represents the scale factors and the coupling scale factors.
1 Assignment
0 Petitions
Accused Products
Abstract
An audio encoder discards spectral components of an input signal and uses channel coupling to reduce the information capacity requirements of an encoded signal. Channel coupling represents selected spectral components of multiple channels of signals in a composite form. An audio decoder synthesizes spectral components to replace the discarded spectral components and generates spectral components for individual channel signals from the coupled-channel signal. The encoder provides scale factors in the encoded signal that improve the efficiency of the decoder to generate output signals that substantially preserve the spectral energy of the original input signals.
123 Citations
65 Claims
-
1. A method for encoding a plurality of input audio signals, wherein the method comprises:
-
receiving the plurality of input audio signals and obtaining therefrom one or more baseband signals and one or more residual signals, wherein spectral components of a baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and spectral components in an associated residual signal represent spectral components of the respective input audio signal in a second set of frequency subbands that are not represented by the baseband signal; obtaining energy measures of at least some spectral components of one or more synthesized signals to be generated during decoding, wherein the one or more synthesized signals have spectral components within the second set of frequency subbands; obtaining energy measures of at least some spectral components of each residual signal; obtaining from the plurality of input audio signals a coupled-channel signal having spectral components representing a composite of spectral components of two or more of the input audio signals in a third sec of frequency subbands; obtaining energy measures of at least some spectral components of the coupled-channel signal; obtaining energy measures of at least some of the spectral components of the two or more input audio signals represented by the coupled-channel signal in the third set of frequency subbands; calculating coupling scale factors by obtaining square roots of ratios of the energy measures of spectral components in the two or more input audio signals to the energy measures of spectral energy in the coupled-channel signal, square roots of ratios of the energy measures of spectral energy in the coupled-channel signal to the energy measures of spectral components in the two or more input audio signals, ratios of square roots of the energy measures of spectral components in the two or more input audio signals to square roots of the energy measures of spectral energy in the coupled-channel signal, or ratios of square roots of the energy measures of spectral energy in the coupled-channel signal to square roots of the energy measures of spectral components in the two or more input audio signals; calculating scale factors by obtaining square roots of ratios of the energy measures of spectral components in the residual signals to the energy measures of spectral components in the one or more synthesized signals, square roots of ratios of the energy measures of spectral components in the one or more synthesized signals to the energy measures of spectral components in the residual signals, ratios of square roots of the energy measures of spectral components in the residual signals to square roots of the energy measures of spectral components in the one or more synthesized signals, or ratios of square roots of the energy measures of spectral components in the one or more synthesized signals to square roots of the energy measures of spectral components in the residual signals; and assembling signal information and scaling information into an encoded signal, wherein the signal information represents the spectral components in the one or more baseband signals and the spectral components in the coupled-channel signal, and wherein the scaling information represents the scale factors and the coupling scale factors. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 44, 60, 61, 62)
-
-
17. An encoder for encoding a plurality of input audio signals, wherein the encoder has processing circuitry that performs a method for encoding a plurality of input audio signals that comprises:
-
receiving the plurality of input audio signals and obtaining therefrom one or more baseband signals and one or more residual signals, wherein spectral components of a baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and spectral components in an associated residual signal represent spectral components of the respective input audio signal in a second set of frequency subbands that are not represented by the baseband signal; obtaining energy measures of at least some spectral components of one or more synthesized signals to be generated during decoding, wherein the one or more synthesized signals have spectral components within the second set of frequency subbands; obtaining energy measures of at least some spectral components of each residual signal; obtaining from the plurality of input audio signals a coupled-channel signal having spectral components representing a composite of spectral components of two or more of the input audio signals in a third set of frequency subbands; obtaining energy measures of at least some spectral components of the coupled-channel signal; obtaining energy measures of at least some of the spectral components of the two or more input audio signals represented by the coupled-channel signal in the third set of frequency subbands; calculating coupling scale factors by obtaining square roots of ratios of the energy measures of spectral components in the two or more input audio signals to the energy measures of spectral energy in the coupled-channel signal, square roots of ratios of the energy measures of spectral energy in the coupled-channel signal to the energy measures of spectral components in the two or more input audio signals, ratios of square roots of the energy measures of spectral components in the two or more input audio signals to square roots of the energy measures of spectral energy in the coupled-channel signal, or ratios of square roots of the energy measures of spectral energy in the coupled-channel signal to square roots of the energy measures of spectral components in the two or more input audio signals; calculating scale factors by obtaining square roots of ratios of the energy measures of spectral components in the residual signals to the energy measures of spectral components in the one or more synthesized signals, square roots of ratios of the energy measures of spectral components in the one or more synthesized signals to the energy measures of spectral components in the residual signals, ratios of square roots of the energy measures of spectral components in the residual signals to square roots of the energy measures of spectral components in the one or more synthesized signals, or ratios of square roots of the energy measures of spectral components in the one or more synthesized signals to square roots of the energy measures of spectral components in the residual signals; and assembling signal information and scaling information into an encoded signal, wherein the signal information represents the spectral components in the one or more baseband signals and the spectral components in the coupled-channel signal, and wherein the scaling information represents the scale factors and the coupling scale factors.
-
-
18. A method for decoding an encoded signal representing a plurality of input audio signals, wherein the method comprises:
-
obtaining from the encoded signal signal information, a coupled-channel signal and scaling information, the signal information representing spectral components for one or more baseband signals, the spectral components in each baseband signal representing spectral components of a respective input audio signal in a first set of frequency subbands, the coupled-channel signal having spectral components representing a composite of two or more of the plurality of input audio signals in a third set of frequency subbands, and the scaling information representing scale factors calculated from square roots of ratios of energy measures of spectral components or ratios of square roots of energy measures of spectral components, and representing coupling scale factors calculated from square roots of ratios of energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands to the energy measures of spectral energy in the coupled-channel signal, square roots of ratios of the energy measures of spectral energy in the coupled-channel signal to the energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands, ratios of square roots of the energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands to square roots of the energy measures of spectral energy in the coupled-channel signal, or ratios of square roots of the energy measures of spectral energy in the coupled-channel signal to square roots of the energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands; generating for each respective baseband signal an associated synthesized signal having spectral components in a second set of frequency subbands that are not represented by the respective baseband signal, wherein the spectral components in the associated synthesized signal are scaled by multiplication or division according to one or more of the scale factors; generating from the coupled-channel signal a respective decoupled signal for each of the two or more input audio signals represented by the coupled-channel signal, wherein the decoupled signals have spectral components in the third set of frequency subbands that are scaled by multiplication or division according to one or more of the coupling scale factors; and generating a plurality of output audio signals, each output audio signal representing a respective input audio signal and generated from the spectral components in a respective baseband signal and its associated synthesized signal and from the spectral components in respective decoupled signals. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56)
-
-
31. A decoder for decoding an encoded signal representing a plurality of input audio signals, wherein the decoder has processing circuitry that performs a method for decoding an encoded signal representing a plurality of input audio signals that comprises:
-
obtaining from the encoded signal signal information, a coupled-channel signal and scaling information, the signal information representing spectral components for one or more baseband signals, the spectral components in each baseband signal representing spectral components of a respective input audio signal in a first set of frequency subbands, the coupled-channel signal having spectral components representing a composite of two or more of the plurality of input audio signals in a third set of frequency subbands, and the scaling information representing scale factors calculated from square roots of ratios of energy measures of spectral components or ratios of square roots of energy measures of spectral components, and representing coupling scale factors calculated from square roots of ratios of energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands to the energy measures of spectral energy in the coupled-channel signal, square roots of ratios of the energy measures of spectral energy in the coupled-channel signal to the energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands, ratios of square roots of the energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands to square roots of the energy measures of spectral energy in the coupled-channel signal, or ratios of square roots of the energy measures of spectral energy in the coupled-channel signal to square roots of the energy measures of spectral components of the two or more input audio signals in the third set of frequency subband; generating for each respective baseband signal an associated synthesized signal having spectral components in a second set of frequency subbands that are not represented by the respective baseband signal, wherein the spectral components in the associated synthesized signal are scaled by multiplication or division according to one or more of the scale factors; generating from the coupled-channel signal a respective decoupled signal for each of the two or more input audio signals represented by the coupled-channel signal, wherein the decoupled signals have spectral components in the third set of frequency subbands that are scaled by multiplication or division according to one or more of the coupling scale factors; and generating a plurality of output audio signals, each output audio signal representing a respective input audio signal and generated from the spectral components in a respective baseband signal and its associated synthesized signal and from the spectral components in respective decoupled signals.
-
-
32. A method for encoding a plurality of input audio signals, wherein the method comprises:
-
receiving the plurality of input audio signals and obtaining therefrom a plurality of baseband signals, a plurality of residual signals and a coupled-channel signal, wherein spectral components of a baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and spectral components of an associated residual signal represent spectral components of the respective input audio signal in a second set of frequency subbands that are not represented by the baseband signal, and wherein spectral components of the coupled-channel signal represent a composite of spectral components of two or more of the input audio signals in a third set of frequency subbands; obtaining energy measures of at least some spectral components of each residual signal and the two or more input audio signals represented by the coupled-channel signal; and assembling control information and, signal information into an encoded signal, wherein the control information is derived from the energy measures and wherein the signal information represents the spectral components in the plurality of baseband signals and the coupled-channel signal. - View Dependent Claims (33, 34, 35, 59, 63, 64, 65)
-
-
36. A method for decoding an encoded signal representing a plurality of input audio signals, wherein the method comprises:
-
obtaining control information and signal information from the encoded signal, wherein the control information is derived from energy measures of spectral components and the signal information represents spectral components of a plurality of baseband signals and a coupled-channel signal, wherein the spectral components in each baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and the spectral components of the coupled-channel signal represent a composite of spectral components in a third set of frequency subbands of two or more of the plurality of input audio signals; generating for each respective baseband signal an associated synthesized signal having spectral components in a second set of frequency subbands that are not represented by the respective baseband signal, wherein the spectral components in the associated synthesized signal are scaled according to the control information; generating from the coupled-channel signal a respective decoupled signal for each of the two or more input audio signals represented by the coupled-channel signal, wherein the decoupled signals have spectral components in the third set of frequency subbands that are sealed according to the control information; and generating a plurality of output audio signals, wherein each output audio signal represents a respective input audio signal and is generated from the spectral components in a respective baseband signal and its associated synthesized signal, and wherein output audio signals representing the two or more audio signals are also generated from the spectral components in the respective decoupled signals. - View Dependent Claims (37, 38, 39, 40, 41, 42, 43)
-
-
57. An encoder for encoding a plurality of input audio signals, wherein the encoder has processing circuitry that performs a method for encoding a plurality of input audio signals that comprises:
-
receiving the plurality of input audio signals and obtaining therefrom a plurality of baseband signals, a plurality of residual signals and a coupled-channel signal, wherein spectral components of a baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and spectral components of an associated residual signal represent spectral components of the respective input audio signal in a second set of frequency subbands that are not represented by the baseband signal, and wherein spectral components of the coupled-channel signal represent a composite of spectral components of two or more of the input audio signals in a third set of frequency subbands; obtaining energy measures of at least some spectral components of each residual signal and the two or more input audio signals represented by the coupled-channel signal; and assembling control information and signal information into an encoded signal, wherein the control information is derived from the energy measures and wherein the signal information represents the spectral components in the plurality of baseband signals and the coupled-channel signal.
-
-
58. A decoder for decoding an encoded signal representing a plurality of input audio signals, wherein the decoder has processing circuitry that performs a method for decoding an encoded signal representing a plurality of input audio signals that comprises;
-
obtaining control information and signal information from the encoded signal, wherein the control information is derived from energy measures of spectral components and the signal information represents spectral components of a plurality of baseband signals and a coupled-channel signal, wherein the spectral components in each baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and the spectral components of the coupled-channel signal represent a composite of spectral components in a third set of frequency subbands of two or more of the plurality of input audio signals; generating for each respective baseband signal an associated synthesized signal having spectral components in a second net of frequency subbands that are not represented by the respective baseband signal, wherein the spectral components in the associated synthesized signal are scaled according to the control information; generating from the coupled-channel signal a respective decoupled signal for each of the two or more input audio signals represented by the coupled-channel signal, wherein the decoupled signals have spectral components in the third set of frequency subbands that are scaled according to the control information; and generating a plurality of output audio signals, wherein each output audio signal represents a respective input audio signal and is generated from the spectral components in a respective baseband signal and its associated synthesized signal, and wherein output audio signals representing the two or more audio signals are also generated from the spectral components in the respective decoupled signals.
-
Specification