Method and apparatus for encoding and decoding multiple audio channels at low bit rates using adaptive selection of encoding method
Abstract
A split-band coding system combines multiple channels of input signals into various forms of composite signals and generates spatial-characteristic signals representing soundfield spatial characteristics in a plurality of frequency subbands. The spatial-characteristics signals may be generated in either or both of two forms. In a first form, the signal represents measures of signal levels for subband signals from the input channels. In a second form, the signal represents one or more apparent directions for the soundfield. The type of the spatial-characteristics signal may be adapted dynamically in response to a variety of criteria including input signal characteristics. Temporal smoothing and spectral smoothing of the spatial-characteristics signals may be applied in an encoder. Temporal smoothing and spectral smoothing may be applied to gain factors derived from the spatial-characteristics signals in a decoder.
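The split-band arrangement summarized above can be illustrated with a minimal Python sketch. It is not the patented implementation: the naive DFT merely stands in for whatever filter bank an encoder would use, the composite is assumed to be a plain bin-wise sum of the channels, and the names `dft`, `split_into_subbands`, `encode_block` and the `edges` parameter are hypothetical. Only the first form of spatial-characteristics signal (a level measure per channel per subband) is shown.

```python
import cmath
import math

def dft(block):
    """Naive DFT of a real block; returns bins 0 .. N/2 (assumed filter bank)."""
    n = len(block)
    return [sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(block))
            for k in range(n // 2 + 1)]

def split_into_subbands(bins, edges):
    """Group transform bins into subbands; edges[i]:edges[i+1] bounds subband i."""
    return [bins[lo:hi] for lo, hi in zip(edges[:-1], edges[1:])]

def encode_block(channels, edges):
    """Return (composite_subbands, levels) for one block of multichannel audio.

    levels[b][c] is the RMS bin magnitude of channel c in subband b.
    """
    per_channel = [split_into_subbands(dft(ch), edges) for ch in channels]
    composite, levels = [], []
    for b in range(len(edges) - 1):
        bands = [pc[b] for pc in per_channel]
        # Composite signal: bin-wise sum of the channel subband signals (assumption).
        composite.append([sum(vals) for vals in zip(*bands)])
        levels.append([math.sqrt(sum(abs(v) ** 2 for v in band) / len(band))
                       for band in bands])
    return composite, levels
```

Calling `encode_block([left, right], [0, 4, 9])` on two 16-sample channels yields two composite subbands (bins 0 to 3 and 4 to 8) and a 2-by-2 table of levels, one row per subband.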
69 Claims
1. A method for generating an encoded signal by encoding a plurality of input signals each representing a respective channel of audio information, said method comprising:
- generating a plurality of channel subband signals representing said input signals in a plurality of frequency subbands,
- generating a composite signal representing at least a portion of the bandwidth of said input signals,
- generating a spatial-characteristic signal representing spatial characteristics of a soundfield associated with said composite signal in response to respective channel subband signals in a frequency subband, wherein said spatial-characteristic signal is generated in either or both of a first form conveying information representing signal levels of said respective channel subband signals and a second form representing one or more intended directions, wherein the forms in which to generate said spatial-characteristics signal are determined adaptively, and
- assembling said composite signal, said spatial-characteristics signal and an indication of the form of said spatial-characteristics signal into said encoded signal.
Dependent claims: 2, 3, 4, 5, 7, 8, 9, 10, 13, 14, 15
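As a rough illustration of the adaptive choice of form recited in claim 1, the following Python sketch picks between the two forms for a single subband and records the choice in the assembled frame. The selection criterion is an assumption made for this example only (the claim does not fix one): the direction form is used when a level-weighted vector sum of the channel directions is strongly concentrated. `SubbandFrame`, `encode_subband`, `FORM_LEVELS`, `FORM_DIRECTION`, the `threshold` value and the channel azimuths are all hypothetical.

```python
import math
from dataclasses import dataclass

FORM_LEVELS = 1      # first form: one signal level per input channel
FORM_DIRECTION = 2   # second form: one intended direction (azimuth in degrees)

@dataclass
class SubbandFrame:
    composite: list  # composite subband samples
    form: int        # indication of the form of the spatial data
    spatial: list    # levels (form 1) or [azimuth_deg] (form 2)

def encode_subband(channel_subbands, azimuths_deg, threshold=0.9):
    """Assemble one subband frame, choosing the spatial form adaptively."""
    levels = [math.sqrt(sum(s * s for s in ch) / len(ch)) for ch in channel_subbands]
    composite = [sum(vals) for vals in zip(*channel_subbands)]
    total = sum(levels)
    # Level-weighted vector sum of the channel directions (assumed criterion).
    x = sum(l * math.cos(math.radians(a)) for l, a in zip(levels, azimuths_deg))
    y = sum(l * math.sin(math.radians(a)) for l, a in zip(levels, azimuths_deg))
    concentration = math.hypot(x, y) / total if total > 0 else 0.0
    if concentration >= threshold:
        # One direction describes the soundfield well: send a single azimuth.
        return SubbandFrame(composite, FORM_DIRECTION, [math.degrees(math.atan2(y, x))])
    # Otherwise send the per-channel levels.
    return SubbandFrame(composite, FORM_LEVELS, levels)
```

In this sketch a subband with energy in only one channel is sent in the direction form, while a subband with equal energy in widely separated channels falls back to the level form.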
6. A method for generating an encoded signal by encoding a plurality of input signals each representing a respective channel of audio information, said method comprising:
- generating a plurality of channel subband signals representing said input signals in a plurality of frequency subbands,
- generating a composite signal representing at least a portion of the bandwidth of said input signals,
- generating a spatial-characteristic signal representing spatial characteristics of a soundfield associated with said composite signal in response to respective channel subband signals in a frequency subband, wherein rates of change in said spatial-characteristic signal are limited to be commensurate with temporal post-masking characteristics of a human auditory system, and
- assembling said composite signal and said spatial-characteristics signal into said encoded signal.
Dependent claims: 11, 12
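The rate-of-change limit in claim 6 can be pictured with a small Python sketch. It assumes, purely for illustration, that the spatial-characteristic signal is a per-block level in dB and that only decreases are limited, since post-masking follows the end of a sound. The function name `limit_decay` and the default `max_decay_db_per_ms` are hypothetical; the claim gives no numeric rate.

```python
def limit_decay(levels_db, block_ms, max_decay_db_per_ms=0.3):
    """Limit how fast a per-block level (dB) may fall; rises pass through.

    max_decay_db_per_ms is an illustrative figure, not a value from the claims.
    """
    max_drop = max_decay_db_per_ms * block_ms  # largest allowed fall per block
    out = []
    prev = None
    for level in levels_db:
        if prev is not None and level < prev - max_drop:
            level = prev - max_drop            # clamp the fall to the allowed rate
        out.append(level)
        prev = level
    return out
```

With 10 ms blocks and a limit of 1 dB per ms, an abrupt drop from 0 dB to -60 dB is encoded as a ramp of 10 dB per block, while a sudden rise is passed through unchanged.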
16. A method for decoding an encoded signal to generate one or more output signals for presentation via one or more output transducers, said method comprising:
- obtaining from said encoded signal one or more composite signals and a plurality of spatial-characteristics signals, and deriving a plurality of composite subband signals from said one or more composite signals, wherein each spatial-characteristics signal is associated with a respective composite subband signal and represents spatial characteristics of a respective soundfield corresponding to said respective composite subband signal,
- deriving from said spatial-characteristics signals a plurality of gain factors, wherein rates of change in values of said gain factors are limited to be commensurate with temporal post-masking characteristics of a human auditory system, and mapping a respective composite subband signal into one or more interim subband signals according to a respective gain factor, and
- generating said plurality of output signals by applying one or more inverse filter banks to said interim subband signals.
Dependent claims: 19, 20, 21, 22, 23
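The mapping step of claim 16 can be sketched in Python as follows. The sketch assumes the gain factors have already been derived as one linear gain per output per block, limits their change with a simple symmetric per-block step (an assumed stand-in for a psychoacoustically derived limit), and scales the composite subband samples to form the interim subband signals. `slew_limit`, `map_composite` and `max_step` are hypothetical names, and the inverse filter bank is left out.

```python
def slew_limit(previous, target, max_step):
    """Move each gain toward its target by at most max_step per block."""
    return [p + max(-max_step, min(max_step, t - p))
            for p, t in zip(previous, target)]

def map_composite(composite_blocks, target_gains, max_step=0.1):
    """Return, per block, one interim subband signal (sample list) per output.

    composite_blocks[i] holds the composite subband samples of block i;
    target_gains[i] holds the unsmoothed gain for each output in block i.
    """
    gains = list(target_gains[0])  # start at the first block's target gains
    interim = []
    for block, target in zip(composite_blocks, target_gains):
        gains = slew_limit(gains, target, max_step)
        # Each output receives the composite samples scaled by its smoothed gain.
        interim.append([[g * s for s in block] for g in gains])
    return interim
```

When the target gains jump from one output to the other between blocks, the smoothed gains cross-fade over several blocks instead of switching at once.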
17. A method for decoding an encoded signal to generate one or more output signals for presentation via one or more output transducers, said method comprising:
- obtaining from said encoded signal one or more composite signals, a plurality of spatial-characteristics signals and an indication of the form of said spatial-characteristics signals, and deriving a plurality of composite subband signals from said one or more composite signals, wherein each spatial-characteristics signal is associated with a respective composite subband signal and represents spatial characteristics of a respective soundfield corresponding to said respective composite subband signal, wherein said indication indicates whether said spatial-characteristics signals are in a first form representing a plurality of signal levels and/or are in a second form representing one or more directions,
- deriving from said spatial-characteristics signals a plurality of gain factors and mapping a respective composite subband signal into one or more interim subband signals according to a respective gain factor wherein, for said associated spatial-characteristic signal having said first form, said mapping is also according to a spatial configuration signal associated with said signal levels and a transducer configuration signal representing a spatial configuration of said output transducers, and wherein, for said associated spatial-characteristic signal having said second form, said mapping is also according to said one or more directions and said transducer configuration signal, and
- generating said plurality of output signals by applying one or more inverse filter banks to said interim subband signals.
Dependent claims: 18
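A hedged Python sketch of the form-dependent gain derivation in claim 17 follows. It assumes that both the spatial configuration signal and the transducer configuration signal are lists of azimuths in degrees, and it uses pairwise constant-power panning, a common technique chosen here for illustration rather than one taken from the claims. `pan`, `gains_from_spatial` and the form constants are hypothetical names.

```python
import math

FORM_LEVELS = 1      # spatial data holds one level per encoder-side channel
FORM_DIRECTION = 2   # spatial data holds one or more azimuths in degrees

def pan(azimuth_deg, speakers_deg):
    """Constant-power pan of one direction onto the enclosing pair of speakers."""
    n = len(speakers_deg)
    if n < 2:
        return [1.0] * n
    order = sorted(range(n), key=lambda i: speakers_deg[i] % 360.0)
    target = azimuth_deg % 360.0
    gains = [0.0] * n
    for k in range(n):
        a, b = order[k], order[(k + 1) % n]
        span = (speakers_deg[b] - speakers_deg[a]) % 360.0 or 360.0
        offset = (target - speakers_deg[a]) % 360.0
        if offset <= span:
            frac = offset / span
            gains[a] = math.cos(frac * math.pi / 2)
            gains[b] = math.sin(frac * math.pi / 2)
            break
    return gains

def gains_from_spatial(form, spatial, source_deg, speakers_deg):
    """Derive one gain per output transducer from either form of spatial data."""
    if form == FORM_DIRECTION:
        sources = [(1.0, az) for az in spatial]   # each direction weighted equally
    else:
        sources = list(zip(spatial, source_deg))  # each level sits at its channel position
    power = [0.0] * len(speakers_deg)
    for weight, az in sources:
        for i, g in enumerate(pan(az, speakers_deg)):
            power[i] += (weight * g) ** 2
    total = math.sqrt(sum(power))
    return [math.sqrt(p) / total if total > 0 else 0.0 for p in power]
```

In both branches the transducer layout is supplied at decode time, so the same encoded signal can be mapped to a different number or arrangement of outputs than the encoder assumed.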
24. A medium readable by a machine, embodying a program of instructions executable by said machine to perform a method for generating an encoded signal by encoding a plurality of input signals each representing a respective channel of audio information, said method comprising:
- generating a plurality of channel subband signals representing said input signals in a plurality of frequency subbands,
- generating a composite signal representing at least a portion of the bandwidth of said input signals,
- generating a spatial-characteristic signal representing spatial characteristics of a soundfield associated with said composite signal in response to respective channel subband signals in a frequency subband, wherein said spatial-characteristic signal is generated in either or both of a first form conveying information representing signal levels of said respective channel subband signals and in a second form representing one or more intended directions, wherein the forms in which to generate said spatial-characteristics signal are determined adaptively, and
- assembling said composite signal, said spatial-characteristics signal and an indication of the form of said spatial-characteristics signal into said encoded signal.
Dependent claims: 25, 26, 27, 28, 30, 31, 32, 33, 36, 37, 38
29. A medium readable by a machine, embodying a program of instructions executable by said machine to perform a method for generating an encoded signal by encoding a plurality of input signals each representing a respective channel of audio information, said method comprising:
- generating a plurality of channel subband signals representing said input signals in a plurality of frequency subbands,
- generating a composite signal representing at least a portion of the bandwidth of said input signals,
- generating a spatial-characteristic signal representing spatial characteristics of a soundfield associated with said composite signal in response to respective channel subband signals in a frequency subband, wherein rates of change in said spatial-characteristic signal are limited to be commensurate with temporal post-masking characteristics of a human auditory system, and
- assembling said composite signal and said spatial-characteristics signal into said encoded signal.
Dependent claims: 34, 35
39. A medium readable by a machine, embodying a program of instructions executable by said machine to perform a method for decoding an encoded signal to generate one or more output signals for presentation via one or more output transducers, said method comprising:
- obtaining from said encoded signal one or more composite signals and a plurality of spatial-characteristics signals, and deriving a plurality of composite subband signals from said one or more composite signals, wherein each spatial-characteristics signal is associated with a respective composite subband signal and represents spatial characteristics of a respective soundfield corresponding to said respective composite subband signal,
- deriving from said spatial-characteristics signals a plurality of gain factors, wherein rates of change in values of said gain factors are limited to be commensurate with temporal post-masking characteristics of a human auditory system, and mapping a respective composite subband signal into one or more interim subband signals according to a respective gain factor, and
- generating said plurality of output signals by applying one or more inverse filter banks to said interim subband signals.
Dependent claims: 42, 43, 44, 45, 46
40. A medium readable by a machine, embodying a program of instructions executable by said machine to perform a method for decoding an encoded signal to generate one or more output signals for presentation via one or more output transducers, said method comprising:
- obtaining from said encoded signal one or more composite signals, a plurality of spatial-characteristics signals and an indication of the form of said spatial-characteristics signals, and deriving a plurality of composite subband signals from said one or more composite signals, wherein each spatial-characteristics signal is associated with a respective composite subband signal and represents spatial characteristics of a respective soundfield corresponding to said respective composite subband signal, wherein said indication indicates whether said spatial-characteristics signals are in a first form representing a plurality of signal levels and/or are in a second form representing one or more directions,
- deriving from said spatial-characteristics signals a plurality of gain factors and mapping a respective composite subband signal into one or more interim subband signals according to a respective gain factor wherein, for said associated spatial-characteristic signal having said first form, said mapping is also according to a spatial configuration signal associated with said signal levels and a transducer configuration signal representing a spatial configuration of said output transducers, and wherein, for said associated spatial-characteristic signal having said second form, said mapping is also according to said one or more directions and said transducer configuration signal, and
- generating said plurality of output signals by applying one or more inverse filter banks to said interim subband signals.
Dependent claims: 41
47. An encoder for generating an encoded signal by encoding a plurality of input signals each representing a respective channel of audio information, said encoder comprising:
- means for generating a plurality of channel subband signals representing said input signals in a plurality of frequency subbands,
- means for generating a composite signal representing at least a portion of the bandwidth of said input signals,
- means for generating a spatial-characteristic signal representing spatial characteristics of a soundfield associated with said composite signal in response to respective channel subband signals in a frequency subband, wherein said spatial-characteristic signal is generated in either or both of a first form conveying information representing signal levels of said respective channel subband signals and in a second form representing one or more intended directions, wherein the forms in which to generate said spatial-characteristics signal are determined adaptively, and
- means for assembling said composite signal, said spatial-characteristics signal and an indication of the form of said spatial-characteristics signal into said encoded signal.
Dependent claims: 48, 49, 50, 51, 53, 54, 55, 56, 59, 60, 61
52. An encoder for generating an encoded signal by encoding a plurality of input signals each representing a respective channel of audio information, said encoder comprising:
- means for generating a plurality of channel subband signals representing said input signals in a plurality of frequency subbands,
- means for generating a composite signal representing at least a portion of the bandwidth of said input signals,
- means for generating a spatial-characteristic signal representing spatial characteristics of a soundfield associated with said composite signal in response to respective channel subband signals in a frequency subband, wherein rates of change in said spatial-characteristic signal are limited to be commensurate with temporal post-masking characteristics of a human auditory system, and
- means for assembling said composite signal and said spatial-characteristics signal into said encoded signal.
Dependent claims: 57, 58
62. A decoder for decoding an encoded signal to generate one or more output signals for presentation via one or more output transducers, said decoder comprising:
- means for obtaining from said encoded signal one or more composite signals and a plurality of spatial-characteristics signals, and for deriving a plurality of composite subband signals from said one or more composite signals, wherein each spatial-characteristics signal is associated with a respective composite subband signal and represents spatial characteristics of a respective soundfield corresponding to said respective composite subband signal,
- means for deriving from said spatial-characteristics signals a plurality of gain factors, wherein rates of change in values of said gain factors are limited to be commensurate with temporal post-masking characteristics of a human auditory system, and for mapping a respective composite subband signal into one or more interim subband signals according to a respective gain factor, and
- means for generating said plurality of output signals by applying one or more inverse filter banks to said interim subband signals.
Dependent claims: 65, 66, 67, 68, 69
63. A decoder for decoding an encoded signal to generate one or more output signals for presentation via one or more output transducers, said decoder comprising:
- means for obtaining from said encoded signal one or more composite signals, a plurality of spatial-characteristics signals and an indication of the form of said spatial-characteristics signals, and for deriving a plurality of composite subband signals from said one or more composite signals, wherein each spatial-characteristics signal is associated with a respective composite subband signal and represents spatial characteristics of a respective soundfield corresponding to said respective composite subband signal, wherein said indication indicates whether said spatial-characteristics signals are in a first form representing a plurality of signal levels and/or are in a second form representing one or more directions,
- means for deriving from said spatial-characteristics signals a plurality of gain factors and for mapping a respective composite subband signal into one or more interim subband signals according to a respective gain factor wherein, for said associated spatial-characteristic signal having said first form, said mapping is also according to a spatial configuration signal associated with said signal levels and a transducer configuration signal representing a spatial configuration of said output transducers, and wherein, for said associated spatial-characteristic signal having said second form, said mapping is also according to said one or more directions and said transducer configuration signal, and
- means for generating said plurality of output signals by applying one or more inverse filter banks to said interim subband signals.
Dependent claims: 64
Specification