Scalable audio coding
First Claim
1. An audio encoder for encoding an audio signal (IN), the audio encoder comprising:
- encoder means (ENC) for encoding the audio signal (IN) into a first encoded signal part; and
computation means for computing a representation of an excitation pattern of the audio signal and providing the representation of the excitation pattern as a second encoded signal part, wherein the representation of the excitation pattern comprises a representation of human auditory nerve response modeled by a filter bank of parallel auditory filters, the filters in the filter bank having values which relate to a signal level of a frequency band of a corresponding auditory filter, the excitation pattern of the audio signal thereby being a parametric spectral description of the audio signal, the computation means further for computing a representation of a masking curve based on quantized excitation parameters of the representation of the excitation pattern, and providing the representation of the masking curve to the encoder means so as to optimize encoding efficiency of the encoder means, wherein the encoder means encodes signal components of the audio signal relative to the masking curve, further wherein the second encoded signal part, included within an output bit stream of the audio encoder, along with the first signal part, provides a scalable encoded audio signal of the audio encoder.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention relates to an audio encoder and decoder and methods for audio encoding and decoding. In a preferred encoder embodiment an audio signal is encoded by deterministic encoder means to form a first encoded signal part. A spectrum of the audio signal is determined and represented by an excitation pattern, i.e. spectral values corresponding to human auditory filters, as a second encoded signal part. A masking curve is also extracted based on the excitation pattern, thus improving encoding efficiency in terms of bit rate. In a preferred decoder the first encoded signal part is decoded by deterministic decoder means. A noise generator uses the decoded first signal part together with the second signal part, i.e. the excitation pattern for the original audio signal, to generate a noise signal. The noise signal is then added to the first decoded signal part to form an output audio signal. At the decoder side the masking curve is also extracted based on the second encoded signal part, i.e. the excitation pattern. The noise signal is generated so that the output audio signal exhibits an excitation pattern nearly identical to the original audio signal. Thus, a perceived high quality audio is obtained while the encoded signal is scalable since a possible deviation between encoding and decoding of the first signal part is compensated by the noise generator at the decoder side. In preferred embodiments the coding means comprises a sinusoidal coder.
-
Citations
18 Claims
-
1. An audio encoder for encoding an audio signal (IN), the audio encoder comprising:
-
encoder means (ENC) for encoding the audio signal (IN) into a first encoded signal part; and computation means for computing a representation of an excitation pattern of the audio signal and providing the representation of the excitation pattern as a second encoded signal part, wherein the representation of the excitation pattern comprises a representation of human auditory nerve response modeled by a filter bank of parallel auditory filters, the filters in the filter bank having values which relate to a signal level of a frequency band of a corresponding auditory filter, the excitation pattern of the audio signal thereby being a parametric spectral description of the audio signal, the computation means further for computing a representation of a masking curve based on quantized excitation parameters of the representation of the excitation pattern, and providing the representation of the masking curve to the encoder means so as to optimize encoding efficiency of the encoder means, wherein the encoder means encodes signal components of the audio signal relative to the masking curve, further wherein the second encoded signal part, included within an output bit stream of the audio encoder, along with the first signal part, provides a scalable encoded audio signal of the audio encoder. - View Dependent Claims (2, 3, 4, 15)
-
-
5. An audio decoder for regenerating an audio signal from an encoded audio signal based on an original audio signal, the encoded audio signal including a first encoded audio signal part and a second encoded audio signal part, the audio decoder comprising:
-
means for generating, from the second encoded audio signal part, a representation of an excitation pattern of the original audio signal, wherein the representation of the excitation pattern comprises a representation of human auditory nerve response modeled by a filter bank of parallel auditory filters, the filters in the filter bank having values which relate to a signal level of a frequency band of a corresponding auditory filter, the excitation pattern of the audio signal thereby being a parametric spectral description of the original audio signal; decoder means for generating a first decoded signal part from (i) the first encoded signal part and (ii) a masking curve based on quantized excitation parameters of the representation of the excitation pattern; and signal generator means for generating a second decoded signal part, based on a scalable noise model, in response to the representation of the excitation pattern and the first decoded signal part, so that a sum of the first and second decoded signal parts exhibits an excitation pattern that is substantially equal to the excitation pattern of the original audio signal, for creating a resulting regenerated audio signal with perceivable spectral properties similar to the original audio signal. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 16)
-
-
13. A method of encoding an audio signal comprising the steps of:
-
computing, in an excitation pattern computation means, a representation of an excitation pattern of the audio signal, wherein the representation of the excitation pattern comprises a representation of human auditory nerve response modeled by a filter bank of parallel auditory filters, having values each of which relate to a signal level of a frequency band of a corresponding auditory filter, providing a parametric spectral description of the audio signal; computing, in a masking curve computation unit, a representation of a masking curve based on quantized excitation parameters of the representation of the excitation pattern; encoding, using encoding means, the audio signal according to an encoding scheme into a first encoded signal part by utilizing the masking curve so as to optimize an encoding efficiency of the encoding, wherein the encoding encodes signal components of the audio signal relative to the masking curve; and providing, using the excitation pattern computation means, a second encoded signal part comprising the representation of the excitation pattern of the audio signal, wherein the second encoded signal part, for being included within an output bit stream, along with the first signal part, provides a scalable encoded audio signal. - View Dependent Claims (17)
-
-
14. A method of regenerating an audio signal from an encoded audio signal based on an original audio signal, the encoded audio signal including a first encoded signal part and a second encoded signal part, the method comprising the steps of:
-
generating, using a noise modeler, from the second encoded signal part, a representation of an excitation pattern of the original audio signal, wherein the representation of the excitation pattern comprises a representation of human auditory nerve response modeled by a filter bank of parallel auditory filters, having values each of which relate to a signal level of a frequency band of a corresponding auditory filter, providing a parametric spectral description of the original audio signal; generating, using a masking curve computation unit, from the representation of the excitation pattern, a representation of a masking curve, the masking curve based on quantized excitation parameters of the representation of the excitation pattern; decoding, using decoding means, a first encoded signal part, according to a decoding scheme, into a first decoded signal part, wherein the decoding includes using the masking curve to decode the first encoded signal part; and generating, using the noise modeler, a second decoded signal part, based on a scalable noise model, in response to the representation of the excitation pattern and the first decoded signal part, so that a sum of the first and second decoded signal parts exhibits an excitation pattern that is substantially equal to the excitation pattern of the original audio signal, for creating a resulting regenerated audio signal with perceivable spectral properties similar to the original audio signal. - View Dependent Claims (18)
-
Specification