Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal
First Claim
1. Audio encoder for encoding a multichannel signal, comprising:
- a linear prediction domain encoder;
a frequency domain encoder; and
a controller for switching between the linear prediction domain encoder and the frequency domain encoder,wherein the linear prediction domain encoder comprises a downmixer for downmixing the multichannel signal to acquire a downmix signal, a linear prediction domain core encoder for encoding the downmix signal and a first joint multichannel encoder for generating first multichannel information from the multichannel signal,wherein the frequency domain encoder comprises a second joint multichannel encoder for encoding second multichannel information from the multichannel signal, wherein the second joint multichannel encoder is different from the first joint multichannel encoder, andwherein the controller is configured such that a portion of the multichannel signal is represented either by an encoded frame of the linear prediction domain encoder or by an encoded frame of the frequency domain encoder,wherein the linear prediction domain encoder comprises an ACELP processor and a TCX processor, wherein the ACELP processor is configured to operate on a downsampled downmix signal and wherein a time domain bandwidth extension processor is configured to parametrically encode a band of a portion of the downmix signal removed from the ACELP input signal by a third downsampling, and wherein the TCX processor is configured to operate on the downmix signal not downsampled or downsampled by a degree smaller than the downsampling for the ACELP processor, the TCX processor comprising a first time-frequency converter, a first parameter generator for generating a parametric representation of a first set of bands and a first quantizer encoder for generating a set of quantized encoder spectral lines for a second set of bands, orwherein the controller is configured to switch within a current frame of the multichannel signal from using the frequency domain encoder for encoding a previous frame to the linear prediction domain encoder for decoding an upcoming frame, wherein the first joint multichannel encoder is configured to calculate synthetic multichannel parameters from the multichannel signal for the current frame, and wherein the second joint multichannel encoder is configured to weight the multichannel signal using a stop window.
2 Assignments
0 Petitions
Accused Products
Abstract
A schematic block diagram of an audio encoder for encoding a multichannel audio signal is shown. The audio encoder includes a linear prediction domain encoder, a frequency domain encoder, and a controller for switching between the linear prediction domain encoder and the frequency domain encoder. The controller is configured such that a portion of the multichannel signal is represented either by an encoded frame of the linear prediction domain encoder or by an encoded frame of the frequency domain encoder. The linear prediction domain encoder includes a downmixer for downmixing the multichannel signal to obtain a downmixed signal. The linear prediction domain encoder further includes a linear prediction domain core encoder for encoding the downmix signal and furthermore, the linear prediction domain encoder includes a first joint multichannel encoder for generating first multichannel information from the multichannel signal.
22 Citations
24 Claims
-
1. Audio encoder for encoding a multichannel signal, comprising:
-
a linear prediction domain encoder; a frequency domain encoder; and a controller for switching between the linear prediction domain encoder and the frequency domain encoder, wherein the linear prediction domain encoder comprises a downmixer for downmixing the multichannel signal to acquire a downmix signal, a linear prediction domain core encoder for encoding the downmix signal and a first joint multichannel encoder for generating first multichannel information from the multichannel signal, wherein the frequency domain encoder comprises a second joint multichannel encoder for encoding second multichannel information from the multichannel signal, wherein the second joint multichannel encoder is different from the first joint multichannel encoder, and wherein the controller is configured such that a portion of the multichannel signal is represented either by an encoded frame of the linear prediction domain encoder or by an encoded frame of the frequency domain encoder, wherein the linear prediction domain encoder comprises an ACELP processor and a TCX processor, wherein the ACELP processor is configured to operate on a downsampled downmix signal and wherein a time domain bandwidth extension processor is configured to parametrically encode a band of a portion of the downmix signal removed from the ACELP input signal by a third downsampling, and wherein the TCX processor is configured to operate on the downmix signal not downsampled or downsampled by a degree smaller than the downsampling for the ACELP processor, the TCX processor comprising a first time-frequency converter, a first parameter generator for generating a parametric representation of a first set of bands and a first quantizer encoder for generating a set of quantized encoder spectral lines for a second set of bands, or wherein the controller is configured to switch within a current frame of the multichannel signal from using the frequency domain encoder for encoding a previous frame to the linear prediction domain encoder for decoding an upcoming frame, wherein the first joint multichannel encoder is configured to calculate synthetic multichannel parameters from the multichannel signal for the current frame, and wherein the second joint multichannel encoder is configured to weight the multichannel signal using a stop window. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. Audio decoder for decoding an encoded audio signal, comprising:
-
a linear prediction domain decoder; a frequency domain decoder; a first joint multichannel decoder for generating a first multichannel representation using an output of the linear prediction domain decoder and using a first multichannel information; a second joint multichannel decoder for generating a second multichannel representation using an output of the frequency domain decoder and a second multichannel information; and a first combiner for combining the first multichannel representation and the second multichannel representation to acquire a decoded audio signal, wherein the second joint multichannel decoder is different from the first joint multichannel decoder, wherein the encoded audio signal comprises a multichannel residual signal for the output of the linear prediction domain decoder, wherein the first joint multichannel decoder is configured to use the multichannel residual signal for generating the first multichannel representation, wherein the multichannel residual signal has a lower bandwidth than the first multichannel representation, and wherein the first joint multichannel decoder is configured to reconstruct an intermediate first multichannel representation using the first multichannel information and to add the multichannel residual signal to the intermediate first multichannel representation, or wherein the audio decoder is configured to switch within a current frame of the encoded audio signal from using the frequency domain decoder for decoding a previous frame to the linear prediction domain decoder for decoding an upcoming frame, wherein the first combiner is configured to calculate a synthetic mid-signal from the second multichannel representation of the current frame, wherein the first joint multichannel decoder is configured to generate the first multichannel representation using the synthetic mid-signal and the first multichannel information, and wherein the first combiner is configured to combine the first multichannel representation and the second multichannel representation to acquire a decoded current frame of the decoded audio signal, or wherein the audio decoder is configured to switch within a current frame of the encoded audio signal from using the linear prediction domain decoder for decoding a previous frame to the frequency domain decoder for decoding an upcoming frame, wherein the first joint multichannel decoder is configured to calculate a synthetic multichannel audio signal from a decoded mono signal of the linear prediction domain decoder for the current frame using multichannel information of the previous frame, wherein the second joint multichannel decoder is configured to calculate the second multichannel representation for the current frame and to weight the second multichannel representation using a start window, and wherein the first combiner is configured to combine the synthetic multichannel audio signal and the weighted second multichannel representation to acquire a decoded current frame of the decoded audio signal. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. Method of encoding a multichannel signal comprising:
-
performing a linear prediction domain encoding; performing a frequency domain encoding; and switching between the linear prediction domain encoding and the frequency domain encoding, wherein the linear prediction domain encoding comprises downmixing the multichannel signal to acquire a downmix signal, a linear prediction domain core encoding the downmix signal and a first joint multichannel encoding generating first multichannel information from the multichannel signal, wherein the frequency domain encoding comprises a second joint multichannel encoding generating second multichannel information from the multichannel signal, wherein the second joint multichannel encoding is different from the first joint multichannel encoding, and wherein the switching is performed such that a portion of the multichannel signal is represented either by an encoded frame of the linear prediction domain encoding or by an encoded frame of the frequency domain encoding, wherein the performing a linear prediction domain encoding comprises ACELP processing and TCX processing, wherein the ACELP processing comprises operating on a downsampled downmix signal and wherein a time domain bandwidth extension processing comprises parametrically encoding a band of a portion of the downmix signal removed from the ACELP input signal by a third downsampling, and wherein the TCX processing comprises operating on the downmix signal not downsampled or downsampled by a degree smaller than the downsampling for the ACELP processing, the TCX processing comprising first time-frequency converting, generating a parametric representation of a first set of bands and generating a set of quantized encoder spectral lines for a second set of bands, or wherein the switching comprises switching within a current frame of the multichannel signal from using the frequency domain encoding for encoding a previous frame to the linear prediction domain encoding for decoding an upcoming frame, wherein the first joint multichannel encoding comprises calculating synthetic multichannel parameters from the multichannel signal for the current frame, and wherein the second joint multichannel encoding comprises weighting the multichannel signal using a stop window.
-
-
22. Method of decoding an encoded audio signal, comprising:
-
linear prediction domain decoding; frequency domain decoding; first joint multichannel decoding generating a first multichannel representation using an output of the linear prediction domain decoding and using a first multichannel information; second joint multichannel decoding generating a second multichannel representation using an output of the frequency domain decoding and a second multichannel information; and combining the first multichannel representation and the second multichannel representation to acquire a decoded audio signal, wherein the second joint multichannel decoding is different from the first joint multichannel decoding, wherein the encoded audio signal comprises a multichannel residual signal for the output of the linear prediction domain decoding, wherein the first joint multichannel decoding comprises using the multichannel residual signal for generating the first multichannel representation, wherein the multichannel residual signal has a lower bandwidth than the first multichannel representation, and wherein the first joint multichannel decoding comprises reconstructing an intermediate first multichannel representation using the first multichannel information and adding the multichannel residual signal to the intermediate first multichannel representation, or wherein the decoding the encoded audio signal comprises switching within a current frame of the encoded audio signal from using the frequency domain decoding for decoding a previous frame to the linear prediction domain decoding for decoding an upcoming frame, wherein the combining comprises calculating a synthetic mid-signal from the second multichannel representation of the current frame, wherein the first joint multichannel decoding comprises generating the first multichannel representation using the synthetic mid-signal and the first multichannel information, and wherein the combining comprises combining the first multichannel representation and the second multichannel representation to acquire a decoded current frame of the decoded audio signal, or wherein the decoding the encoded audio signal comprises switching within a current frame of the encoded audio signal from using the linear prediction domain decoding for decoding a previous frame to the frequency domain decoding for decoding an upcoming frame, wherein the first joint multichannel decoding comprises calculating a synthetic multichannel audio signal from a decoded mono signal of the linear prediction domain decoding for the current frame using multichannel information of the previous frame, wherein the second joint multichannel decoding is configured to calculate the second multichannel representation for the current frame and to weight the second multichannel representation using a start window, and wherein the combining comprises combining the synthetic multichannel audio signal and the weighted second multichannel representation to acquire a decoded current frame of the decoded audio signal.
-
-
23. A non-transitory digital storage medium having a computer program stored thereon to perform, when said computer program is run by a computer, the method of encoding a multichannel signal, the method comprising:
-
performing a linear prediction domain encoding; performing a frequency domain encoding; and switching between the linear prediction domain encoding and the frequency domain encoding, wherein the linear prediction domain encoding comprises downmixing the multichannel signal to acquire a downmix signal, a linear prediction domain core encoding the downmix signal and a first joint multichannel encoding generating first multichannel information from the multichannel signal, wherein the frequency domain encoding comprises a second joint multichannel encoding generating second multichannel information from the multichannel signal, wherein the second joint multichannel encoding is different from the first joint multichannel encoding, and wherein the switching is performed such that a portion of the multichannel signal is represented either by an encoded frame of the linear prediction domain encoding or by an encoded frame of the frequency domain encoding, wherein the performing a linear prediction domain encoding comprises ACELP processing and TCX processing, wherein the ACELP processing comprises operating on a downsampled downmix signal and wherein a time domain bandwidth extension processing comprises parametrically encoding a band of a portion of the downmix signal removed from the ACELP input signal by a third downsampling, and wherein the TCX processing comprises operating on the downmix signal not downsampled or downsampled by a degree smaller than the downsampling for the ACELP processing, the TCX processing comprising first time-frequency converting, generating a parametric representation of a first set of bands and generating a set of quantized encoder spectral lines for a second set of bands, or wherein the switching comprises switching within a current frame of the multichannel signal from using the frequency domain encoding for encoding a previous frame to the linear prediction domain encoding for decoding an upcoming frame, wherein the first joint multichannel encoding comprises calculating synthetic multichannel parameters from the multichannel signal for the current frame, and wherein the second joint multichannel encoding comprises weighting the multichannel signal using a stop window.
-
-
24. A non-transitory digital storage medium having a computer program stored thereon to perform, when said computer program is run by a computer, the method of decoding an encoded audio signal, the method comprising:
-
linear prediction domain decoding; frequency domain decoding; first joint multichannel decoding generating a first multichannel representation using an output of the linear prediction domain decoding and using a first multichannel information; second joint multichannel decoding generating a second multichannel representation using an output of the frequency domain decoding and a second multichannel information; and combining the first multichannel representation and the second multichannel representation to acquire a decoded audio signal, wherein the second joint multichannel decoding is different from the first joint multichannel decoding, wherein the encoded audio signal comprises a multichannel residual signal for the output of the linear prediction domain decoding, wherein the first joint multichannel decoding comprises using the multichannel residual signal for generating the first multichannel representation, wherein the multichannel residual signal has a lower bandwidth than the first multichannel representation, and wherein the first joint multichannel decoding comprises reconstructing an intermediate first multichannel representation using the first multichannel information and adding the multichannel residual signal to the intermediate first multichannel representation, or wherein the decoding the encoded audio signal comprises switching within a current frame of the encoded audio signal from using the frequency domain decoding for decoding a previous frame to the linear prediction domain decoding for decoding an upcoming frame, wherein the combining comprises calculating a synthetic mid-signal from the second multichannel representation of the current frame, wherein the first joint multichannel decoding comprises generating the first multichannel representation using the synthetic mid-signal and the first multichannel information, and wherein the combining comprises combining the first multichannel representation and the second multichannel representation to acquire a decoded current frame of the decoded audio signal, or wherein the decoding the encoded audio signal comprises switching within a current frame of the encoded audio signal from using the linear prediction domain decoding for decoding a previous frame to the frequency domain decoding for decoding an upcoming frame, wherein the first joint multichannel decoding comprises calculating a synthetic multichannel audio signal from a decoded mono signal of the linear prediction domain decoding for the current frame using multichannel information of the previous frame, wherein the second joint multichannel decoding is configured to calculate the second multichannel representation for the current frame and to weight the second multichannel representation using a start window, and wherein the combining comprises combining the synthetic multichannel audio signal and the weighted second multichannel representation to acquire a decoded current frame of the decoded audio signal.
-
Specification