Shape and scale parameters for extended-band frequency coding
First Claim
1. In an audio encoder, a computer-implemented method comprising:
- the audio encoder receiving source audio data;
the audio encoder performing a time-to-frequency transform on the received source audio data to produce frequency-domain data for the received source audio data;
the audio encoder partitioning the frequency-domain data into a plurality of sub-bands in a baseband frequency range and an extended-band frequency range; and
the audio encoder performing frequency extension coding on the received source audio data, the frequency extension coding comprising determining one or more shape parameters for the frequency-domain data, wherein determining the one or more shape parameters comprises using a displacement vector that measures a displacement of a first sub-band in the extended-band frequency range relative to a second sub-band in the baseband frequency range;
wherein the determining one or more shape parameters comprises for the first sub-band in the extended-band frequency range, finding the second sub-band in the baseband frequency range whose sub-band shape matches that of the first sub-band in the extended-band frequency range within a tolerance wherein said finding is restricted to sub-bands in the baseband frequency range that are only at a displacement of an even number of sub-bands from said first sub-band in the extended-band frequency range, and wherein the displacement measured by the displacement vector is an even number of sub-bands, wherein the sub-band shape matching is performed using a vector quantization process.
2 Assignments
0 Petitions
Accused Products
Abstract
An audio encoder performs frequency extension coding that comprises determining one or more shape parameters using a displacement vector that corresponds to a displacement of an even number (e.g., an even number of sub-bands between a sub-band in a baseband frequency range and a sub-band in an extended-band frequency range). The shape parameters can be determined on a per-audio-block basis. Restricting a displacement to an even number (in frequency extension coding or in other signal modulation schemes) can improve the quality of reconstructed audio. An audio encoder also can perform frequency extension coding that comprises determining one or more scale parameters at one or more audio blocks, and determining one or more anchor points for interpolating the one or more scale parameters.
-
Citations
10 Claims
-
1. In an audio encoder, a computer-implemented method comprising:
-
the audio encoder receiving source audio data; the audio encoder performing a time-to-frequency transform on the received source audio data to produce frequency-domain data for the received source audio data; the audio encoder partitioning the frequency-domain data into a plurality of sub-bands in a baseband frequency range and an extended-band frequency range; and the audio encoder performing frequency extension coding on the received source audio data, the frequency extension coding comprising determining one or more shape parameters for the frequency-domain data, wherein determining the one or more shape parameters comprises using a displacement vector that measures a displacement of a first sub-band in the extended-band frequency range relative to a second sub-band in the baseband frequency range; wherein the determining one or more shape parameters comprises for the first sub-band in the extended-band frequency range, finding the second sub-band in the baseband frequency range whose sub-band shape matches that of the first sub-band in the extended-band frequency range within a tolerance wherein said finding is restricted to sub-bands in the baseband frequency range that are only at a displacement of an even number of sub-bands from said first sub-band in the extended-band frequency range, and wherein the displacement measured by the displacement vector is an even number of sub-bands, wherein the sub-band shape matching is performed using a vector quantization process. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A tangible computer-readable storage medium storing computer-executable instructions for causing a computer programmed thereby to perform the method comprising:
-
receiving source audio data; performing a time-to-frequency transform on the received source audio data to produce frequency-domain data for the received source audio data; partitioning the frequency-domain data into a plurality of sub-bands in a baseband frequency range and an extended-band frequency range; and performing frequency extension coding on the received source audio data, the frequency extension coding comprising determining one or more shape parameters for the frequency-domain data, wherein determining the one or more shape parameters comprises using a displacement vector that measures a displacement of a first sub-band in the extended-band frequency range relative to a second sub-band in the baseband frequency range; wherein the determining one or more shape parameters comprises for the first sub-band in the extended-band frequency range, finding the second sub-band in the baseband frequency range whose sub-band shape matches that of the first sub-band in the extended-band frequency range within a tolerance wherein said finding is restricted to sub-bands in the baseband frequency range that are only at a displacement of an even number of sub-bands from said first sub-band in the extended-band frequency range, and wherein the displacement measured by the displacement vector is an even number of sub-bands, wherein the sub-band shape matching is performed using a vector quantization process.
-
-
9. In an audio encoder, a computer-implemented method comprising:
-
the audio encoder receiving source audio data; the audio encoder performing a time-to-frequency transform on the received source audio data to produce frequency-domain data for the received source audio data; and the audio encoder performing frequency extension coding on the received source audio data, the frequency extension coding comprising determining one or more shape parameters and one or more scale parameters for the frequency-domain data; wherein the determining one or more shape parameters comprises for the first sub-band in the extended-band frequency range, finding the second sub-band in the baseband frequency range whose sub-band shape matches that of the first sub-band in the extended-band frequency range within a tolerance wherein said finding is restricted to sub-bands in the baseband frequency range that are only at a displacement of an even number of sub-bands from said first sub-band in the extended-band frequency range, and wherein the displacement measured by the displacement vector is an even number of sub-bands, wherein the sub-band shape matching is performed using a vector quantization process; and wherein the determining one or more scale parameters comprises; determining one or more scale parameters for the frequency-domain data at one or more audio blocks; determining one or more anchor points of audio block time windows for interpolating in time the one or more scale parameters at time windows between the anchor points; and explicitly coding the scale factors at the anchor points into an output encoded audio stream without explicitly coding the interpolated scale parameters at time windows between the anchor points.
-
-
10. A tangible computer-readable storage medium storing computer-executable instructions for causing a computer programmed thereby to perform the method comprising:
-
receiving source audio data; performing a time-to-frequency transform on the received source audio data to produce frequency-domain data for the received source audio data; and performing frequency extension coding on the received source audio data, the frequency extension coding comprising determining one or more shape parameters and one or more scale parameters for the frequency-domain data; wherein the determining one or more shape parameters comprises for the first sub-band in the extended-band frequency range, finding the second sub-band in the baseband frequency range whose sub-band shape matches that of the first sub-band in the extended-band frequency range within a tolerance wherein said finding is restricted to sub-bands in the baseband frequency range that are only at a displacement of an even number of sub-bands from said first sub-band in the extended-band frequency range, and wherein the displacement measured by the displacement vector is an even number of sub-bands, wherein the sub-band shape matching is performed using a vector quantization process; and wherein the determining one or more scale parameters comprises; determining one or more scale parameters for the frequency-domain data at one or more audio blocks; determining one or more anchor points of audio block time windows for interpolating in time the one or more scale parameters at time windows between the anchor points; and explicitly coding the scale factors at the anchor points into an output encoded audio stream without explicitly coding the interpolated scale parameters at time windows between the anchor points.
-
Specification