FLEXIBLE FREQUENCY AND TIME PARTITIONING IN PERCEPTUAL TRANSFORM CODING OF AUDIO
First Claim
1. A method of compressively encoding audio, the method comprising:
- applying a frequency transform to blocks of input audio data to produce sets of spectral coefficients;
quantizing the sets of spectral coefficients;
encoding quantized spectral coefficients in a base frequency region of the sets up to an upper bound frequency position in a compressed audio bit stream;
determining a band structure for partitioning spectral holes and an extension region above the upper bound frequency position into bands for vector quantization coding, where the spectral holes are runs of consecutive spectral coefficients in the base frequency region were quantized to a zero value;
wherein said determining a band structure for partitioning in the case of spectral holes comprises;
detecting any spectral holes in the base frequency region having a width larger than a minimum hole size threshold; and
for a detected spectral hole, determining a number of bands having a band size not exceeding a maximum band size threshold and that evenly divide the detected spectral hole; and
encoding spectral coefficients at the frequency positions of the spectral holes and the extension region using vector quantization coding in the compressed audio bit stream.
2 Assignments
0 Petitions
Accused Products
Abstract
An audio encoder/decoder performs band partitioning for vector quantization encoding of spectral holes and missing high frequencies that result from quantization when encoding at low bit rates. The encoder/decoder determines a band structure for spectral holes based on two threshold parameters: a minimum hole size threshold and a maximum band size threshold. Spectral holes wider than the minimum hole size threshold are partitioned evenly into bands not exceeding the maximum band size threshold in size. Such hole filling bands are configured up to a preset number of hole filling bands. The bands for missing high frequencies are then configured by dividing the high frequency region into bands having binary-increasing, linearly-increasing or arbitrarily-configured band sizes up to a maximum overall number of bands.
-
Citations
11 Claims
-
1. A method of compressively encoding audio, the method comprising:
-
applying a frequency transform to blocks of input audio data to produce sets of spectral coefficients; quantizing the sets of spectral coefficients; encoding quantized spectral coefficients in a base frequency region of the sets up to an upper bound frequency position in a compressed audio bit stream; determining a band structure for partitioning spectral holes and an extension region above the upper bound frequency position into bands for vector quantization coding, where the spectral holes are runs of consecutive spectral coefficients in the base frequency region were quantized to a zero value; wherein said determining a band structure for partitioning in the case of spectral holes comprises; detecting any spectral holes in the base frequency region having a width larger than a minimum hole size threshold; and for a detected spectral hole, determining a number of bands having a band size not exceeding a maximum band size threshold and that evenly divide the detected spectral hole; and encoding spectral coefficients at the frequency positions of the spectral holes and the extension region using vector quantization coding in the compressed audio bit stream. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method of compressively encoding audio, the method comprising:
-
applying a frequency transform having a first window size to input audio data to produce first sets of spectral coefficients; applying a frequency transform having a second window size to the input audio data to produce second sets of spectral coefficients; quantizing at least a first spectrum region of the first sets of spectral coefficients; encoding the quantized spectral coefficients in the first spectrum region into a compressed audio bit stream; and performing vector quantization coding of the second sets of spectral coefficients in a second spectrum region into the compressed audio bit stream. - View Dependent Claims (9, 10, 11)
-
Specification