Quantization matrices for digital audio
First Claim
1. In an audio encoder, a method comprising:
- processing a group of frequency coefficients as critical bands according to an auditory model to generate an excitation pattern; and
computing a quantization matrix directly from and in proportion to the excitation pattern, the quantization matrix including weights for quantization bands that partition the group, wherein the quantization bands differ from the critical bands.
2 Assignments
0 Petitions
Accused Products
Abstract
Quantization matrices facilitate digital audio encoding and decoding. An audio encoder generates and compresses quantization matrices; an audio decoder decompresses and applies the quantization matrices. The invention includes several techniques and tools, which can be used in combination or separately. For example, the audio encoder can generate quantization matrices from critical band patterns for blocks of audio data. The encoder can compute the quantization matrices directly from the critical band patterns, which can be computed from the same audio data that is being compressed. The audio encoder/decoder can use different modes for generating/applying quantization matrices depending on the coding channel mode of multi-channel audio data. The audio encoder/decoder can use different compression/decompression modes for the quantization matrices, including a parametric compression/decompression mode.
185 Citations
66 Claims
-
1. In an audio encoder, a method comprising:
-
processing a group of frequency coefficients as critical bands according to an auditory model to generate an excitation pattern; and
computing a quantization matrix directly from and in proportion to the excitation pattern, the quantization matrix including weights for quantization bands that partition the group, wherein the quantization bands differ from the critical bands. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer-readable medium encoded with computer-executable instructions for causing a computer programmed thereby to perform a method comprising:
-
receiving a group of frequency coefficients;
processing the group of frequency coefficients as plural critical bands according to a model of human auditory perception to generate pattern information for the group of frequency coefficients;
generating a quantization matrix for the group of frequency coefficients based at least in part upon the pattern information for the group of frequency coefficients, the quantization matrix including plural quantization bands partitioning the group of frequency coefficients, each of the plural quantization bands having a weight in the quantization matrix, wherein the plural quantization bands are different than the plural critical bands; and
applying the quantization matrix to the group of frequency coefficients. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. An audio encoder comprising:
-
a modeler for processing audio data according to a model of human auditory perception and for generating pattern information for the audio data, wherein each of plural critical bands spectrally partitions the audio data in the model of human auditory perception; and
a program module for computing a set of plural weighting factors from and in proportion to the pattern information for the audio data, wherein each of the set of plural weighting factors comprises a weight for a different one of plural quantization bands that spectrally partition the audio data, wherein the quantization bands are different than the critical bands. - View Dependent Claims (19, 20, 21, 22, 23)
-
-
24. A computer-readable medium having encoded therein computer-executable instructions for causing a computer programmed thereby to perform a method of generating quantization matrices for plural blocks, wherein each of the plural blocks has one of plural available block sizes, the method comprising:
for each of the plural blocks, normalizing the block;
computing pattern information for the normalized block in a block size-independent manner; and
generating a quantization matrix based upon the pattern information. - View Dependent Claims (25, 26, 27)
-
28. An apparatus comprising:
-
a multi-channel transformer operable to output multi-channel audio data in jointly coded channels; and
a program module for generating a single quantization matrix for weighting all of the jointly coded channels. - View Dependent Claims (29, 30, 31)
-
-
32. A computer-readable medium encoded with computer-executable instructions for causing a computer programmed thereby to perform a method comprising:
-
receiving first audio data in a first coding channel;
receiving second audio data in a second coding channel;
generating one or more quantization matrices for the first and second coding channels, wherein the generating comprises switching between different quantization matrix generation techniques based upon whether the first and second coding channels are joint coding channels; and
outputting the one or more quantization matrices. - View Dependent Claims (33, 34, 35, 36, 37, 38)
-
-
39. A computer-readable medium encoded with computer-executable instructions for causing a computer programmed thereby to perform a method comprising:
-
receiving one or more identical quantization matrices for first and second jointly coded channels of audio data, wherein each of the one or more identical quantization matrices is based at least in part upon an aggregated pattern for multiple channels of audio information; and
applying the one or more identical quantization matrices to the first and second jointly coded channels of audio data. - View Dependent Claims (40, 41)
-
-
42. An apparatus comprising:
-
a program module for applying one or more quantization matrices to multi-channel audio data in first and second coding channels in a coding channel mode-dependent manner, wherein the program module switches between plural available matrix application techniques based upon whether the first and second coding channels are joint coding channels; and
an inverse multi-channel transformer operable to switch between plural coding channel modes, a first coding channel mode of the plural coding channel modes for receiving the first and second coding channels as joint coding channels, a second channel mode of the plural coding channel modes for receiving the first and second coding channels as independent coding channels. - View Dependent Claims (43, 44)
-
-
45. A computer-readable medium encoded with computer-executable instructions for causing a computer programmed thereby to perform a method comprising:
-
processing at least one set of weighting factors according to a parametric model to switch between a direct representation and a parametric representation of the at least one set of weighting factors, wherein the parametric representation of the at least one set of weighting factors accounts for audibility of distortion according to a model of human auditory perception; and
outputting a result of the processing. - View Dependent Claims (46, 47, 48, 49, 50)
-
-
51. In an audio encoder, a method comprising:
-
receiving a band weight representation of a quantization matrix; and
compressing the band weight representation of the quantization matrix using linear predictive coding, wherein the compressing includes computing pseudo-autocorrelation values for the quantization matrix. - View Dependent Claims (52, 53, 54, 55, 56)
-
-
57. A computer-readable medium encoded with computer-executable instructions for causing a computer programmed thereby to perform a method comprising:
-
receiving a parametric representation of a quantization matrix, the quantization matrix including weights for bands of a group of frequency coefficients, wherein the parametric representation accounts for audibility of distortion according to a model of human auditory perception; and
decompressing the parametric representation of the quantization matrix, thereby producing a direct representation of the quantization matrix. - View Dependent Claims (58)
-
-
59. An audio encoder comprising:
-
a weighter for generating one or more sets of weighting factors, each of the one or more sets of weighting factors including weights for bands of spectral audio data; and
a program module for compressing the one or more sets of weighting factors according to a parametric model of compression, wherein the parametric model includes computing pseudo-autocorrelation values. - View Dependent Claims (60, 61)
-
-
62. A method of compressing a quantization matrix in an audio encoder comprising:
-
compressing a quantization matrix using a compression mode selected from among plural available compression modes, the plural available compression modes including a direct compression mode and a parametric compression mode, wherein the parametric compression mode accounts for audibility of distortion according to an auditory model; and
outputting the compressed quantization matrix. - View Dependent Claims (63, 64)
-
-
65. A computer-readable medium encoded with computer-executable instructions for causing a computer programmed thereby to perform a method of decompressing a quantization matrix in an audio decoder, the method comprising:
-
receiving a compressed quantization matrix; and
decompressing the compressed quantization matrix using a decompression mode selected from among plural available decompression modes, the plural available decompression modes including a direct decompression mode and a parametric decompression mode, the parametric decompression mode for decompressing a quantization matrix compressed according to a parametric compression mode that accounts for audibility of distortion according to an auditory model. - View Dependent Claims (66)
-
Specification