Efficient coding of digital media spectral data using wide-sense perceptual similarity
Abstract
Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.
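The parameterization described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the function names (`rms`, `normalize`, `encode_band`), the use of RMS energy as the scale factor, and the least-squared-error match on unit-energy shapes are hypothetical choices. The abstract only says that the scale factor may represent the energy in the band and that the motion vector points to a portion of the already coded spectrum.

```python
import math


def rms(values):
    """Root-mean-square energy of a list of spectral coefficients."""
    return math.sqrt(sum(v * v for v in values) / len(values))


def normalize(values):
    """Scale coefficients to unit RMS so only the spectral shape remains."""
    energy = rms(values)
    return [v / energy for v in values] if energy > 0 else list(values)


def encode_band(baseband, band):
    """Return (motion_vector, scale) for one omitted sub-band.

    motion_vector: offset into the already coded baseband whose
    unit-energy shape is closest (assumed metric: squared error)
    to the unit-energy shape of `band`.
    scale: RMS energy of `band` (assumed definition of the scale factor).
    """
    target = normalize(band)
    best_offset, best_error = 0, float("inf")
    # Slide a window of the band's width across the coded baseband.
    for offset in range(len(baseband) - len(band) + 1):
        candidate = normalize(baseband[offset:offset + len(band)])
        error = sum((c - t) ** 2 for c, t in zip(candidate, target))
        if error < best_error:
            best_offset, best_error = offset, error
    return best_offset, rms(band)
```

For example, if `band` is a half-amplitude copy of `baseband[2:6]`, `encode_band` returns offset 2 together with the band's RMS energy, so only two numbers need to be written to the bitstream for that sub-band instead of its individual coefficients.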
38 Claims
1. An audio encoding method, comprising:
    with a computer,
    transforming an input audio signal block into a set of spectral coefficients,
    dividing the spectral coefficients into plural bands,
    coding values of the spectral coefficients of at least one of the bands in an output bitstream,
    searching the at least one of the bands coded as spectral coefficient values for a portion similar to at least one other band of the plural bands, and
    coding the at least one other band in the output bitstream as a scaled version of a shape of the portion of the at least one of the bands coded as spectral coefficient values,
    wherein the coding the at least one other band comprises coding the at least one other band using a scale parameter and a shape parameter, the shape parameter comprising a motion vector based on results of the searching that indicates the portion of the at least one of the bands coded as spectral coefficient values, and wherein the scale parameter is a scaling factor to scale the portion.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. One or more computer-readable storage devices or memory comprising instructions configurable to cause a computer to perform an audio decoding method for an encoded audio bitstream, the method comprising:
    decoding baseband spectral coefficients from the encoded audio bitstream;
    decoding a shape parameter from the encoded audio bitstream, the shape parameter comprising a motion vector identifying one or more baseband spectral coefficients, the motion vector including a value that was set as a result of searching the baseband spectral coefficients for a portion of the baseband spectral coefficients similar to one or more extended band spectral coefficients; and
    decoding the one or more extended band spectral coefficients by:
        copying the one or more identified baseband spectral coefficients according to the shape parameter, and
        scaling the copied one or more identified baseband spectral coefficients according to a scale parameter.
Dependent claims: 13, 14, 15, 16, 17
18. A computing device comprising:
    a processing unit;
    one or more computer-readable storage media comprising instructions configured to cause the processing unit to perform an audio decoding method for an encoded audio bitstream, the method comprising:
        decoding baseband spectral coefficients from the encoded audio bitstream;
        decoding a first band of extended band spectral coefficients from the encoded audio bitstream by:
            decoding, from the encoded audio bitstream, a scale factor for the first band;
            copying one or more identified baseband spectral coefficients according to a first shape parameter, wherein the first shape parameter comprises a motion vector identifying one or more baseband spectral coefficients to be copied, the identified one or more baseband spectral coefficients describing a shape of a spectral band, the motion vector including a value that was set as a result of searching the baseband spectral coefficients for a portion of the baseband spectral coefficients similar to one or more of the first band of extended band spectral coefficients; and
            scaling the copied one or more identified baseband spectral coefficients according to the decoded scale factor for the first band;
        decoding a second band of the extended band spectral coefficients from the encoded audio bitstream by:
            decoding, from the encoded audio bitstream, a scale factor for the second band;
            copying one or more vectors from a codebook according to a second shape parameter; and
            scaling the copied one or more vectors from the codebook according to the decoded scale factor for the second band; and
        performing an inverse transform on the decoded baseband spectral coefficients and the decoded extended band spectral coefficients to make a reconstructed audio signal.
Dependent claims: 19, 20
21. One or more computer-readable storage devices or memory comprising instructions configurable to cause a computer to perform an audio encoding method, the method comprising:
    transforming an input audio signal block into a set of spectral coefficients,
    dividing the spectral coefficients into plural bands,
    coding values of the spectral coefficients of at least one of the bands in an output bitstream,
    searching the at least one of the bands coded as spectral coefficient values for a portion similar to at least one other band of the plural bands, and
    coding the at least one other band in the output bitstream as a scaled version of a shape of the portion of the at least one of the bands coded as spectral coefficient values,
    wherein the coding the at least one other band comprises coding the at least one other band using a scale parameter and a shape parameter, the shape parameter comprising a motion vector based on results of the searching that indicates the portion of the at least one of the bands coded as spectral coefficient values, and wherein the scale parameter is a scaling factor to scale the portion.
Dependent claims: 22, 23, 24, 25, 26, 27, 28, 29, 30, 31
32. A computing device comprising:
    a processing unit;
    one or more computer-readable storage media comprising instructions configured to cause the processing unit to perform an audio encoding method, the method comprising:
        transforming an input audio signal block into a set of spectral coefficients,
        dividing the spectral coefficients into plural bands,
        coding values of the spectral coefficients of at least one of the bands in an output bitstream,
        searching the at least one of the bands coded as spectral coefficient values for a portion similar to at least one other band of the plural bands, and
        coding the at least one other band in the output bitstream as a scaled version of a shape of the portion of the at least one of the bands coded as spectral coefficient values,
        wherein the coding the at least one other band comprises coding the at least one other band using a scale parameter and a shape parameter, the shape parameter comprising a motion vector based on results of the searching that indicates the portion of the at least one of the bands coded as spectral coefficient values, and wherein the scale parameter is a scaling factor to scale the portion.
Dependent claims: 33, 34, 35, 36, 37, 38
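The decoding steps recited in claims 12 and 18 (copy a shape, then scale it) can be illustrated with a short Python sketch. Everything here is an assumption made for illustration: the function name `decode_extended_band`, the tagged-tuple representation of the shape parameter, the codebook layout, and the choice to normalize the copied shape to unit RMS before applying the scale factor. The claims themselves only require copying the identified coefficients or codebook vectors and scaling them.

```python
import math


def normalize(values):
    """Scale coefficients to unit RMS so only the spectral shape remains."""
    energy = math.sqrt(sum(v * v for v in values) / len(values))
    return [v / energy for v in values] if energy > 0 else list(values)


def decode_extended_band(baseband, shape_param, scale, width, codebook=()):
    """Rebuild one extended band from a shape parameter and a scale factor.

    shape_param is a hypothetical (kind, index) pair:
      ("motion_vector", offset) copies `width` decoded baseband coefficients
      starting at `offset`;
      ("codebook", i) copies the first `width` entries of codebook vector i.
    """
    kind, index = shape_param
    if kind == "motion_vector":
        shape = baseband[index:index + width]
    elif kind == "codebook":
        shape = codebook[index][:width]
    else:
        raise ValueError(f"unknown shape parameter kind: {kind!r}")
    if len(shape) != width:
        raise ValueError("shape source is shorter than the band width")
    # Assumed convention: the scale factor is the target RMS of the band.
    return [scale * s for s in normalize(shape)]
```

In a full decoder along the lines of claim 18, the bands rebuilt this way would be appended to the decoded baseband coefficients before the inverse transform produces the reconstructed audio signal.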
Specification