EFFICIENT CODING OF DIGITAL MEDIA SPECTRAL DATA USING WIDE-SENSE PERCEPTUAL SIMILARITY
First Claim
1. An audio encoding method, comprising:
- transforming an input audio signal block into a set of spectral coefficients;
dividing the spectral coefficients into plural bands;
coding values of the spectral coefficients of at least one of the bands in an output bit stream; and
for at least one of the other bands, coding the at least one other band in the output bit-stream as a scaled version of a shape of a portion of the at least one of the bands coded as spectral coefficient values, wherein the coding the at least one other band comprises coding the other band using a scale parameter and a shape parameter, wherein the shape parameter comprises a motion vector and indicates the portion of the at least one of the bands coded as spectral coefficient values, and wherein the scale parameter is a scaling factor to scale the portion.
2 Assignments
0 Petitions
Accused Products
Abstract
Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.
-
Citations
20 Claims
-
1. An audio encoding method, comprising:
-
transforming an input audio signal block into a set of spectral coefficients; dividing the spectral coefficients into plural bands; coding values of the spectral coefficients of at least one of the bands in an output bit stream; and for at least one of the other bands, coding the at least one other band in the output bit-stream as a scaled version of a shape of a portion of the at least one of the bands coded as spectral coefficient values, wherein the coding the at least one other band comprises coding the other band using a scale parameter and a shape parameter, wherein the shape parameter comprises a motion vector and indicates the portion of the at least one of the bands coded as spectral coefficient values, and wherein the scale parameter is a scaling factor to scale the portion. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. One or more computer-readable storage media comprising instructions configurable to cause a computer to perform an audio decoding method for an encoded audio bitstream, the method comprising:
-
decoding one or more baseband spectral coefficients from the encoded audio bitstream; decoding one or more extended band spectral coefficients by; copying one or more identified baseband spectral coefficients according to a shape parameter, wherein the shape parameter comprises a motion vector identifying one or more baseband spectral coefficients to be copied; and scaling the copied one or more identified baseband spectral coefficients according to a scale parameter. - View Dependent Claims (13, 14, 15, 16, 17)
-
-
18. A computing device comprising:
-
a processing unit; one or more computer-readable storage media comprising instructions configured to cause the processing unit to perform an audio decoding method for an encoded audio bitstream, the method comprising; decoding one or more baseband spectral coefficients from the encoded audio bitstream; decoding a first band of extended spectral coefficients from the encoded audio bitstream by; decoding, from the encoded audio bitstream, a scale factor for the first band; copying one or more identified baseband spectral coefficients according to a first shape parameter, wherein the shape parameter comprises a motion vector identifying one or more baseband spectral coefficients to be copied, the identified one or more baseband spectral coefficients describing a shape of a spectral band; and scaling the copied one or more identified baseband spectral coefficients according to the decoded scale factor for the first band; decoding a second band of extended spectral coefficients from the encoded audio bitstream by; decoding, from the encoded audio bitstream, a scale factor for the second band; copying one or more vectors from a codebook according to a second shape parameter; and scaling the copied one or more vectors from the codebook according to the decoded scale factor for the second band; and performing an inverse transform on the decoded one or more baseband spectral coefficients and the decoded one or more extended band spectral coefficients to make a reconstructed audio signal. - View Dependent Claims (19, 20)
-
Specification