Low-complexity encoding/decoding of quantized MDCT spectrum in scalable speech and audio codecs
First Claim
1. A method for encoding in a scalable speech and audio codec, comprising:
- obtaining a residual signal from a Code Excited Linear Prediction (CELP)-based encoding layer, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal;
transforming the residual signal at a Discrete Cosine Transform (DCT)-type transform layer to obtain a corresponding transform spectrum having a plurality of spectral lines; and
encoding the transform spectrum spectral lines using a combinatorial position coding technique.
1 Assignment
0 Petitions
Accused Products
Abstract
A scalable speech and audio codec is provided that implements combinatorial spectrum encoding. A residual signal is obtained from a Code Excited Linear Prediction (CELP)-based encoding layer, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal. The residual signal is transformed at a Discrete Cosine Transform (DCT)-type transform layer to obtain a corresponding transform spectrum having a plurality of spectral lines. The transform spectrum spectral lines are transformed using a combinatorial position coding technique.
The combinatorial position coding technique includes generating a lexicographical index for a selected subset of spectral lines, where each lexicographic index represents one of a plurality of possible binary strings representing the positions of the selected subset of spectral lines. The lexicographical index represents non-zero spectral lines in a binary string in fewer bits than the length of the binary string.
-
Citations
40 Claims
-
1. A method for encoding in a scalable speech and audio codec, comprising:
-
obtaining a residual signal from a Code Excited Linear Prediction (CELP)-based encoding layer, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal; transforming the residual signal at a Discrete Cosine Transform (DCT)-type transform layer to obtain a corresponding transform spectrum having a plurality of spectral lines; and encoding the transform spectrum spectral lines using a combinatorial position coding technique. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A scalable speech and audio encoder device, comprising:
-
a Discrete Cosine Transform (DCT)-type transform layer module adapted to obtain a residual signal from a Code Excited Linear Prediction (CELP)-based encoding layer module, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal; and transform the residual signal at to obtain a corresponding transform spectrum having a plurality of spectral lines; and a combinatorial spectrum encoder adapted to encode the transform spectrum spectral lines using a combinatorial position coding technique. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. A scalable speech and audio encoder device, comprising:
-
means for obtaining a residual signal from a Code Excited Linear Prediction (CELP)-based encoding layer, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal; means for transforming the residual signal at a Discrete Cosine Transform (DCT)-type transform layer to obtain a corresponding transform spectrum having a plurality of spectral lines; and means for encoding the transform spectrum spectral lines using a combinatorial position coding technique.
-
-
25. A processor including a scalable speech and audio encoding circuit adapted to:
-
obtain a residual signal from a Code Excited Linear Prediction (CELP)-based encoding layer, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal; transform the residual signal at a Discrete Cosine Transform (DCT)-type transform layer to obtain a corresponding transform spectrum having a plurality of spectral lines; and encode the transform spectrum spectral lines using a combinatorial position coding technique.
-
-
26. A machine-readable medium comprising instructions operational for scalable speech and audio encoding, which when executed by one or more processors causes the processors to:
-
obtain a residual signal from a Code Excited Linear Prediction (CELP)-based encoding layer, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal; transform the residual signal at a Discrete Cosine Transform (DCT)-type transform layer to obtain a corresponding transform spectrum having a plurality of spectral lines; and encode the transform spectrum spectral lines using a combinatorial position coding technique.
-
-
27. A method for scalable speech and audio decoding, comprising:
-
obtaining an index representing a plurality of transform spectrum spectral lines of a residual signal, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal from a Code Excited Linear Prediction (CELP)-based encoding layer; decoding the index by reversing a combinatorial position coding technique used to encode the plurality of transform spectrum spectral lines; and synthesizing a version of the residual signal using the decoded plurality of transform spectrum spectral lines at an Inverse Discrete Cosine Transform (IDCT)-type inverse transform layer. - View Dependent Claims (28, 29, 30, 31, 32, 33)
-
-
34. A scalable speech and audio decoder device, comprising:
-
a combinatorial spectrum decoder adapted to obtain an index representing a plurality of transform spectrum spectral lines of a residual signal, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal from a Code Excited Linear Prediction (CELP)-based encoding layer; decode the index by reversing a combinatorial position coding technique used to encode the plurality of transform spectrum spectral lines; and an Inverse Discrete Cosine Transform (IDCT)-type inverse transform layer module adapted to synthesize a version of the residual signal using the decoded plurality of transform spectrum spectral lines. - View Dependent Claims (35, 36, 37)
-
-
38. A scalable speech and audio decoder device, comprising:
-
means for obtaining an index representing a plurality of transform spectrum spectral lines of a residual signal, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal from a Code Excited Linear Prediction (CELP)-based encoding layer; means for decoding the index by reversing a combinatorial position coding technique used to encode the plurality of transform spectrum spectral lines; and means for synthesizing a version of the residual signal using the decoded plurality of transform spectrum spectral lines at an Inverse Discrete Cosine Transform (IDCT)-type inverse transform layer.
-
-
39. A processor including a scalable speech and audio decoding circuit adapted to:
-
obtain an index representing a plurality of transform spectrum spectral lines of a residual signal, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal from a Code Excited Linear Prediction (CELP)-based encoding layer; decode the index by reversing a combinatorial position coding technique used to encode the plurality of transform spectrum spectral lines; and synthesize a version of the residual signal using the decoded plurality of transform spectrum spectral lines at an Inverse Discrete Cosine Transform (IDCT)-type inverse transform layer.
-
-
40. A machine-readable medium comprising instructions operational for scalable speech and audio decoding, which when executed by one or more processors causes the processors to:
-
obtain an index representing a plurality of transform spectrum spectral lines of a residual signal, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal from a Code Excited Linear Prediction (CELP)-based encoding layer; decode the index by reversing a combinatorial position coding technique used to encode the plurality of transform spectrum spectral lines; and synthesize a version of the residual signal using the decoded plurality of transform spectrum spectral lines at an Inverse Discrete Cosine Transform (IDCT)-type inverse transform layer.
-
Specification