Spectral magnitude representation for multi-band excitation speech coders
First Claim
1. A method for encoding a speech signal into a plurality of digital bits from which the speech signal can later be synthesized, the method comprising the steps of:
- processing the speech signal to divide the signal into a plurality of speech frames, each of the speech frames representing a time interval of the speech signal;
processing the speech frames to determine voicing information for a plurality of frequency bands of the speech frames;
processing the speech frames to determine spectral magnitudes representative of the magnitudes of the spectrum at determined frequencies across the frequency bands, andquantizing and encoding the spectral magnitudes and the voicing information for subsequent use in decoding and synthesizing the speech signal,wherein the processing of the speech frames to determine spectral magnitudes and the quantizing and encoding of the spectral magnitudes is done in such a manner that spectral magnitudes independent of the voicing information are available for later synthesizing.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for encoding a speech signal into digital bits including the steps of dividing the speech signal into speech frames representing time intervals of the speech signal, determining voicing information for frequency bands of the speech frames, and determining spectral magnitudes representative of the magnitudes of the spectrum at determined frequencies across the frequency bands. The method further includes quantizing and encoding the spectral magnitudes and the voicing information. The steps of determining, quantizing and encoding the spectral magnitudes is done is such a manner that the spectral magnitudes independent of voicing information are available for later synthesizing.
120 Citations
19 Claims
-
1. A method for encoding a speech signal into a plurality of digital bits from which the speech signal can later be synthesized, the method comprising the steps of:
-
processing the speech signal to divide the signal into a plurality of speech frames, each of the speech frames representing a time interval of the speech signal; processing the speech frames to determine voicing information for a plurality of frequency bands of the speech frames; processing the speech frames to determine spectral magnitudes representative of the magnitudes of the spectrum at determined frequencies across the frequency bands, and quantizing and encoding the spectral magnitudes and the voicing information for subsequent use in decoding and synthesizing the speech signal, wherein the processing of the speech frames to determine spectral magnitudes and the quantizing and encoding of the spectral magnitudes is done in such a manner that spectral magnitudes independent of the voicing information are available for later synthesizing. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
2. Apparatus for encoding a speech signal into a plurality of digital bits from which the speech signal can later be synthesized, the apparatus comprising:
-
means for processing the speech signal to divide the signal into a plurality of speech frames, each of the speech frames representing a time interval of the speech signal; means for processing the speech frames to determine voicing information for a plurality of frequency bands of the speech frames; means for processing the speech frames to determine spectral magnitudes representative of the magnitudes of the spectrum at determined frequencies across the frequency bands, and means for quantizing and encoding the spectral magnitudes and the voicing information for subsequent use in decoding and synthesizing the speech signal, wherein the processing of the speech frames to determine spectral magnitudes and the quantizing and encoding of the spectral magnitudes is done in such a manner that spectral magnitudes independent of the voicing information are available for later synthesizing.
-
-
15. A method for encoding a speech signal into a plurality of digital bits from which the speech signal can later be synthesized, the method comprising the steps of:
-
processing the speech signal to divide the signal into a plurality of speech frames, each of the speech frames representing a time interval of the speech signal; processing the speech frames to determine voicing information for a plurality of frequency bands of the speech frames; processing the speech frames to determine spectral magnitudes representative of the magnitudes of the spectrum at determined frequencies across the frequency bands, and quantizing and encoding the spectral magnitudes and the voicing information for subsequent use in decoding and synthesizing the speech signal, wherein the processing to determine spectral magnitudes includes a spectral transformation of the speech frames from time domain samples to frequency samples, and wherein the spectral magnitudes are formed as weighted sums of the frequency samples. - View Dependent Claims (17, 18, 19)
-
-
16. Apparatus for encoding a speech signal into a plurality of digital bits from which the speech signal can later be synthesized, the apparatus comprising:
-
means for processing the speech signal to divide the signal into a plurality of speech frames, each of the speech frames representing a time interval of the speech signal, means for processing the speech frames to determine voicing information for a plurality of frequency bands of the speech frames; means for processing the speech frames to determine spectral magnitudes representative of the magnitudes of the spectrum at determined frequencies across the frequency bands, and means for quantizing and encoding the spectral magnitudes and the voicing information for subsequent use in decoding and synthesizing the speech signal, wherein the processing to determine spectral magnitudes includes a spectral transformation of the speech frames from time domain samples to frequency samples, and wherein the spectral magnitudes are formed as weighted sums of the frequency samples.
-
Specification