Perceptual speech coding using prediction residuals, having harmonic magnitude codebook for voiced and waveform codebook for unvoiced frames
First Claim
Patent Images
1. A speech encoding method for an input speech signal divided on the time axis into blocks as units and for encoding the divided signal on a block-by-block basis, comprising the steps of:
- finding short-term prediction residuals at least for a voiced portion of the input speech signal;
finding sinusoidal analytic encoding parameters based on the short-term prediction residuals thus found;
performing perceptually weighted vector quantization for each harmonic magnitude on the sinusoidal analytic encoding parameters to produce an encoded voiced portion of the input speech signal; and
encoding an unvoiced portion of the input speech signal by waveform encoding to produce an encoded unvoiced portion of the input speech signal.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech encoding method and apparatus for encoding an input speech signal on a block-by-block or frame-by-frame basis wherein short-term prediction residuals are found and then sinusoidal analytic encoding parameters are produced based on those short-term prediction residuals. Perceptually weighted vector quantization is performed for voiced blocks or frames by encoding their sinusoidal frequency or analytic harmonic magnitudes and, in the case of unvoiced blocks or frames, the time waveforms of the unvoiced blocks are encoded.
46 Citations
8 Claims
-
1. A speech encoding method for an input speech signal divided on the time axis into blocks as units and for encoding the divided signal on a block-by-block basis, comprising the steps of:
-
finding short-term prediction residuals at least for a voiced portion of the input speech signal; finding sinusoidal analytic encoding parameters based on the short-term prediction residuals thus found; performing perceptually weighted vector quantization for each harmonic magnitude on the sinusoidal analytic encoding parameters to produce an encoded voiced portion of the input speech signal; and encoding an unvoiced portion of the input speech signal by waveform encoding to produce an encoded unvoiced portion of the input speech signal. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A speech encoding apparatus receiving an input speech signal divided on the time axis into blocks for encoding the divided signal on a block-by-block basis, comprising:
-
means for finding short-term prediction residuals of at least a voiced portion of the input speech signal; means for finding sinusoidal analytic encoding parameters including a spectral harmonic magnitude envelope from the short-term prediction residuals thus found; means for performing perceptually weighted vector quantization at least on the spectral harmonic magnitude envelope; and means for encoding an unvoiced portion of the input speech signal by waveform encoding.
-
-
7. A speech encoding apparatus receiving an input speech signal divided on the time axis into blocks for encoding the signal on a block-by-block basis, comprising:
-
means for finding short-term prediction residuals at least for a voiced portion of the input speech signal; means for finding linear spectral pairs of encoding parameters including a spectral magnitude harmonic envelope from the short-term prediction residuals; and means performing perceptually weighted multiple-stage vector quantization on the linear spectral pairs of encoding parameters limited in the frequency axis.
-
-
8. A portable radio terminal device comprising:
-
amplifying means for amplifying input speech signals; A/D converting means for A/D conversion of the amplified speech signals; speech encoding means for encoding a speech signal output from said A/D converting means; transmission path encoding means for channel encoding the encoded speech signal; modulating means for modulating an output of said transmission path encoding means; D/A converting means for D/A converting the resulting modulated signal to an analog signal; and amplifier means for amplifying the analog signal from said D/A converting means and supplying the resulting amplified signal to an antenna, wherein said speech encoding means includes means for finding a short-term prediction residual of at least a voiced portion of said input speech signal; means for finding sinusoidal analytic encoding parameters from the short-term prediction residuals thus found; means for performing perceptually weighted vector quantization on said sinusoidal analytic encoding parameters; and means for encoding an unvoiced portion of said input speech signal by waveform encoding.
-
Specification