Voicing index controls for CELP speech coding
First Claim
1. A method of improving synthesized speech quality comprising:
- obtaining an input speech signal;
coding said input speech using a Code Excited Linear Prediction coder to generate code parameters for synthesis of said input speech; and
using a voicing index representing a characteristic of said input speech in enhancing said synthesis of said input speech.
2 Assignments
0 Petitions
Accused Products
Abstract
An approach for improving quality of speech synthesized using analysis-by-synthesis (ABS) coders is presented. An unstable perceptual quality in analysis-by-synthesis type speech coding (e.g. CELP) may occur because the periodicity degree in a voiced speech signal may vary significantly for different segments of the voiced speech. Thus, the present invention uses a voicing index, which may indicate the periodicity degree of the speech signal, to control and improve ABS type speech coding. The voicing index may be used to improve the quality stability by controlling encoder and/or decoder in: fixed-codebook short-term enhancement including the spectrum tilt; perceptual weighting filter; sub-fixed codebook determination; LPC interpolation; fixed-codebook pitch enhancement; post-pitch enhancement; noise injection into the high-frequency band at decoder; LTP Sinc window; signal decomposition, etc.
-
Citations
45 Claims
-
1. A method of improving synthesized speech quality comprising:
-
obtaining an input speech signal;
coding said input speech using a Code Excited Linear Prediction coder to generate code parameters for synthesis of said input speech; and
using a voicing index representing a characteristic of said input speech in enhancing said synthesis of said input speech. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method of improving synthesized speech quality comprising:
-
obtaining code parameters of an input speech signal;
obtaining a voicing index for use in enhancing synthesis of said input speech signal from said code parameters; and
processing said code parameters through a Code Excited Linear Prediction coder using information provided by said voicing index to generate a synthesized version of said input speech signal. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. An apparatus for improving synthesized speech quality comprising:
-
an input speech signal;
a Code Excited Linear Prediction coder for coding said input speech signal to generate code parameters for synthesis of said input speech; and
a voicing index having a characteristic of said input speech for use in enhancing said synthesis of said input speech. - View Dependent Claims (23, 24, 25, 26, 27)
-
-
28. An apparatus for improving synthesized speech quality comprising:
-
a set of code parameters of an input speech signal;
a voicing index for use in enhancing synthesis of said input speech signal from said code parameters; and
a Code Excited Linear Prediction coder using said code parameters and information provided by said voicing index to generate a synthesized version of said input speech signal. - View Dependent Claims (29, 30, 31, 32, 33)
-
-
34. A method of improving synthesized speech quality comprising:
-
generating a plurality of frames from an input speech signal;
coding each frame of said plurality of frames using a Code Excited Linear Prediction coder to generate code parameters for synthesis of said each frame of said input speech; and
transmitting a voicing index having a plurality of bits indicative of a classification of said each frame of said input speech. - View Dependent Claims (35, 36, 37, 38, 39)
-
-
40. A method of improving synthesized speech quality comprising:
-
receiving a frame of an input speech signal, said frame having a plurality of code parameters and a voicing index, wherein said voicing index comprises a plurality of bits;
determining a classification for said frame of said input speech signal from said plurality of bits of said voicing index; and
decoding said frame using a Code Excited Linear Prediction coder based on said classification to synthesize said input speech. - View Dependent Claims (41, 42, 43, 44, 45)
-
Specification