Voicing index controls for CELP speech coding

US 20040181411A1
Filed: 03/11/2004
Published: 09/16/2004
Est. Priority Date: 03/15/2003
Status: Abandoned Application

First Claim

Patent Images

1. A method of improving synthesized speech quality comprising:

obtaining an input speech signal;

coding said input speech using a Code Excited Linear Prediction coder to generate code parameters for synthesis of said input speech; and

using a voicing index representing a characteristic of said input speech in enhancing said synthesis of said input speech.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

An approach for improving quality of speech synthesized using analysis-by-synthesis (ABS) coders is presented. An unstable perceptual quality in analysis-by-synthesis type speech coding (e.g. CELP) may occur because the periodicity degree in a voiced speech signal may vary significantly for different segments of the voiced speech. Thus, the present invention uses a voicing index, which may indicate the periodicity degree of the speech signal, to control and improve ABS type speech coding. The voicing index may be used to improve the quality stability by controlling encoder and/or decoder in: fixed-codebook short-term enhancement including the spectrum tilt; perceptual weighting filter; sub-fixed codebook determination; LPC interpolation; fixed-codebook pitch enhancement; post-pitch enhancement; noise injection into the high-frequency band at decoder; LTP Sinc window; signal decomposition, etc.

Citations

45 Claims

1. A method of improving synthesized speech quality comprising:
- obtaining an input speech signal;
  
  coding said input speech using a Code Excited Linear Prediction coder to generate code parameters for synthesis of said input speech; and
  
  using a voicing index representing a characteristic of said input speech in enhancing said synthesis of said input speech.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
- - 2. The method of claim 1, wherein said characteristic of said input speech is periodicity of said input speech.
  - 3. The method of claim 1, wherein said enhancing said synthesis of said input speech is by controlling an adaptive highpass filter with said voicing index to enhance high frequency region during said coding.
  - 4. The method of claim 1, wherein said enhancing said synthesis of said input speech is by controlling an adaptive perceptual weighting filter in said Code Excited Linear Prediction coder with said voicing index.
  - 5. The method of claim 1, wherein said enhancing said synthesis of said input speech is by controlling an adaptive Sinc window used in said Code Excited Linear Prediction coder for pitch contribution with said voicing index.
  - 6. The method of claim 1, wherein said enhancing said synthesis of said input speech is by controlling spectrum tilt of said input speech by short-term enhancement of a fixed-codebook of said Code Excited Linear Prediction coder with said voicing index.
  - 7. The method of claim 1, wherein said enhancing said synthesis of said input speech is by controlling a perceptual weighting filter of said Code Excited Linear Prediction coder with said voicing index.
  - 8. The method of claim 1, wherein said enhancing said synthesis of said input speech is by controlling a linear prediction coder of said Code Excited Linear Prediction coder with said voicing index.
  - 9. The method of claim 1, wherein said enhancing said synthesis of said input speech is by controlling a pitch enhancement fixed-codebook of said Code Excited Linear Prediction coder with said voicing index.
  - 10. The method of claim 1, wherein said enhancing said synthesis of said input speech is by controlling post pitch enhancement of said Code Excited Linear Prediction coder with said voicing index.
  - 11. The method of claim 1, wherein said voicing index selects at least one sub-codebook from a plurality of sub-codebooks of said Code Excited Linear Prediction coder based on said characteristic of said input speech signal.

12. A method of improving synthesized speech quality comprising:
- obtaining code parameters of an input speech signal;
  
  obtaining a voicing index for use in enhancing synthesis of said input speech signal from said code parameters; and
  
  processing said code parameters through a Code Excited Linear Prediction coder using information provided by said voicing index to generate a synthesized version of said input speech signal.
- View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21)
- - 13. The method of claim 12, wherein said voicing index provides periodicity of said input speech signal.
  - 14. The method of claim 12, wherein said voicing index provides characteristics of an adaptive highpass filter used to enhance high frequency region of said excitation during generation of said code parameters for said input speech.
  - 15. The method of claim 12, wherein said voicing index provides characteristics of an adaptive perceptual weighting filter used to enhance perceptual quality of said input speech during generation of said code parameters for said input speech.
  - 16. The method of claim 12, wherein said voicing index provides characteristics of an adaptive Sinc window for pitch contribution used to enhance perceptual quality of said input speech during generation of said code parameters for said input speech.
  - 17. The method of claim 12, wherein said enhancing synthesis of said input speech is by controlling spectrum tilt of said input speech by short-term enhancement of a fixed-codebook of said Code Excited Linear Prediction coder with said voicing index.
  - 18. The method of claim 12, wherein said enhancing of said synthesis of said input speech is by controlling a linear prediction coder filter of said Code Excited Linear Prediction coder with said voicing index.
  - 19. The method of claim 12, wherein said enhancing of said synthesis of said input speech is by controlling a pitch enhancement fixed-codebook of said Code Excited Linear Prediction coder with said voicing index.
  - 20. The method of claim 12, wherein said enhancing said synthesis of said input speech is by controlling post pitch enhancement of said Code Excited Linear Prediction coder with said voicing index.
  - 21. The method of claim 12, wherein said voicing index selects at least one sub-codebook from a plurality of sub-codebooks of said Code Excited Linear Prediction coder based on said characteristic of said input speech signal.

22. An apparatus for improving synthesized speech quality comprising:
- an input speech signal;
  
  a Code Excited Linear Prediction coder for coding said input speech signal to generate code parameters for synthesis of said input speech; and
  
  a voicing index having a characteristic of said input speech for use in enhancing said synthesis of said input speech.
- View Dependent Claims (23, 24, 25, 26, 27)
- - 23. The apparatus of claim 22, wherein said characteristic of said input speech is periodicity of said input speech.
  - 24. The apparatus of claim 22, wherein said characteristic of said input speech is a characteristic of an adaptive highpass filter used to enhance high frequency region of said excitation during said coding.
  - 25. The apparatus of claim 22, wherein said characteristic of said input speech is a characteristic of an adaptive perceptual weighting filter used in said Code Excited Linear Prediction coder.
  - 26. The apparatus of claim 22, wherein said characteristic of said input speech is a characteristic of an adaptive Sinc window used in said Code Excited Linear Prediction coder.
  - 27. The apparatus of claim 22, wherein said voicing index selects at least one sub-codebook from a plurality of sub-codebooks of said Code Excited Linear Prediction coder based on said characteristic of said input speech signal.

28. An apparatus for improving synthesized speech quality comprising:
- a set of code parameters of an input speech signal;
  
  a voicing index for use in enhancing synthesis of said input speech signal from said code parameters; and
  
  a Code Excited Linear Prediction coder using said code parameters and information provided by said voicing index to generate a synthesized version of said input speech signal.
- View Dependent Claims (29, 30, 31, 32, 33)
- - 29. The apparatus of claim 28, wherein said voicing index provides periodicity of said input speech signal.
  - 30. The apparatus of claim 28, wherein said voicing index provides characteristics of a highpass filter used to enhance high frequency region of said excitation during generation of said code parameters for said input speech.
  - 31. The apparatus of claim 28, wherein said voicing index provides characteristics of an adaptive perceptual weighting filter used to enhance perceptual quality of said input speech during generation of said code parameters for said input speech.
  - 32. The apparatus of claim 28, wherein said voicing index provides characteristics of an adaptive Sinc window used to enhance perceptual quality of said input speech during generation of said code parameters for said input speech.
  - 33. The apparatus of claim 28, wherein said voicing index selects at least one sub-codebook from a plurality of sub-codebooks of said Code Excited Linear Prediction coder based on characteristics of said input speech signal.

34. A method of improving synthesized speech quality comprising:
- generating a plurality of frames from an input speech signal;
  
  coding each frame of said plurality of frames using a Code Excited Linear Prediction coder to generate code parameters for synthesis of said each frame of said input speech; and
  
  transmitting a voicing index having a plurality of bits indicative of a classification of said each frame of said input speech.
- View Dependent Claims (35, 36, 37, 38, 39)
- - 35. The method of claim 34, wherein said plurality of bits are three bits.
  - 36. The method of claim 34, wherein said classification is indicative of periodicity of said input speech signal.
  - 37. The method of claim 34, wherein said classification is indicative of an irregular voiced speech signal.
  - 38. The method of claim 34, wherein said classification is indicative of a periodic index.
  - 39. The method of claim 38, wherein said periodic index ranges from low periodic index to high periodic index.

40. A method of improving synthesized speech quality comprising:
- receiving a frame of an input speech signal, said frame having a plurality of code parameters and a voicing index, wherein said voicing index comprises a plurality of bits;
  
  determining a classification for said frame of said input speech signal from said plurality of bits of said voicing index; and
  
  decoding said frame using a Code Excited Linear Prediction coder based on said classification to synthesize said input speech.
- View Dependent Claims (41, 42, 43, 44, 45)
- - 41. The method of claim 40, wherein said plurality of bits are three bits.
  - 42. The method of claim 40, wherein said classification is indicative of a noisy speech signal.
  - 43. The method of claim 40, wherein said classification is indicative of an irregular voiced speech signal.
  - 44. The method of claim 40, wherein said classification is indicative of a periodic index.
  - 45. The method of claim 44, wherein said periodic index ranges from low periodic index to high periodic index.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Mindspeed Technologies Inc. (MACOM Technology Solutions Holdings, Inc.)
Original Assignee
Mindspeed Technologies Inc. (MACOM Technology Solutions Holdings, Inc.)
Inventors
Gao, Yang

Application Number

US10/799,503
Publication Number

US 20040181411A1
Time in Patent Office

Days
Field of Search
US Class Current

704/262
CPC Class Codes

G10L 19/005   Correction of errors induce...

G10L 19/087   using mixed excitation mode...

G10L 19/09   Long term prediction, i.e. ...

G10L 19/12   the excitation function bei...

G10L 19/20   using sound class specific ...

G10L 19/265   Pre-filtering, e.g. high fr...

G10L 21/0208   Noise filtering

G10L 21/0232   Processing in the frequency...

G10L 21/038   using band spreading techni...

G10L 25/90   Pitch determination of spee...

Voicing index controls for CELP speech coding

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

Citations

45 Claims

Specification

Solutions

Use Cases

Quick Links

Voicing index controls for CELP speech coding

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

45 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links