Apparatus and quality enhancement algorithm for mixed excitation linear predictive (MELP) and other speech coders
First Claim
Patent Images
1. A method of enhancing the speech quality of a speech coder encoded data transmission, comprising:
- digitally sampling speech to create a speech waveform over a plurality of frames;
identifying frames that contain a plosive signal distinguished from other transitory signals;
analyzing the plosive signal to create plosive signal parameters;
applying the plosive signal parameters to a linear prediction residual plosive signal to synthesize the plosive signal for frames that contain a plosive signal; and
adding the synthesized plosive signal to the synthesized speech for the frame that contains the plosive.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for enhancing the speech quality of the mixed excitation linear predictive (MELP) coder and other low bit-rate speech coders. The system and method employ a plosive analysis/synthesis method, which detects the frame containing a plosive signal, applies a simple model to synthesize the plosive signal, and adds the synthesized plosive to the coded speech. The system and method remains compatible with the existing MELP coder bit stream.
-
Citations
17 Claims
-
1. A method of enhancing the speech quality of a speech coder encoded data transmission, comprising:
-
digitally sampling speech to create a speech waveform over a plurality of frames;
identifying frames that contain a plosive signal distinguished from other transitory signals;
analyzing the plosive signal to create plosive signal parameters;
applying the plosive signal parameters to a linear prediction residual plosive signal to synthesize the plosive signal for frames that contain a plosive signal; and
adding the synthesized plosive signal to the synthesized speech for the frame that contains the plosive. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
applying the plosive signal parameters to a previously-stored linear prediction residual plosive signal and applying a linear prediction synthesis filter.
-
-
4. The method of claim 1, wherein the step of analyzing comprises identifying a subdivision of the frame that contains the plosive and calculating the amplitude of the plosive.
-
5. The method of claim 3, wherein applying the plosive signal parameters comprises scaling a previously-stored plosive signal by the plosive amplitude.
-
6. The method of claim 5, wherein the length of a previously-stored linear prediction residual plosive signal is equal to the length of a subdivision of the frame containing the plosive.
-
7. The method of claim 2, wherein detecting peakiness in the linear prediction residual signal comprises computing the ratio of the L1 and L2 norm of the linear prediction residual signal with a sliding sample window.
-
8. The method of claim 4, wherein the step of adding the synthesized plosive signal comprises adding the synthesized plosive signal to the identified subdivision of the frame.
-
9. A speech coder, comprising:
-
means for digitally sampling speech to create a speech waveform over a plurality of frames;
means for identifying frames that contain a plosive signal distinguished from other transitory signals;
means for analyzing the plosive signal to create plosive signal parameters;
means for applying the plosive signal parameters to a linear prediction residual signal to synthesize the plosive signal for frames that contain the plosive; and
means for adding the plosive signal to the synthesized speech for frames that contain the plosive. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17)
scaling a previously-stored signal by the plosive amplitude.
-
-
15. The coder of claim 10, wherein the means for identifying further comprises:
detecting peakiness in the linear prediction residual signal.
-
16. The coder of claim 15, wherein detecting peakiness comprises computing the ratio of the L1 and L2 norm of the linear prediction residual signal with a sliding sample window.
-
17. The coder of claim 12, wherein the means for adding the plosive signal to the synthesized speech comprises adding the synthesized plosive signal to the subdivision.
Specification