Linear predictive residual representation via non-iterative spectral reconstruction
First Claim
1. A method of encoding a linear predictive residual signal as derived from an analog speech signal, wherein said linear predictive residual signal is in the form of a plurality of frames of digital speech data, said method comprising the steps of:
- transforming each frame of digital speech data to a frame of digital speech data at least approximating minimum phase; and
subjecting the transformed frame of digital speech data at least approximating minimum phase to a Fourier Transform procedure, thereby providing an encoded version of the frame in which one of the magnitude and the phase information is representative of the original frame of digital speech data which forms part of the original linear predictive residual signal, and the other of the magnitude and the phase information does not occur in the encoded version of the frame.
1 Assignment
0 Petitions
Accused Products
Abstract
Method of encoding speech at medium to high bit rates while maintaining very high speech quality, as specifically directed to the coding of the linear predictive (LPC) residual signal using either its Fourier Transform magnitude or phase. In particular, the LPC residual of the speech signal is coded using minimum phase spectral reconstruction techniques by transforming the LPC residual signal in a manner approximately a minimum phase signal, and then applying spectral reconstruction techniques for representing the LPC residual signal by either its Fourier Transform magnitude or phase. The non-iterative spectral reconstruction technique is based upon cepstral coefficients through which the magnitude and phase of a minimum phase signal are related. The LPC residual as reconstructed and regenerated is used as an excitation signal to a LPC synthesis filter in the generation of analog speech signals via speech synthesis from which audible speech may be produced.
-
Citations
11 Claims
-
1. A method of encoding a linear predictive residual signal as derived from an analog speech signal, wherein said linear predictive residual signal is in the form of a plurality of frames of digital speech data, said method comprising the steps of:
-
transforming each frame of digital speech data to a frame of digital speech data at least approximating minimum phase; and subjecting the transformed frame of digital speech data at least approximating minimum phase to a Fourier Transform procedure, thereby providing an encoded version of the frame in which one of the magnitude and the phase information is representative of the original frame of digital speech data which forms part of the original linear predictive residual signal, and the other of the magnitude and the phase information does not occur in the encoded version of the frame. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method of encoding a linear predictive residual signal as derived from an analog speech signal, wherein said linear predictive residual signal is in the form of a plurality of frames of digital speech data, said method comprising the steps of:
-
searching each frame of digital speech data to detect the peak residual value occurring therein; time-shifting the digital speech data included in the frame to align the peak residual value with the origin of the frame; determining a dispersion measure D for the frame in accordance with the relationship ##EQU7## where n is the number of samples included in the frame of digital speech data, and x is the energy value of a respective sample of the frame; weighting the frame of digital speech data in a manner inversely proportional to the dispersion measure D to provide a transformed frame of digital speech data at least approximating a minimum phase signal; and subjecting the weighted frame of digital speech data to a Fourier Transform procedure, thereby providing an encoded version of the frame in which one of the magnitude and the phase information is representative of the original frame of digital speech data which forms part of the original linear predictive residual signal. - View Dependent Claims (7, 8, 9, 10, 11)
-
8. A method as set forth in claim 7, wherein the magnitude information is the encoded version of the frame representative of the original frame of digital speech data.
-
9. A method as set forth in claim 7, wherein the phase information is the encoded version representative of the original frame of digital speech data.
-
10. A method as set forth in claim 7, further including restoring the encoded version of the frame to the transformed frame of digital speech data at least approximating minimum phase by employing a non-iterative spectral reconstruction, and
removing the weighting of the frame of digital speech data and time-shifting the digital speech data included in the frame to return the peak residual value occurring therein to its original position, thereby regenerating the original frame of digital speech data which forms part of the original linear predictive residual signal. -
11. A method as set forth in claim 10, further including employing the regenerated linear predictive residual signal as an excitation signal with linear predictive speech parameters in a linear predictive coding speech synthesis filter from which audible speech is to be derived.
-
Specification