Phase excited linear prediction encoder
First Claim
1. A speech encoder, comprising:
- a content extraction module including, a band pass filter that receives a speech input signal and generates a band limited speech signal, a first speech buffer connected to the band pass filter that stores the band limited speech signal, an LP analysis block connected to the first speech buffer that reads the stored speech signal and generates a plurality of LP coefficients therefrom, an LPC to LSF block connected to the LP analysis block for converting the LP coefficients to a line spectral frequency (LSF) vector, an LP analysis filter connected to the LPC to LSF block that extracts an LP residual signal from the LSF vector; and
an LSF quantizer connected to the LPC to LSF block that receives the LSF vector and determines an LSF index therefor;
a pitch detector connected to the LP analysis block of the content extraction module, the pitch detector classifying the band filtered speech signal as one of a voiced signal and an unvoiced signal; and
a naturalness enhancement module connected to the content extraction module and the pitch detector, the naturalness enhancement module including, means for extracting parameters from the LP residual signal, wherein for an unvoiced signal the extracted parameters include pitch and gain and for a voiced signal the extracted parameters include pitch, gain and excitation level; and
a quantizer for quantizing the extracted parameters and generating quantized parameters.
16 Assignments
0 Petitions
Accused Products
Abstract
A low bit rate phase excited linear prediction type speech encoder filters a speech signal to limit its bandwidth and then fragments the filtered speech signal into speech segments. The speech segments are decomposed into a spectral envelope and an LP residual signal. The spectral envelope is represented by LP filter coefficients. The LP filter coefficients are converted into line spectral frequencies (LSF). Each speech segment is also classified as one of a voiced segment and an unvoiced segment based on a pitch of the segment. Parameters are extracted from the LP residual signal, where for an unvoiced segment the extracted parameters include pitch and gain and for a voiced segment the extracted parameters include pitch, gain and excitation level. The extracted parameters are then quantized.
-
Citations
53 Claims
-
1. A speech encoder, comprising:
-
a content extraction module including, a band pass filter that receives a speech input signal and generates a band limited speech signal, a first speech buffer connected to the band pass filter that stores the band limited speech signal, an LP analysis block connected to the first speech buffer that reads the stored speech signal and generates a plurality of LP coefficients therefrom, an LPC to LSF block connected to the LP analysis block for converting the LP coefficients to a line spectral frequency (LSF) vector, an LP analysis filter connected to the LPC to LSF block that extracts an LP residual signal from the LSF vector; and
an LSF quantizer connected to the LPC to LSF block that receives the LSF vector and determines an LSF index therefor;
a pitch detector connected to the LP analysis block of the content extraction module, the pitch detector classifying the band filtered speech signal as one of a voiced signal and an unvoiced signal; and
a naturalness enhancement module connected to the content extraction module and the pitch detector, the naturalness enhancement module including, means for extracting parameters from the LP residual signal, wherein for an unvoiced signal the extracted parameters include pitch and gain and for a voiced signal the extracted parameters include pitch, gain and excitation level; and
a quantizer for quantizing the extracted parameters and generating quantized parameters. - View Dependent Claims (2, 4, 5, 6, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21)
-
-
3. The speech encoder of claim 3, wherein the IIR filter includes a fourth order low-pass section and a fourth order high pass section.
-
7. The speech encoder of claim 7, wherein a bandwidth expansion block expands the plurality of LP coefficients to generate bandwidth expanded LP coefficients.
-
22. A content extraction module for a speech encoder, the content extraction module comprising:
-
a band pass filter that receives a speech input signal and generates a band limited speech signal, a first speech buffer connected to the band pass filter that stores the band limited speech signal, an LP analysis block connected to the first speech buffer that reads the stored speech signal and generates a plurality of LP coefficients therefrom, an LPC to LSF block connected to the LP analysis block for converting the LP coefficients to a line spectral frequency (LSF) vector, an LP analysis filter connected to the LPC to LSF block that extracts an LP residual signal from the LSF vector; and
an LSF quantizer connected to the LPC to LSF block that receives the LSF vector and determines an LSF index therefor. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29)
-
-
30. A naturalness enhancement module for a speech encoder, wherein the speech encoder includes a pitch detector for determining whether an input speech signal is a voiced signal or an unvoiced signal and a content extraction module for generating an LP residual signal from the input speech signal, the naturalness enhancement module comprising:
-
means for extracting parameters from the LP residual signal, wherein for an unvoiced signal the extracted parameters include pitch and gain and for a voiced signal the extracted parameters include pitch, gain and excitation level; and
a quantizer for quantizing the extracted parameters and generating quantized parameters. - View Dependent Claims (31, 32, 33, 34, 35, 36)
-
-
37. A pitch detector for a speech encoder, the pitch detector comprising:
-
a first operation level for analyzing a speech signal and, based on a first predetermined ambiguity value of the speech signal, generating a first estimated pitch period; and
a second operation level for analyzing the speech signal and, based on a second predetermined ambiguity value of the speech signal, generating a second estimated pitch period. - View Dependent Claims (38, 39, 40, 41, 42, 43)
-
-
44. A speech signal preprocessor for preprocessing an input speech signal prior to providing said speech signal to a speech encoder, the preprocessor comprising:
-
a band pass filter that receives said speech input signal and generates a band limited speech signal; and
a scale down unit connected to the band pass filter for limiting a dynamic range of the band limited speech signal. - View Dependent Claims (45, 46, 47)
-
-
48. A method of encoding a speech signal, comprising the steps of:
-
filtering the speech signal to limit a bandwidth thereof;
fragmenting the filtered speech signal into speech segments;
decomposing the speech segments into a spectral envelope and an LP residual signal, wherein the spectral envelope is represented by a plurality of LP filter coefficients (LPC);
converting the LPC into a plurality of line spectral frequencies (LSF);
classifying each speech segment as one of a voiced segment and an unvoiced segment based on a pitch of the segment;
extracting parameters from the LP residual signal, wherein for an unvoiced segment the extracted parameters include pitch and gain and for a voiced segment the extracted parameters include pitch, gain and excitation level; and
quantizing the extracted parameters and generating quantized parameters. - View Dependent Claims (49, 50, 51, 52, 53)
-
Specification