Wideband speech parameterization for high quality synthesis, transformation and quantization
First Claim
1. A method for speech parameterization and coding of a continuous speech signal, comprising:
- receiving a continuous speech signal representing speech recorded by at least one microphone,dividing said continuous speech signal into a plurality of speech frames, and for each one of said plurality of speech frames;
modeling said speech frame by a first harmonic modeling to produce a plurality of harmonic model parameter values, wherein said first harmonic modeling is estimated by computing a cost function between a plurality of sine function signals and said speech frame, wherein each of said plurality of sine function signals comprises one of a plurality of harmonic frequencies, an amplitude value and a phrase value;
reconstructing an estimated frame signal from said plurality of harmonic model parameter values;
subtracting said estimated frame signal from said speech frame to produce a harmonic model residual signal;
performing at least one second harmonic modeling analysis on said first harmonic model residual to determine at least one set of second harmonic model component values;
removing said at least one set of second harmonic model component values from said first harmonic model residual signal to produce a harmonically-filtered residual signal; and
processing said harmonically-filtered residual signal with analysis by synthesis techniques to produce vectors of codebook indices and corresponding gains, andsending said plurality of harmonic model parameter values and said codebook vector indices and corresponding gains to a speech processor configured to compute at least one of a speech transformation, a signal compression and a conversion to an audible sound output.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for speech parameterization and coding of a continuous speech signal. The method comprises dividing said speech signal into a plurality of speech frames, and for each one of the plurality of speech frames, modeling said speech frame by a first harmonic modeling to produce a plurality of harmonic model parameters, reconstructing an estimated frame signal from the plurality of harmonic model parameters, subtracting the estimated frame signal from the speech frame to produce a harmonic model residual, performing at least one second harmonic modeling analysis on the first harmonic model residual to determine at least one set of second harmonic model components, removing the at least one set of second harmonic model components from the first harmonic model residual to produce a harmonically-filtered residual signal, and processing the harmonically-filtered residual signal with analysis by synthesis techniques to produce vectors of codebook indices and corresponding gains.
18 Citations
20 Claims
-
1. A method for speech parameterization and coding of a continuous speech signal, comprising:
-
receiving a continuous speech signal representing speech recorded by at least one microphone, dividing said continuous speech signal into a plurality of speech frames, and for each one of said plurality of speech frames; modeling said speech frame by a first harmonic modeling to produce a plurality of harmonic model parameter values, wherein said first harmonic modeling is estimated by computing a cost function between a plurality of sine function signals and said speech frame, wherein each of said plurality of sine function signals comprises one of a plurality of harmonic frequencies, an amplitude value and a phrase value; reconstructing an estimated frame signal from said plurality of harmonic model parameter values; subtracting said estimated frame signal from said speech frame to produce a harmonic model residual signal; performing at least one second harmonic modeling analysis on said first harmonic model residual to determine at least one set of second harmonic model component values; removing said at least one set of second harmonic model component values from said first harmonic model residual signal to produce a harmonically-filtered residual signal; and processing said harmonically-filtered residual signal with analysis by synthesis techniques to produce vectors of codebook indices and corresponding gains, and sending said plurality of harmonic model parameter values and said codebook vector indices and corresponding gains to a speech processor configured to compute at least one of a speech transformation, a signal compression and a conversion to an audible sound output. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method for speech parameterization and coding of a continuous speech signal, comprising:
-
receiving a continuous speech signal representing speech recorded by at least one microphone, dividing said speech signal into a plurality of speech frames; for each one of said plurality of speech frames; modeling said speech frame by a first harmonic modeling to produce a plurality of harmonic model parameter values, wherein said first harmonic modeling is estimated by computing a cost function between a plurality of sine function signals and said speech frame, wherein each of said plurality of sine function signals comprises one of a plurality of harmonic frequencies, an amplitude value and a phrase value; reconstructing an estimated frame signal from said plurality of harmonic model parameter values; subtracting said estimated frame signal from said speech frame to produce a harmonic model residual signal; removing at least one harmonic component value from said first harmonic model residual signal to produce a harmonically-filtered residual signal; removing periodic energy envelope modulation using a second modeling of said harmonically-filtered residual signal using a sum of multiple instances of a periodic function at arbitrary frequencies taking into account the time-domain energy envelope signal estimate with imposed periodicity; and processing said harmonically-filtered residual signal with analysis by synthesis techniques to produce vectors of codebook indices and corresponding gains, and sending said plurality of harmonic model parameter values and said codebook vector indices and corresponding gains to a speech processor configured to compute at least one of a speech transformation, a signal compression and a conversion to an audible sound output. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. An apparatus for speech parameterization and coding of a continuous speech signal, comprising:
-
at least one input interface for receiving and digitizing said continuous speech signal; at least one processing unit for performing the actions of; receiving a continuous speech signal representing speech recorded by at least one microphone, dividing said continuous speech signal into a plurality of speech frames, and for each one of said plurality of speech frames; modeling said speech frame by a first harmonic model to produce a plurality of frame model parameter values and harmonic model residual, wherein said first harmonic modeling is estimated by computing a cost function between a plurality of sine function signals and said speech frame, wherein each of said plurality of sine function signals comprises one of a plurality of harmonic frequencies, an amplitude value and a phrase value; performing at least one second harmonic modeling analysis on said first harmonic model residual to remove at least one set of second harmonic model component values from said first harmonic model residual signal to produce a harmonically-filtered residual signal; and processing said harmonically-filtered residual signal with analysis by synthesis techniques to produce vectors of codebook indices and corresponding gains, and sending said plurality of harmonic model parameter values and said codebook vector indices and corresponding gains to a speech processor configured to compute at least one of a speech transformation, a signal compression and a conversion to an audible sound output; at least one output interface to send said plurality of speech parameter values and codes; and a housing for containing said at least one input interface, said at least one processing unit, and said at least one output interface, said housing being configured and suitable for the apparatus environment. - View Dependent Claims (18, 19, 20)
-
Specification