Speech synthesis method
First Claim
Patent Images
1. A speech synthesis method comprising:
- generating a representative speech pitch wave from a reference speech signal by subjecting the reference speech signal to one of Fourier transform and Fourier series expansion to produce a discrete spectrum, interpolating the discrete spectrum to generate a consecutive spectrum, and subjecting the consecutive spectrum to inverse Fourier transform;
generating a linear prediction coefficient by subjecting the reference speech signal to a linear prediction analysis;
subjecting the representative speech pitch wave to inverse-filtering based on the linear prediction coefficient to produce a residual pitch wave;
storing information on the residual pitch wave and the linear prediction coefficient in a storage;
generating a voiced speech source signal based on the residual pitch wave from the storage;
generating an unvoiced speech source signal; and
driving a vocal tract filter having the linear prediction coefficient by the voiced speech source signal or the unvoiced speech source signal to generate a synthesis speech.
0 Assignments
0 Petitions
Accused Products
Abstract
A speech synthesis method subjects a reference speech signal to windowing to extract an aperiodic speech pitch wave from the reference speech signal. A linear prediction coefficient is generated by subjecting the reference speech signal to a linear prediction analysis. The aperiodic speech pitch wave is subjected to inverse-filtering based on the linear prediction coefficient to produce a residual pitch wave. Information regarding the residual pitch wave is stored as information of a speech synthesis unit and a voiced period in the storage. The speech is then synthesized using the information of the speech synthesis unit.
192 Citations
1 Claim
-
1. A speech synthesis method comprising:
-
generating a representative speech pitch wave from a reference speech signal by subjecting the reference speech signal to one of Fourier transform and Fourier series expansion to produce a discrete spectrum, interpolating the discrete spectrum to generate a consecutive spectrum, and subjecting the consecutive spectrum to inverse Fourier transform;
generating a linear prediction coefficient by subjecting the reference speech signal to a linear prediction analysis;
subjecting the representative speech pitch wave to inverse-filtering based on the linear prediction coefficient to produce a residual pitch wave;
storing information on the residual pitch wave and the linear prediction coefficient in a storage;
generating a voiced speech source signal based on the residual pitch wave from the storage;
generating an unvoiced speech source signal; and
driving a vocal tract filter having the linear prediction coefficient by the voiced speech source signal or the unvoiced speech source signal to generate a synthesis speech.
-
Specification