Speech synthesis method and speech synthesizer
First Claim
Patent Images
1. A speech synthesis method comprising:
- storing a plurality of formant parameter groups each including a number of formant parameters in a storage in units of a synthesis unit, the formant parameters representing a formant frequency, a formant phase and a windowing function;
selecting predetermined formant parameters from the formant parameters stored in the storage according to a phoneme symbol string;
generating a plurality of sine waves based on formant frequencies and formant phases corresponding to the formant parameters selected;
multiplying the sine waves by the windowing functions corresponding to the selected formant parameters, respectively, to generate a plurality of formant waveforms each having a characteristic of one formant;
adding the formant waveforms to generate a pitch waveform having characteristics of a plurality of formants; and
superposing pitch waveforms each corresponding to the pitch waveform according to a pitch period to generate a speech signal.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech synthesis method comprises selecting a predetermined formant parameters from formant parameters according to a pitch pattern, phoneme duration, and phoneme symbol string, generating a plurality of sine waves based on formant frequency and formant phase of the formant parameters selected, multiplying the sine waves by windowing functions of the selected formant parameters, respectively, to generate a plurality of formant waveforms, adding the formant waveforms to generate a plurality of pitch waveforms, and superposing the pitch waveforms according to a pitch period to generate a speech signal.
17 Citations
20 Claims
-
1. A speech synthesis method comprising:
-
storing a plurality of formant parameter groups each including a number of formant parameters in a storage in units of a synthesis unit, the formant parameters representing a formant frequency, a formant phase and a windowing function; selecting predetermined formant parameters from the formant parameters stored in the storage according to a phoneme symbol string; generating a plurality of sine waves based on formant frequencies and formant phases corresponding to the formant parameters selected; multiplying the sine waves by the windowing functions corresponding to the selected formant parameters, respectively, to generate a plurality of formant waveforms each having a characteristic of one formant; adding the formant waveforms to generate a pitch waveform having characteristics of a plurality of formants; and superposing pitch waveforms each corresponding to the pitch waveform according to a pitch period to generate a speech signal. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A speech synthesizer supplied with a pitch pattern, phoneme duration and phoneme symbol string, comprising:
-
a pitch mark generator configured to generate pitch marks referring to the pitch pattern and phoneme duration; a pitch waveform generator configured to generate pitch waveforms corresponding to the pitch marks, referring to the phoneme symbol string; a waveform superposition device configured to superpose the pitch waveforms on the pitch marks according to a pitch period to generate a voiced speech signal; a unvoiced speech generator configured to generate an unvoiced speech; an adder configured to add the voiced speech and the unvoiced speech to generate a synthesized speech, the pitch waveform generator including; a storage configured to store a plurality of formant parameter groups each including a plurality of formant parameters in units of a synthesis unit, the formant parameters representing a formant frequency, a formant phase and a windowing function, a parameter selector configured to select the formant parameters for one frame corresponding to the pitch marks from the storage referring to the phoneme symbol string, a plurality of sine wave generators configured to generate a plurality of sine waves according to formant frequencies and formant phases corresponding to the selected formant parameters, a multiplier configured to multiply the sine waves by the windowing functions of the selected formant parameters to generate a plurality of formant waveforms each having a characteristic of one formant, an adder configured to add the formant waveforms to generate a pitch waveform having characteristics of a plurality of formants. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18)
-
-
19. A speech synthesis program recorded on a computer readable medium, the program comprising:
-
means for instructing a computer to store a number of formant parameters in a storage, the formant parameters representing a formant frequency, a formant phase and a windowing function; means for instructing the computer to select predetermined formant parameters from the formant parameters stored in the storage according to a phoneme symbol string; means for instructing the computer to generate a plurality of sine waves based on formant frequencies and formant phases corresponding to the formant parameters selected; means for instructing the computer to multiply the sine waves by the windowing functions corresponding to the selected formant parameters, respectively, to generate a plurality of formant waveforms each having a characteristic of one formant; means for instructing the computer to add the formant waveforms to generate a pitch waveform having characteristics of a plurality of formants; and means for instructing the computer to superpose pitch waveforms each corresponding to the pitch waveform according to a pitch period to generate a speech signal. - View Dependent Claims (20)
-
Specification