Speech synthesis method and speech synthesizer
Abstract
A speech synthesis method comprises selecting predetermined formant parameters from formant parameters according to a pitch pattern, phoneme duration, and phoneme symbol string; generating a plurality of sine waves based on the formant frequency and formant phase of the selected formant parameters; multiplying the sine waves by the windowing functions of the selected formant parameters, respectively, to generate a plurality of formant waveforms; adding the formant waveforms to generate a plurality of pitch waveforms; and superposing the pitch waveforms according to a pitch period to generate a speech signal.
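The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The Hann window shape, the 16 kHz sample rate, the two example formants, and all function and field names (`formant_waveform`, `pitch_waveform`, `superpose`, `freq_hz`, `phase`, `window`) are assumptions chosen for illustration; the text above only states that each formant has a frequency, a phase, and a windowing function.

```python
import math

def formant_waveform(freq_hz, phase, window, sample_rate):
    """Sine wave at the formant frequency, multiplied sample by sample by its window."""
    return [w * math.sin(2.0 * math.pi * freq_hz * n / sample_rate + phase)
            for n, w in enumerate(window)]

def pitch_waveform(formants, sample_rate):
    """Add the formant waveforms of one frame into a single pitch waveform."""
    length = max(len(f["window"]) for f in formants)
    out = [0.0] * length
    for f in formants:
        wave = formant_waveform(f["freq_hz"], f["phase"], f["window"], sample_rate)
        for n, v in enumerate(wave):
            out[n] += v
    return out

def superpose(pitch_waveforms, pitch_period, total_len):
    """Overlap-add pitch waveforms spaced one pitch period (in samples) apart."""
    speech = [0.0] * total_len
    for k, pw in enumerate(pitch_waveforms):
        start = k * pitch_period
        for n, v in enumerate(pw):
            if start + n < total_len:
                speech[start + n] += v
    return speech

def hann(length):
    """Hann window; the window shape is an assumption, not taken from the patent."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * n / (length - 1))
            for n in range(length)]

SAMPLE_RATE = 16000  # assumed sample rate in Hz

# Hypothetical formant parameters for one frame.
frame = [
    {"freq_hz": 700.0, "phase": 0.0, "window": hann(160)},
    {"freq_hz": 1200.0, "phase": 0.0, "window": hann(120)},
]

pw = pitch_waveform(frame, SAMPLE_RATE)
# Repeat the same pitch waveform at a fixed pitch period of 128 samples (125 Hz).
speech = superpose([pw] * 5, pitch_period=128, total_len=800)
```

In this sketch the pitch period is constant and the same frame is reused at every period; a real synthesizer would select new formant parameters per frame and vary the period according to the pitch pattern.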
20 Claims
1. A speech synthesis method comprising:
storing a number of formant parameters in a storage, the formant parameters representing formant frequencies and windowing functions;
selecting predetermined formant parameters from the formant parameters according to a phoneme symbol string;
generating a plurality of sine waves based on the formant frequency corresponding to the formant parameters selected;
multiplying the sine waves by the windowing functions corresponding to the selected formant parameters, respectively, to generate a plurality of formant waveforms;
adding the formant waveforms to generate a plurality of pitch waveforms; and
superposing the pitch waveforms according to a pitch period to generate a speech signal.
11. A speech synthesizer supplied with a pitch pattern, phoneme duration and phoneme symbol string, comprising:
a pitch mark generator configured to generate pitch marks referring to the pitch pattern and phoneme duration;
a pitch waveform generator configured to generate pitch waveforms corresponding to the pitch marks, referring to the phoneme symbol string;
a waveform superposition device configured to superpose the pitch waveforms on the pitch marks to generate a voiced speech signal;
an unvoiced speech generator configured to generate unvoiced speech; and
an adder configured to add the voiced speech and the unvoiced speech to generate synthesized speech,
the pitch waveform generator including:
a storage configured to store a plurality of formant parameters in units of a synthesis unit;
a parameter selector configured to select the formant parameters for one frame corresponding to the pitch marks from the storage referring to the phoneme symbol string;
a sine wave generator configured to generate sine waves according to formant frequencies of the read formant parameters;
a multiplier configured to multiply the sine waves by the windowing functions of the selected formant parameters to generate formant waveforms; and
an adder configured to add the formant waveforms to generate the pitch waveforms.
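The outer structure of claim 11 (pitch mark generator, waveform superposition device, unvoiced speech generator, and adder) can be sketched as follows. This is an illustrative assumption, not the claimed apparatus. The pitch pattern and phoneme durations are represented here as a hypothetical list of `(duration_in_samples, f0_hz)` segments, with `f0_hz == 0` standing for an unvoiced phoneme. Uniform noise stands in for the unvoiced speech, and `stub_pitch_waveform` is a placeholder for the pitch waveform generator.

```python
import math
import random

def generate_pitch_marks(segments, sample_rate):
    """Place pitch marks one pitch period apart inside each voiced segment."""
    marks = []
    seg_start = 0
    next_mark = 0
    for duration, f0_hz in segments:
        seg_end = seg_start + duration
        if f0_hz > 0:
            period = round(sample_rate / f0_hz)  # pitch period in samples
            next_mark = max(next_mark, seg_start)
            while next_mark < seg_end:
                marks.append(next_mark)
                next_mark += period
        seg_start = seg_end
    return marks

def stub_pitch_waveform(length=64, freq_hz=700.0, sample_rate=16000):
    """Placeholder pitch waveform: a single Hann-windowed sine (assumption)."""
    return [(0.5 - 0.5 * math.cos(2.0 * math.pi * n / (length - 1)))
            * math.sin(2.0 * math.pi * freq_hz * n / sample_rate)
            for n in range(length)]

def superpose_on_marks(pitch_waveforms, marks, total_len):
    """Overlap-add each pitch waveform starting at its pitch mark."""
    voiced = [0.0] * total_len
    for pw, mark in zip(pitch_waveforms, marks):
        for n, v in enumerate(pw):
            if mark + n < total_len:
                voiced[mark + n] += v
    return voiced

def generate_unvoiced(segments, total_len, amplitude=0.05, seed=0):
    """Fill unvoiced segments with low-level uniform noise (assumed noise model)."""
    rng = random.Random(seed)
    unvoiced = [0.0] * total_len
    start = 0
    for duration, f0_hz in segments:
        if f0_hz == 0:
            for n in range(start, min(start + duration, total_len)):
                unvoiced[n] = rng.uniform(-amplitude, amplitude)
        start += duration
    return unvoiced

SAMPLE_RATE = 16000  # assumed sample rate in Hz
# Hypothetical input: voiced at 100 Hz, unvoiced, voiced at 200 Hz.
segments = [(400, 100.0), (200, 0.0), (400, 200.0)]
total_len = sum(d for d, _ in segments)

marks = generate_pitch_marks(segments, SAMPLE_RATE)
voiced = superpose_on_marks([stub_pitch_waveform() for _ in marks], marks, total_len)
unvoiced = generate_unvoiced(segments, total_len)
# The adder: sample-wise sum of the voiced and unvoiced signals.
synthesized = [v + u for v, u in zip(voiced, unvoiced)]
```

Working in integer sample indices keeps the pitch mark positions exact; a pitch pattern that varies within a phoneme would need the period recomputed at each mark.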
19. A speech synthesis program recorded on a computer readable medium, the program comprising:
means for instructing a computer to store a number of formant parameters in a storage, the formant parameters representing formant frequencies and windowing functions;
means for instructing the computer to select predetermined formant parameters from the formant parameters according to a phoneme symbol string;
means for instructing the computer to generate a plurality of sine waves based on the formant frequency corresponding to the formant parameters selected;
means for instructing the computer to multiply the sine waves by the windowing functions corresponding to the selected formant parameters, respectively, to generate a plurality of formant waveforms;
means for instructing the computer to add the formant waveforms to generate a plurality of pitch waveforms; and
means for instructing the computer to superpose the pitch waveforms according to a pitch period to generate a speech signal.
Specification