Speech synthesis apparatus
First Claim
1. A speech synthesis apparatus comprising:
- a text analyzer operable to generate a phonetic and prosodic symbol string from character information of an input text;
a word dictionary storing a reading and an accent of a word;
a voice segment dictionary storing a phoneme that is a basic unit of speech;
a parameter generator operable to generate synthesizing parameters including at least a phoneme, a duration of the phoneme and a fundamental frequency for the phonetic and prosodic symbol string, the parameter generator including a calculating means operable to obtain a sum of phrase components and a sum of accent components and to calculate a mora average from the sum of the phrase components and the sum of the accent components, and a determining means operable to determine a base pitch from the mora average; and
a waveform generator operable to generate a synthesized waveform by making waveform-overlapping referring to the synthesizing parameters generated by the parameter generator and the voice segment dictionary.
5 Assignments
0 Petitions
Accused Products
Abstract
The speech synthesis apparatus of the present invention includes: a text analyzer operable to generate a phonetic and prosodic symbol string from character information of an input text; a word dictionary storing a reading and an accent of a word; an voice segment dictionary storing a phoneme that is a basic unit of speech; a parameter generator operable to generate synthesizing parameters including at least a phoneme, a duration of the phoneme and a fundamental frequency for the phonetic and prosodic symbol string, the parameter generator including a calculating means operable to obtain a sum of phrase components and a sum of accent components and to calculate an average pitch from the sum of the phrase components and the sum of the accent components, and a determining means operable to determine a base pitch from the average pitch; and a waveform generator operable to generate a synthesized waveform by making waveform-overlapping referring to the synthesizing parameters generated by the parameter generator and the voice segment dictionary.
211 Citations
4 Claims
-
1. A speech synthesis apparatus comprising:
-
a text analyzer operable to generate a phonetic and prosodic symbol string from character information of an input text;
a word dictionary storing a reading and an accent of a word;
a voice segment dictionary storing a phoneme that is a basic unit of speech;
a parameter generator operable to generate synthesizing parameters including at least a phoneme, a duration of the phoneme and a fundamental frequency for the phonetic and prosodic symbol string, the parameter generator including a calculating means operable to obtain a sum of phrase components and a sum of accent components and to calculate a mora average from the sum of the phrase components and the sum of the accent components, and a determining means operable to determine a base pitch from the mora average; and
a waveform generator operable to generate a synthesized waveform by making waveform-overlapping referring to the synthesizing parameters generated by the parameter generator and the voice segment dictionary. - View Dependent Claims (2)
the determining means determines the base pitch in such a manner that a value obtained by adding the mora average and the base pitch becomes constant.
-
-
3. A speech synthesis apparatus comprising:
-
a text analyzer operable to generate a phonetic and prosodic symbol string from character information of an input text;
a word dictionary storing a reading and an accent of a word;
a voice segment dictionary storing a phoneme that is a basic unit of speech;
a parameter generator operable to generate synthesizing parameters including at least a phoneme, a duration of the phoneme and a fundamental frequency for the phonetic and prosodic symbol string, the parameter generator including a calculating means operable to overlap a phrase component and an accent component, obtain an approximation of a pitch contour from the overlapped phrase and accent components and calculate at least a maximum value of the approximation of the pitch contour, and a modifying means operable to modify a value of the phrase component and a value of the accent component by using at least the maximum value; and
a waveform generator operable to generate a synthesized waveform by making waveform-overlapping referring to the synthesizing parameters generated by the parameter generator and the voice segment dictionary. - View Dependent Claims (4)
the modifying means modifies the magnitude of the phrase component and the magnitude of the accent component in such a manner that a difference between the maximum value and the minimum value is made substantially the same as an intonation value set by a user.
-
Specification