Speech synthesis system
First Claim
1. A speech synthesis system wherein synthesis parameters necessary for speech synthesis are input, and a speech segment combination matching said synthesis parameters is selected from a speech segment inventory and concatenated, thereby generating and outputting a speech waveform for said synthesis parameters, comprising:
- a speech segment storage unit that stores said speech segment;
a speech segment selection information storage unit that, with respect to a given speech unit sequence, correlates with the speech unit sequence information regarding appropriateness of a combination of speech segment data to be selected from among a plurality of speech segment data stored in said speech segment storage unit that synthesizes the speech unit sequence and that stores speech segment selection information;
a speech segment selection unit that selects a speech segment combination that is most appropriate for said synthesis parameters from said speech segment storage unit based on speech segment selection information stored in said speech segment selection information storage unit; and
a speech synthesis unit that generates and outputs speech waveform data based on a speech segment combination selected by said speech segment selection unit.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech synthesizing system producing a speech of an improved quality of voice by selecting a combination of speech segment most suitable for a synthesis speech unit sequence. The speech synthesizing system comprises a speech segment storage section where speech segment is stored, a speech segment selection information storage section where speech segment selection information including combinations of speech segment constituted of speech segment stored in the speech segment storage section for an arbitrary speech unit sequence and the appropriateness information representing the appropriatenesses of the combinations are stored, a speech segment selecting section for selecting a combination of speech segment most suitable for a synthesis parameter according to the speech segment selection information stored in the speech segment storage section, and a waveform generating section for generating speech waveform data from the combination of speech segment selected by the speech segment selecting section.
180 Citations
6 Claims
-
1. A speech synthesis system wherein synthesis parameters necessary for speech synthesis are input, and a speech segment combination matching said synthesis parameters is selected from a speech segment inventory and concatenated, thereby generating and outputting a speech waveform for said synthesis parameters, comprising:
-
a speech segment storage unit that stores said speech segment; a speech segment selection information storage unit that, with respect to a given speech unit sequence, correlates with the speech unit sequence information regarding appropriateness of a combination of speech segment data to be selected from among a plurality of speech segment data stored in said speech segment storage unit that synthesizes the speech unit sequence and that stores speech segment selection information; a speech segment selection unit that selects a speech segment combination that is most appropriate for said synthesis parameters from said speech segment storage unit based on speech segment selection information stored in said speech segment selection information storage unit; and a speech synthesis unit that generates and outputs speech waveform data based on a speech segment combination selected by said speech segment selection unit. - View Dependent Claims (2, 3)
-
-
4. A speech synthesis method wherein synthesis parameters necessary for speech synthesis are input, and a speech segment combination matching said synthesis parameters is selected from a speech segment inventory and concatenated, thereby generating and outputting a speech waveform for said synthesis parameters, the method comprising:
-
storing said speech segment; storing speech segment selection information with respect to a given speech unit sequence, wherein storing speech segment selection information includes correlating with the speech unit sequence information regarding appropriateness of a combination of speech segment data to be selected from among a plurality of speech segment data stored as speech segment selection information, synthesizing the speech unit sequence, and storing speech segment selection information;
selecting a speech segment combination that is most appropriate for said synthesis parameters based on stored speech segment selection information; andgenerating and outputting speech waveform data based on the selected speech segment combination. - View Dependent Claims (5)
-
-
6. A computer-readable storage medium encoded with processing instructions for causing a processor to execute a speech synthesis method, wherein synthesis parameters necessary for speech synthesis are input, and a speech segment combination matching said synthesis parameters is selected from a speech segment inventory and concatenated, thereby generating and outputting a speech waveform for said synthesis parameters, the method comprising:
-
storing said speech segment; storing speech segment selection information with respect to a given speech unit sequence, wherein storing speech segment selection information includes correlating with the speech unit sequence information regarding appropriateness of a combination of speech segment data to be selected from among a plurality of speech segment data stored as speech segment selection information, synthesizing the speech unit sequence, and storing speech segment selection information; selecting a speech segment combination that is most appropriate for said synthesis parameters based on stored speech segment selection information; and generating and outputting speech waveform data based on said speech segment combination.
-
Specification