Simultaneous plural-voice text-to-speech synthesizer
First Claim
1. A text-to-speech synthesizer for selecting necessary speech segment information from speech segment database based on reading and word class information on input text information and generating a speech signal based on the selected speech segment information, comprising:
- text analyzing means for analyzing the input text information and obtaining reading and word class information;
prosody generating means for generating prosody information based on the reading and the word class information;
plural speech instructing means for instructing simultaneous speaking of an identical input text by a plurality of voices; and
plural speech synthesizing means for generating a plurality of synthesized speech signals based on prosody information from the prosody generating means and speech segment information selected from the speech segment database upon reception of an instruction from the plural speech instructing means.
1 Assignment
0 Petitions
Accused Products
Abstract
A multiple-voice instructing unit (17) instructs pitch deforming ratio and mixing ratio to a multiple-voice synthesis unit (16). The multiple voice synthesis unit (16) generates a standard voice signal by means of waveform superimposition based on voice element data read from a voice element database (15) and prosodic information from a voice element selecting unit (14), expands/contracts the time base of the above standard voice signal based on the prosodic information and instruction information from the multiple-voice instructing unit (17) to change a voice pitch, and mixes the standard voice signal with an expansion/contraction voice signal for outputting via an output terminal (18). Accordingly, a concurrent vocalization by multiple speakers based on the same text can be implemented without the need of time-division, parallel text analyzing and prosody generating and of adding pitch converting as post-processing.
-
Citations
18 Claims
-
1. A text-to-speech synthesizer for selecting necessary speech segment information from speech segment database based on reading and word class information on input text information and generating a speech signal based on the selected speech segment information, comprising:
-
text analyzing means for analyzing the input text information and obtaining reading and word class information; prosody generating means for generating prosody information based on the reading and the word class information; plural speech instructing means for instructing simultaneous speaking of an identical input text by a plurality of voices; and plural speech synthesizing means for generating a plurality of synthesized speech signals based on prosody information from the prosody generating means and speech segment information selected from the speech segment database upon reception of an instruction from the plural speech instructing means. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A computer readable program storage medium. storing a text-to-speech synthesis processing program for causing a computer to perform the steps of:
-
analyzing input text information and obtaining reading and word class information; generating prosody information based on the reading and the word class information; instructing simultaneous speaking of an identical input text by a plurality of voices; generating a plurality of synthesized speech signals based on prosody information and speech segment information selected from a speech segment database upon reception of an instruction.
-
Specification