Text voice synthesis device and program recording medium
First Claim
1. A text-to-speech synthesizer for selecting necessary speech segment information from speech segment database based on reading and word class information on input text information and generating a speech signal based on the selected speech segment information, comprising:
- text analyzing means (12) for analyzing the input text information and obtaining reading and word class information;
prosody generating means (13) for generating prosody information based on the reading and the word class information;
plural speech instructing means (17) for instructing simultaneous speaking of an identical input text by a plurality of voices; and
plural speech synthesizing means (16) for generating a plurality of synthesized speech signals based on prosody information from the prosody generating means (13) and speech segment information selected from the speech segment database (15) upon reception of an instruction from the plural speech instructing means (17).
1 Assignment
0 Petitions
Accused Products
Abstract
A multiple-voice instructing unit (17) instructs pitch deforming ratio and mixing ratio to a multiple-voice synthesis unit (16). The multiple voice synthesis unit (16) generates a standard voice signal by means of waveform superimposition based on voice element data read from a voice element database (15) and prosodic information from a voice element selecting unit (14), expands/contracts the time base of the above standard voice signal based on the prosodic information and instruction information from the multiple-voice instructing unit (17) to change a voice pitch, and mixes the standard voice signal with an expansion/contraction voice signal for outputting via an output terminal (18). Accordingly, a concurrent vocalization by multiple speakers based on the same text can be implemented without the need of time-division, parallel text analyzing and prosody generating and of adding pitch converting as post-processing.
29 Citations
17 Claims
-
1. A text-to-speech synthesizer for selecting necessary speech segment information from speech segment database based on reading and word class information on input text information and generating a speech signal based on the selected speech segment information, comprising:
-
text analyzing means (12) for analyzing the input text information and obtaining reading and word class information;
prosody generating means (13) for generating prosody information based on the reading and the word class information;
plural speech instructing means (17) for instructing simultaneous speaking of an identical input text by a plurality of voices; and
plural speech synthesizing means (16) for generating a plurality of synthesized speech signals based on prosody information from the prosody generating means (13) and speech segment information selected from the speech segment database (15) upon reception of an instruction from the plural speech instructing means (17). - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
Specification