Text-to-speech conversion system
First Claim
1. A text-to-speech conversion system for converting a text into a speech waveform, and outputting the speech waveform, said system comprising;
- a conversion processing unit for converting a text inputted from outside into a speech waveform;
a phrase dictionary for previously registering sound-related terms to be expressed as natural sound data of actually recorded sounds; and
a waveform dictionary for previously registering waveform data corresponding to the sound-related terms, obtained from the actually recorded sounds, wherein said conversion processing unit has a function such that as for a term in the text matching a sound-related term registered in said phrase dictionary upon collation of the former with the latter, waveform data corresponding to the relevant sound-related term matching the term in the text, registered in said waveform dictionary, is outputted as a speech waveform of the term.
3 Assignments
0 Petitions
Accused Products
Abstract
The system according to the invention comprises a text-to-speech conversion processing unit, and a phrase dictionary as well as a waveform dictionary, connected independently from each other to the conversion processing unit. The conversion processing unit is for converting any Japanese text inputted from outside into speech. In the phrase dictionary, sound-related terms representing the actually recorded sounds, for example, notations of terms such as onomatopoeic words, background sounds, lyrics, music titles, and so forth, are previously registered. Further, in the waveform dictionary, waveform data obtained from the actually recorded sounds, corresponding to the sound-related terms, are previously registered. Furthermore, the conversion processing unit is constituted such that as for a term in the text matching the sound-related term registered in the phrase dictionary upon collation of the former with the latter, actually recorded speech waveform data corresponding to the relevant sound-related term matching the term in the text, registered in the waveform dictionary, is outputted as a speech waveform of the term.
55 Citations
49 Claims
-
1. A text-to-speech conversion system for converting a text into a speech waveform, and outputting the speech waveform, said system comprising;
-
a conversion processing unit for converting a text inputted from outside into a speech waveform;
a phrase dictionary for previously registering sound-related terms to be expressed as natural sound data of actually recorded sounds; and
a waveform dictionary for previously registering waveform data corresponding to the sound-related terms, obtained from the actually recorded sounds, wherein said conversion processing unit has a function such that as for a term in the text matching a sound-related term registered in said phrase dictionary upon collation of the former with the latter, waveform data corresponding to the relevant sound-related term matching the term in the text, registered in said waveform dictionary, is outputted as a speech waveform of the term. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 36, 37, 46)
-
-
9. A text-to-speech conversion system for converting a text into a speech waveform, and outputting the speech waveform, said system comprising;
-
a conversion processing unit for converting a text inputted from outside into a speech waveform;
a phrase dictionary for previously registering sound-related terms to be expressed as natural sound data of actually recorded sounds; and
a waveform dictionary for previously registering waveform data corresponding to the sound-related terms, obtained from the actually recorded sounds, wherein said conversion processing unit has a function such that in the case where there is a match between a term in the text and a sound-related terms registered in said phrase dictionary upon collation of the former with the latter, waveform data corresponding to the relevant sound-related term matching the term in the text, registered in said waveform dictionary, is superimposed on a speech waveform of the text before outputted. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 38, 39, 40, 41, 42, 43, 47)
-
-
21. A text-to-speech conversion system for converting a text into a speech waveform, and outputting the speech waveform, said system comprising;
-
a conversion processing unit for converting a text containing lyrics, inputted from outside, into a speech waveform;
a song phrase dictionary for previously registering pairs of lyrics and song phonetic/prosodic symbol strings corresponding thereto; and
a song phonetic/prosodic symbol string processing unit for analyzing a song phonetic/prosodic symbol string in order to convert said song phonetic/prosodic symbol string into a synthesized speech waveform of a singing voice, wherein said conversion processing unit has a function such that as for lyrics in the text, matching lyrics registered in said song phrase dictionary upon collation of the former with the latter, a speech waveform of a singing voice, converted on the basis of the song phonetic/prosodic symbol string paired off with registered lyrics that have matched, registered in said song phrase dictionary, is outputted as a speech waveform of the relevant lyrics. - View Dependent Claims (22, 23, 24, 25, 26, 44, 48)
-
-
27. A text-to-speech conversion system for converting a text into a speech waveform, and outputting the speech waveform, said system comprising;
-
a conversion processing unit for converting a text containing a music title, inputted from outside, into a speech waveform;
a music title dictionary for previously registering music titles; and
a musical sound waveform generator for generating a musical sound waveform corresponding to the relevant music title, wherein said musical sound waveform generator comprises a music dictionary for previously registering music data for use in performance, corresponding to the music titles registered in said music title dictionary, and a musical sound synthesizer for converting the relevant music data for use in performance into a musical sound waveform of music, and said conversion processing unit has a function such that as for a music title in the text, matching a music title registered in said music title dictionary upon collation of the former with the latter, the musical sound waveform of music corresponding to the registered music title is superimposed on a speech waveform of the text before outputted. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35, 45, 49)
-
Specification