Speech-to-speech generation system and method
First Claim
1. A speech-to-speech generation system, comprising:
- speech recognition means, for recognizing the speech of language A and creating the corresponding text of language A;
machine translation means for translating the text from language A to language B;
text-to-speech generation means, for generating the speech of language B according to the text of language B, said speech-to-speech translation system is characterized by further comprising;
expressive parameter detection means, for extracting expressive parameters from the speech of language A; and
expressive parameter mapping means for mapping the expressive parameters extracted by the expressive parameter detection means from language A to language B, and driving the text-to-speech generation means by the mapping results to synthesize expressive speech.
1 Assignment
0 Petitions
Accused Products
Abstract
An expressive speech-to-speech generation system and method which can generate expressive speech output by using expressive parameters extracted from the original speech signal to drive the standard TTS system. The system comprises: speech recognition means, machine translation means, text-to-speech generation means, expressive parameter detection means for extracting expressive parameters from the speech of language A, and expressive parameter mapping means for mapping the expressive parameters extracted by the expressive parameter detection means from language A to language B, and driving the text-to-speech generation means by the mapping results to synthesize expressive speech. The system and method can improve the quality of the speech output of the translating system or TTS system.
51 Citations
20 Claims
-
1. A speech-to-speech generation system, comprising:
-
speech recognition means, for recognizing the speech of language A and creating the corresponding text of language A;
machine translation means for translating the text from language A to language B;
text-to-speech generation means, for generating the speech of language B according to the text of language B, said speech-to-speech translation system is characterized by further comprising;
expressive parameter detection means, for extracting expressive parameters from the speech of language A; and
expressive parameter mapping means for mapping the expressive parameters extracted by the expressive parameter detection means from language A to language B, and driving the text-to-speech generation means by the mapping results to synthesize expressive speech. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A speech-to-speech generation system, comprising:
-
speech recognition means for recognizing the speech of dialect A and creating the corresponding text;
text-to-speech generation means for generating the speech of another dialect B according to the text, said speech-to-speech generation system is characterized by further comprising;
expressive parameter detection means, for extracting expressive parameters from the speech of dialect A; and
expressive parameter mapping means, for mapping the expressive parameters extracted by the expressive parameter detection means from dialect A to dialect B, and driving the text-to-speech generation means by the mapping results to synthesize expressive speech. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A speech-to-speech generation method, comprising the steps of:
-
recognizing the speech of language A and creating the corresponding text of language A;
translating the text from language A to language B;
generating the speech of language B according to the text of language B, said expressive speech-to-speech method is characterized by further comprising the steps of;
extracting expressive parameters from the speech of language A; and
mapping the expressive parameters extracted by the detecting steps from language A to language B, and driving the text-to-speech generation process by the mapping results to synthesize expressive speech. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A speech-to-speech generation method, comprising the steps of:
-
recognizing the speech of dialect A and creating the corresponding text;
generating the speech of another dialect B according to the text, said speech-to-speech generation method is characterized by further comprising steps;
extracting expressive parameters from the speech of dialect A; and
mapping the expressive parameters extracted by the detecting steps from dialect A to dialect B, and driving the text-to-speech generating process by the mapping results to synthesis expressive speech. - View Dependent Claims (17, 18, 19, 20)
-
Specification