Front-end architecture for a multi-lingual text-to-speech system
First Claim
Patent Images
1. A text processing system for processing a sentence of multi-lingual text for a speech synthesizer, the text processing system comprising:
- a database having sampled speech units of a first language and of a second language;
a first language dependent module for performing at least one of text and prosody analysis on a first portion of the sentence comprising the first language;
a second language dependent module for performing at least one of text and prosody analysis on a second portion of the sentence comprising the second language;
a third module adapted to receive outputs from the first and second language dependent modules and perform prosodic and phonetic context modification over the outputs based on an intonation for the entire sentence, the third module generating an output sentence; and
a speech unit concatenation module for receiving the output sentence, selecting speech units from the database corresponding to the output sentence, and concatenating the speech units to form an utterance of the output sentence.
2 Assignments
0 Petitions
Accused Products
Abstract
A text processing system for processing multi-lingual text for a speech synthesizer includes a first language dependent module for performing at least one of text and prosody analysis on a portion of input text comprising a first language. A second language dependent module performs at least one of text and prosody analysis on a second portion of input text comprising a second language. A third module is adapted to receive outputs from the first and second dependent module and performs prosodic and phonetic context abstraction over the outputs based on multi-lingual text.
415 Citations
23 Claims
-
1. A text processing system for processing a sentence of multi-lingual text for a speech synthesizer, the text processing system comprising:
-
a database having sampled speech units of a first language and of a second language; a first language dependent module for performing at least one of text and prosody analysis on a first portion of the sentence comprising the first language; a second language dependent module for performing at least one of text and prosody analysis on a second portion of the sentence comprising the second language; a third module adapted to receive outputs from the first and second language dependent modules and perform prosodic and phonetic context modification over the outputs based on an intonation for the entire sentence, the third module generating an output sentence; and a speech unit concatenation module for receiving the output sentence, selecting speech units from the database corresponding to the output sentence, and concatenating the speech units to form an utterance of the output sentence. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for text processing of multi-lingual text for a speech synthesizer, the method comprising:
-
storing in a database sampled speech units of a first language and of a second language; receiving input text forming a sentence and identifying portions comprising the first language and portions comprising the second language; performing at least one of text and prosody analysis on the portions comprising the first language with a first language dependent module and performing at least one of text and prosody analysis on the portions comprising the second language with a second language dependent module; receiving outputs from the first and second language dependent modules; performing prosodic and phonetic context analysis over the outputs together based on a position in the sentence of each portion relative to the other portions and generating an output sentence; selecting speech units from the database corresponding to the output sentence; and concatenating the selected speech units to form an utterance of the output sentence. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
-
17. A computer readable storage media having instructions stored thereon, that when executed by a processor, perform speech synthesis, the instructions comprising:
-
a database having sampled speech units of a first language and of a second language; a text processing module including; a first language dependent module for performing at least one of text and prosody analysis on a first portion of input text from a sentence comprising the first language; a second language dependent module for performing at least one of text and prosody analysis on a second portion of input text from the sentence comprising a second language; a third module adapted to receive outputs from the first and second language dependent modules and perform prosodic and phonetic context modification over the outputs based on an intonation for the sentence using a combination of the first portion and the second portion of input text; and a speech unit concatenation and synthesis module adapted to receive an output from the third module, select speech units from the database corresponding to the output from the third module, concatenate the selected speech units to form an utterance of the output from the third module, and generate synthesized speech waveforms of the utterance. - View Dependent Claims (18, 19, 20, 21, 22, 23)
-
Specification