Multilingual text-to-speech system with limited resources
First Claim
Patent Images
1. A multilingual text-to-speech system, comprising:
- a source datastore of primary source parameters providing information mainly about a speaker of a primary language;
a plurality of primary filter parameters providing information mainly about sounds in the primary language; and
a plurality of secondary filter parameters providing information mainly about sounds in a secondary language, wherein at least one secondary filter parameter of the plurality of secondary filter parameters is normalized to the plurality of primary filter parameters based on similarities between a) voice characteristics of the sounds whose information is provided by the plurality of primary filter parameters and b) voice characteristics of the sounds whose information is provided by the at least one secondary filter parameter, wherein the at least one secondary filter parameter is mapped to a primary source parameter.
6 Assignments
0 Petitions
Accused Products
Abstract
A multilingual text-to-speech system includes a source datastore of primary source parameters providing information about a speaker of a primary language. A plurality of primary filter parameters provides information about sounds in the primary language. A plurality of secondary filter parameters provides information about sounds in a secondary language. One or more secondary filter parameters is normalized to the primary filter parameters and mapped to a primary source parameter.
275 Citations
36 Claims
-
1. A multilingual text-to-speech system, comprising:
-
a source datastore of primary source parameters providing information mainly about a speaker of a primary language; a plurality of primary filter parameters providing information mainly about sounds in the primary language; and a plurality of secondary filter parameters providing information mainly about sounds in a secondary language, wherein at least one secondary filter parameter of the plurality of secondary filter parameters is normalized to the plurality of primary filter parameters based on similarities between a) voice characteristics of the sounds whose information is provided by the plurality of primary filter parameters and b) voice characteristics of the sounds whose information is provided by the at least one secondary filter parameter, wherein the at least one secondary filter parameter is mapped to a primary source parameter. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A method of operation for use with a multilingual text-to-speech system, comprising:
-
accessing primary source parameters providing information mainly about a speaker of a primary language; accessing primary filter parameters providing information mainly about sounds in the primary language; accessing secondary filter parameters providing information mainly about sounds in a secondary language, wherein at least one secondary filter parameter of the secondary filter parameters is normalized to the primary filter parameters based on similarities between a) voice characteristics of the sounds whose information is provided by the primary filter parameters and b) voice characteristics of the sounds whose information is provided by the at least one secondary filter parameter, wherein the at least one secondary filter parameter is mapped to a primary source parameter receiving text; and converting the text to speech based on the primary filter parameters and the secondary filter parameters. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35)
-
-
36. A multilingual text-to-speech system, comprising:
-
a primary source module having a plurality of primary source parameters providing information mainly about a speaker of a primary language, wherein the plurality of source parameters defines a first sound source, of human speech, that generates a first excitation signal in the primary language; a primary filter module having a plurality of primary filter parameters providing information mainly about sounds in the primary language, wherein the plurality of primary filter parameters define shaping applied to the first excitation signal to produce signal waveform of the sounds in the primary language; and a secondary filter module having a plurality of secondary filter parameters providing information mainly about sounds in a secondary language, wherein the plurality of secondary filter parameters define shaping applied to a second excitation signal, generated by a second sound source of human speech, to produce signal waveform of the sounds in the secondary language, wherein at least one of the plurality of secondary filter parameters is normalized to the primary filter parameters to imitate voice characteristics of the first sound source; and a mapping module that selects at least one from the plurality of primary source parameters to substitute at least one of a plurality of secondary source parameters based on linguistic similarities between a target sound defined by the substituted at least one secondary source parameter and a target sound defined by the selected at least one primary source parameter, wherein the plurality of secondary source parameters define the second sound source, wherein the system selectively applies at least one of the plurality of secondary filter parameters to the selected at least one primary source parameter.
-
Specification