Method and system for preselection of suitable units for concatenative speech
Abstract
A system and method for improving the response time of text-to-speech synthesis uses “triphone contexts” (i.e., triplets comprising a central phoneme and its immediate context) as the basic unit, instead of performing phoneme-by-phoneme synthesis. The method generates a triphone preselection cost database for use in speech synthesis by 1) selecting a triphone sequence u1-u2-u3; 2) calculating a preselection cost for each 5-phoneme sequence ua-u1-u2-u3-ub, where u2 is allowed to match any identically labeled phoneme in a database and the units ua and ub vary over the entire phoneme universe; and 3) storing a group of the selected triphone sequences exhibiting the lowest costs in a triphone preselection cost database.
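The three steps in the abstract can be illustrated with a minimal Python sketch. It is one possible reading of the steps, not the patented implementation. The source does not define the cost measure or the database layout here, so the `sequence_cost` callable, the `inventory` mapping from phoneme labels to recorded unit ids, the `n_best` cutoff, and `toy_cost` are all hypothetical names introduced only for illustration.

```python
import heapq
from itertools import product
from typing import Callable, Dict, List, Sequence, Tuple

Triphone = Tuple[str, str, str]
Context = Tuple[str, str, str, str, str]  # (ua, u1, u2, u3, ub)


def build_preselection_db(
    phonemes: Sequence[str],
    inventory: Dict[str, List[int]],                 # assumed: label -> recorded unit ids
    sequence_cost: Callable[[Context, int], float],  # assumed cost measure
    n_best: int,                                     # assumed size of the stored group
) -> Dict[Triphone, List[Tuple[int, float]]]:
    """Map each triphone to its n_best (unit_id, cost) pairs, cheapest first."""
    db: Dict[Triphone, List[Tuple[int, float]]] = {}
    # Step 1: select each triphone u1-u2-u3 in turn.
    for u1, u2, u3 in product(phonemes, repeat=3):
        best: Dict[int, float] = {}
        # u2 may match any identically labeled unit in the inventory.
        for unit_id in inventory.get(u2, []):
            # Step 2: ua and ub range over the whole phoneme set.
            for ua, ub in product(phonemes, repeat=2):
                cost = sequence_cost((ua, u1, u2, u3, ub), unit_id)
                if cost < best.get(unit_id, float("inf")):
                    best[unit_id] = cost
        # Step 3: keep only the group with the lowest costs.
        if best:
            db[(u1, u2, u3)] = heapq.nsmallest(
                n_best, best.items(), key=lambda kv: kv[1]
            )
    return db


def toy_cost(context: Context, unit_id: int) -> float:
    """Made-up cost for demonstration only; it has no acoustic meaning."""
    return (unit_id % 10) + 0.5 * (context[0] != context[4])


if __name__ == "__main__":
    demo = build_preselection_db(
        ["a", "b"], {"a": [10, 11, 12], "b": [20, 21]}, toy_cost, n_best=2
    )
    print(demo[("a", "a", "b")])
```

Under this reading, the expensive search over all outer contexts ua and ub is done once, offline, so that synthesis only has to look up a short candidate list per triphone. A real system would substitute its own cost measure for `toy_cost`.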
15 Claims
1. A triphone preselection cost database for use in speech synthesis, the database generated according to a method comprising:
1) selecting a triphone sequence u1-u2-u3;
2) calculating a preselection cost for each 5-phoneme sequence ua-u1-u2-u3-ub, where u2 is allowed to match any identically labeled phoneme in a database and the units ua and ub vary over the entire phoneme universe; and
3) storing a group of the selected triphone sequences exhibiting the lowest costs in a triphone preselection cost database.
(Dependent claims: 2, 3, 4, 5)

6. A computer-readable medium storing a triphone preselection cost database for use in speech synthesis, the database generated according to a method comprising:
1) selecting a triphone sequence u1-u2-u3;
2) calculating a preselection cost for each 5-phoneme sequence ua-u1-u2-u3-ub, where u2 is allowed to match any identically labeled phoneme in a database and the units ua and ub vary over the entire phoneme universe; and
3) storing a group of the selected triphone sequences exhibiting the lowest costs in a triphone preselection cost database.
(Dependent claims: 7, 8, 9, 10)

11. A method of generating a triphone preselection cost database for use in speech synthesis, the method comprising:
1) selecting a triphone sequence u1-u2-u3;
2) calculating a preselection cost for each 5-phoneme sequence ua-u1-u2-u3-ub, where u2 is allowed to match any identically labeled phoneme in a database and the units ua and ub vary over the entire phoneme universe; and
3) storing a group of the selected triphone sequences exhibiting the lowest costs in a triphone preselection cost database.
(Dependent claims: 12, 13, 14, 15)

Specification