Speech synthesis apparatus and method, and storage medium
Abstract
Input text data undergoes language analysis to generate prosody, and a speech database is searched for a synthesis unit on the basis of that prosody. A modification distortion of the found synthesis unit is computed, together with the concatenation distortions that arise when that unit is connected to the candidate units of the preceding phoneme, and a distortion determination unit weights the modification and concatenation distortions to determine the total distortion. An N-best determination unit uses the A* search algorithm to obtain the N best paths, i.e. those with the smallest distortion. A registration unit determination unit then selects synthesis units for registration in a synthesis unit inventory on the basis of the N best paths, in order of frequency of occurrence, and registers them in the inventory.
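As an illustration only, the distortion weighting and N-best search described in the abstract might be sketched as follows. The weight `w`, the function names, and the toy costs are assumptions made for this example and are not taken from the patent. The search is a plain best-first search over partial paths, which corresponds to A* with a zero heuristic; the heuristic actually used is not stated in this section.

```python
import heapq
from itertools import count


def total_distortion(modification, concatenation, w=0.5):
    """Weighted sum of the two distortions; the weight w is an assumed parameter."""
    return w * modification + (1.0 - w) * concatenation


def n_best_paths(candidates, mod_cost, concat_cost, n=2, w=0.5):
    """Return up to n lowest-distortion unit sequences as (cost, path) pairs.

    candidates[i] lists the candidate unit ids for the i-th phoneme.
    Best-first search over partial paths (A* with a zero heuristic): with
    non-negative step costs, complete paths are popped in order of
    increasing total distortion.
    """
    tie = count()  # tie-breaker so the heap never has to compare paths
    heap = [(0.0, next(tie), ())]
    results = []
    while heap and len(results) < n:
        cost, _, path = heapq.heappop(heap)
        if len(path) == len(candidates):
            results.append((cost, list(path)))
            continue
        for unit in candidates[len(path)]:
            # The first unit has no predecessor, so no concatenation distortion.
            concat = concat_cost(path[-1], unit) if path else 0.0
            step = total_distortion(mod_cost(unit), concat, w)
            heapq.heappush(heap, (cost + step, next(tie), path + (unit,)))
    return results


# Toy data (made up for illustration): two phonemes, two candidate units each.
candidates = [["a1", "a2"], ["b1", "b2"]]
mod = {"a1": 0.2, "a2": 0.4, "b1": 0.1, "b2": 0.3}
join = {("a1", "b1"): 0.6, ("a1", "b2"): 0.0,
        ("a2", "b1"): 0.2, ("a2", "b2"): 0.8}

best = n_best_paths(candidates,
                    lambda unit: mod[unit],
                    lambda prev, unit: join[(prev, unit)],
                    n=2)
for cost, path in best:
    print(path, round(cost, 3))
```

A real system would replace the zero heuristic with an admissible estimate of the remaining distortion, so that fewer partial paths are expanded before the N best complete paths are found.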
21 Citations
23 Claims
1. A synthesis unit selection apparatus comprising:
n-best obtaining means for obtaining one or more best sequences of synthesis units corresponding to a phonetic string on the basis of a distortion obtained by concatenating synthesis units;
obtaining means for obtaining a plurality of sequences by applying said n-best obtaining means to a corpus including a plurality of phonetic strings; and
selection means for selecting synthesis units on the basis of the plurality of sequences obtained by said obtaining means.
Dependent claims: 2, 3, 4, 5, 8, 9
6. (canceled)
7. (canceled)
10. (canceled)
11. A synthesis unit selection method comprising:
an n-best obtaining step of obtaining one or more best sequences of synthesis units corresponding to a phonetic string on the basis of a distortion obtained by concatenating synthesis units;
an obtaining step of obtaining a plurality of sequences by applying said n-best obtaining step to a corpus including a plurality of phonetic strings; and
a selection step of selecting synthesis units on the basis of the plurality of sequences obtained in said obtaining step.
Dependent claims: 12, 13, 14, 15, 18, 19, 21
16. (canceled)
17. (canceled)
20. (canceled)
22. A synthesis unit selection apparatus comprising:
an n-best obtaining unit configured to obtain one or more best sequences of synthesis units corresponding to a phonetic string on the basis of a distortion obtained by concatenating synthesis units;
an obtaining unit configured to obtain a plurality of sequences by applying said n-best obtaining unit to a corpus including a plurality of phonetic strings; and
a selection unit configured to select synthesis units on the basis of the plurality of sequences obtained by said obtaining unit.
23. A program for implementing a synthesis unit selection method comprising:
an n-best obtaining step module for obtaining one or more best sequences of synthesis units corresponding to a phonetic string on the basis of a distortion obtained by concatenating synthesis units;
an obtaining step module for obtaining a plurality of sequences by applying said n-best obtaining step module to a corpus including a plurality of phonetic strings; and
a selection step module for selecting synthesis units on the basis of the plurality of sequences obtained by said obtaining step module.
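For orientation, the corpus-level obtaining and selection steps recited in the independent claims could be sketched roughly as below. This is a minimal sketch under stated assumptions: the `select_units` name, the `inventory_size` cutoff, and the toy n-best table are hypothetical, and the frequency-of-occurrence ranking follows the abstract rather than the claim language, which only requires selection "on the basis of" the obtained sequences.

```python
from collections import Counter


def select_units(corpus, n_best, inventory_size):
    """Select the units that occur most often in the n-best sequences.

    corpus is an iterable of phonetic strings; n_best maps one phonetic
    string to a list of unit sequences (its one or more best sequences).
    """
    freq = Counter()
    for phonetic_string in corpus:
        for sequence in n_best(phonetic_string):
            freq.update(sequence)  # count every unit in every sequence
    return [unit for unit, _ in freq.most_common(inventory_size)]


# Toy n-best results (made up for illustration), keyed by phonetic string.
toy_n_best = {
    "k a": [["k1", "a1"], ["k2", "a1"]],
    "a k": [["a1", "k1"], ["a2", "k1"]],
}

inventory = select_units(list(toy_n_best),
                         lambda s: toy_n_best[s],
                         inventory_size=2)
print(sorted(inventory))
```

Passing the n-best routine in as a function keeps the selection step independent of how the sequences are searched for, which mirrors the way the claims separate the n-best obtaining element from the selection element.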
Specification