Speech synthesis apparatus and method, and storage medium
First Claim
1. A speech synthesis apparatus comprising:
- distortion output means for obtaining a distortion produced upon modifying a synthesis unit on the basis of predetermined prosody information; and
unit registration means for selecting a synthesis unit to be registered in a synthesis unit inventory used in speech synthesis on the basis of the distortion output from said distortion output means.
1 Assignment
0 Petitions
Accused Products
Abstract
Input text data undergoes language analysis to generate prosody, and a speech database is searched for a synthesis unit on the basis of the prosody. A modification distortion of the found synthesis unit, and concatenation distortions upon connecting that synthesis unit to those in the preceding phoneme are computed, and a distortion determination unit weights the modification and concatenation distortions to determine the total distortion. An Nbest determination unit obtains N best paths that can minimize the distortion using the A* search algorithm, and a registration unit determination unit selects a synthesis unit to be registered in a synthesis unit inventory on the basis of the N best paths in the order of frequencies of occurrence, and registers it in the synthesis unit inventory.
20 Citations
21 Claims
-
1. A speech synthesis apparatus comprising:
-
distortion output means for obtaining a distortion produced upon modifying a synthesis unit on the basis of predetermined prosody information; and
unit registration means for selecting a synthesis unit to be registered in a synthesis unit inventory used in speech synthesis on the basis of the distortion output from said distortion output means. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 19)
-
-
11. A speech synthesis method comprising:
-
a distortion output step of obtaining a distortion produced upon modifying a synthesis unit on the basis of predetermined prosody information; and
a unit registration step of selecting a synthesis unit to be registered in a synthesis unit inventory used in speech synthesis on the basis of the distortion output from the distortion output step. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 20, 21)
-
Specification