SYSTEM AND METHOD FOR UNIT SELECTION TEXT-TO-SPEECH USING A MODIFIED VITERBI APPROACH
Abstract
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for speech synthesis. A system practicing the method receives a set of ordered lists of speech units; for each respective speech unit in each ordered list in the set, constructs a sublist of the speech units from the next ordered list that are suitable for concatenation; performs a cost analysis of paths through the set of ordered lists of speech units based on the sublist constructed for each respective speech unit; and synthesizes speech using the lowest cost path of speech units through the set of ordered lists, as determined by the cost analysis. The ordered lists can be ordered based on the respective pitch of each speech unit. In one embodiment, speech units which do not have an assigned pitch can be assigned a pitch.
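The steps summarized in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that each candidate list is sorted by pitch, that "suitable for concatenation" means a pitch difference below a chosen limit, and that the cost analysis is a Viterbi-style dynamic program restricted to the sublists. The names `Unit`, `build_sublists`, `lowest_cost_path`, `target_cost`, `join_cost`, and `max_diff` are hypothetical and do not appear in the source.

```python
from bisect import bisect_left, bisect_right
from dataclasses import dataclass

INF = float("inf")


@dataclass(frozen=True)
class Unit:
    name: str     # hypothetical identifier of a recorded speech unit
    pitch: float  # pitch in Hz, assumed to be already assigned


def build_sublists(current, nxt, max_diff):
    """For each unit in `current`, return the index range of units in `nxt`
    whose pitch differs by less than `max_diff` (assumed suitability test).
    `nxt` must be sorted by ascending pitch."""
    pitches = [u.pitch for u in nxt]
    sublists = []
    for u in current:
        lo = bisect_right(pitches, u.pitch - max_diff)  # first pitch > p - d
        hi = bisect_left(pitches, u.pitch + max_diff)   # first pitch >= p + d
        sublists.append(range(lo, hi))
    return sublists


def lowest_cost_path(lists, target_cost, join_cost, max_diff):
    """Viterbi-style search that only follows transitions in the sublists.
    Returns (path, cost), or (None, inf) if no complete path exists."""
    best = [target_cost(u) for u in lists[0]]
    back = []  # back[t][j] = index in lists[t] of the best predecessor
    for cur, nxt in zip(lists, lists[1:]):
        sublists = build_sublists(cur, nxt, max_diff)
        new_best = [INF] * len(nxt)
        ptr = [-1] * len(nxt)
        for i, u in enumerate(cur):
            if best[i] == INF:
                continue  # unit i is not reachable, so skip it
            for j in sublists[i]:
                cost = best[i] + join_cost(u, nxt[j]) + target_cost(nxt[j])
                if cost < new_best[j]:
                    new_best[j] = cost
                    ptr[j] = i
        back.append(ptr)
        best = new_best
    if not best or min(best) == INF:
        return None, INF
    end = min(range(len(best)), key=best.__getitem__)
    indices = [end]
    for ptr in reversed(back):
        indices.append(ptr[indices[-1]])
    indices.reverse()
    return [lists[t][i] for t, i in enumerate(indices)], best[end]


# Toy data: three positions, two candidates each, sorted by pitch.
lists = [
    [Unit("a1", 100.0), Unit("a2", 150.0)],
    [Unit("b1", 105.0), Unit("b2", 200.0)],
    [Unit("c1", 110.0), Unit("c2", 190.0)],
]
path, cost = lowest_cost_path(
    lists,
    target_cost=lambda u: 0.0,                      # assumed: no target cost
    join_cost=lambda a, b: abs(a.pitch - b.pitch),  # assumed: pitch mismatch
    max_diff=20.0,
)
print([u.name for u in path], cost)  # ['a1', 'b1', 'c1'] 10.0
```

Because each list is sorted by pitch, every sublist is a contiguous index range, so the inner loop visits only the candidate pairs that pass the pitch test rather than the full cross product of adjacent lists.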
18 Claims
1. A system for speech synthesis, the system comprising:
a processor;
a first module controlling the processor to receive a set of ordered lists of speech units;
a second module controlling the processor, for each respective speech unit in each ordered list in the set of ordered lists, to construct a sublist of speech units from a next ordered list which are suitable for concatenation;
a third module controlling the processor to perform a cost analysis of paths through the set of ordered lists of speech units based on the sublist of speech units for each respective speech unit; and
a fourth module controlling the processor to synthesize speech using a lowest cost path of speech units through the set of ordered lists based on the cost analysis.
Dependent claims: 2, 3, 4, 5, 6.
7. A method of speech synthesis, the method comprising:
in a text-to-speech synthesis system that uses unit selection, imposing ordering constraints on speech units, the ordering constraints indicating speech unit pairs which are suitable for concatenation based on a respective pitch of each speech unit; and
when performing unit selection to synthesize speech, considering speech unit pairs in which a difference in pitch is below a threshold value based on the imposed ordering constraints.
Dependent claims: 8, 9, 10, 11, 12.
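The pair restriction recited in claim 7 can be sketched as follows. This is an illustrative reading only. It assumes pitches are plain numbers in Hz, that both candidate lists are sorted in ascending order, and that "below a threshold value" is a strict comparison. The function name `allowed_pairs` and the sample values are hypothetical.

```python
def allowed_pairs(left, right, threshold):
    """Return index pairs (i, j) with |left[i] - right[j]| < threshold.
    Both lists hold pitches in Hz and must be sorted in ascending order."""
    pairs = []
    start = 0  # first index in `right` that may still be close enough
    for i, p in enumerate(left):
        # Pitches too low for p are also too low for every later,
        # higher pitch in `left`, so `start` never moves backwards.
        while start < len(right) and right[start] <= p - threshold:
            start += 1
        j = start
        while j < len(right) and right[j] < p + threshold:
            pairs.append((i, j))
            j += 1
    return pairs


left = [100.0, 150.0, 200.0]          # pitches of candidates for one position
right = [95.0, 110.0, 160.0, 240.0]   # pitches of candidates for the next
pairs = allowed_pairs(left, right, threshold=20.0)
print(pairs)                                  # [(0, 0), (0, 1), (1, 2)]
print(len(pairs), "of", len(left) * len(right), "pairs considered")
```

In this toy case only 3 of the 12 possible pairs pass the pitch test, which shows how the ordering constraint reduces the number of concatenations that unit selection has to evaluate.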
13. A non-transitory computer-readable storage medium storing instructions which, when executed by a computing device, cause the computing device to perform speech synthesis, the instructions comprising:
receiving a set of ordered lists of speech units;
for each respective speech unit in each ordered list in the set of ordered lists, constructing a sublist of speech units from a next ordered list which are suitable for concatenation;
performing a cost analysis of paths through the set of ordered lists of speech units based on the sublist of speech units for each respective speech unit; and
synthesizing speech using a lowest cost path of speech units through the set of ordered lists based on the cost analysis.
Dependent claims: 14, 15, 16, 17, 18.
Specification