Speech synthesis using concatenation of speech waveforms
First Claim
Patent Images
1. A system for speech unit selection comprising:
- a large speech database referencing speech waveforms and associated symbolic prosodic features, wherein the speech database is accessed by speech waveform designators, at least one designator being associated with a sequence of one or more diphones; and
a speech waveform selector, in communication with the speech database, that selects based, at least in part, on the symbolic prosodic features stored in the speech database, waveforms referenced by the speech database.
10 Assignments
0 Petitions
Accused Products
Abstract
A high quality speech synthesizer in various embodiments concatenates speech waveforms referenced by a large speech database. Speech quality is further improved by speech unit selection and concatenation smoothing.
206 Citations
7 Claims
-
1. A system for speech unit selection comprising:
-
a large speech database referencing speech waveforms and associated symbolic prosodic features, wherein the speech database is accessed by speech waveform designators, at least one designator being associated with a sequence of one or more diphones; and
a speech waveform selector, in communication with the speech database, that selects based, at least in part, on the symbolic prosodic features stored in the speech database, waveforms referenced by the speech database. - View Dependent Claims (2, 4, 5, 6, 7)
-
-
3. A system for speech unit selection comprising:
-
a large speech database referencing speech waveforms;
a speech waveform selector, in communication with the speech database, that selects waveforms referenced by the speech database using criteria that, at least in part, favor (i) waveform candidates based directly on high level prosody features, and (ii) approximately equally all waveform candidates having low level prosody features within a target range determined as a function of high level linguistic features.
-
Specification