Synthesis by generation and concatenation of multi-form segments
First Claim
1. A speech synthesis system implemented using at least one hardware implemented processor, the system comprising:
a speech segment database referencing speech segments having a plurality of different types of speech representational structures including;
i. statistical state model based speech signals, and
ii. template based speech signals;
a speech segment selector for selecting from the speech segment database a sequence of statistical state model based and template based speech segment candidates corresponding to a target text;
a speech segment sequencer for generating from the speech segment candidates sequenced statistical state model based and template based speech segments corresponding to the target text; and
a speech segment synthesizer for combining the sequenced statistical state model based and template based speech segments to produce a synthesized speech signal output corresponding to the target text.
Abstract
A speech synthesis system and method is described. A speech segment database references speech segments having various different speech representational structures. A speech segment selector selects from the speech segment database a sequence of speech segment candidates corresponding to a target text. A speech segment sequencer generates from the speech segment candidates sequenced speech segments corresponding to the target text. A speech segment synthesizer combines the selected sequenced speech segments to produce a synthesized speech signal output corresponding to the target text.
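The four components named in the abstract can be sketched as a small pipeline. The following is only an illustrative sketch under stated assumptions, not the patented implementation: the names `Segment`, `SegmentDatabase`, `text_to_units`, `select_candidates`, `sequence_segments`, and `synthesize` are hypothetical, the one-letter-per-unit mapping is a toy stand-in for real linguistic analysis, and the "prefer a template segment when one exists" rule is an assumed placeholder for whatever sequencing criterion the specification describes.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Segment:
    unit: str             # label of the speech unit this segment realizes (hypothetical)
    kind: str             # "model" (statistical state model based) or "template"
    samples: List[float]  # stand-in for the segment's signal data


class SegmentDatabase:
    """References segments of both representational types, indexed by unit."""

    def __init__(self, segments: List[Segment]) -> None:
        self._by_unit: Dict[str, List[Segment]] = {}
        for seg in segments:
            self._by_unit.setdefault(seg.unit, []).append(seg)

    def candidates(self, unit: str) -> List[Segment]:
        return list(self._by_unit.get(unit, []))


def text_to_units(text: str) -> List[str]:
    # Toy assumption: one unit per alphabetic character.
    return [ch for ch in text.lower() if ch.isalpha()]


def select_candidates(db: SegmentDatabase, text: str) -> List[List[Segment]]:
    # Selector: one candidate list per target unit.
    return [db.candidates(unit) for unit in text_to_units(text)]


def sequence_segments(candidates: List[List[Segment]]) -> List[Segment]:
    # Sequencer: assumed rule, prefer a template segment, else fall back
    # to a model based one; units with no candidates are skipped.
    chosen: List[Segment] = []
    for cands in candidates:
        if not cands:
            continue
        templates = [c for c in cands if c.kind == "template"]
        chosen.append(templates[0] if templates else cands[0])
    return chosen


def synthesize(segments: List[Segment]) -> List[float]:
    # Synthesizer: plain concatenation of the sequenced segments.
    output: List[float] = []
    for seg in segments:
        output.extend(seg.samples)
    return output


db = SegmentDatabase([
    Segment("a", "model", [0.9]),
    Segment("a", "template", [0.1, 0.2]),
    Segment("b", "model", [0.5]),
])
sequenced = sequence_segments(select_candidates(db, "ab"))
signal = synthesize(sequenced)
```

In this sketch the output mixes both representational types: the unit `a` is taken from a template segment and `b` from a model based segment, which is the multi-form concatenation the title refers to.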
38 Claims
1. A speech synthesis system implemented using at least one hardware implemented processor, the system comprising:
a speech segment database referencing speech segments having a plurality of different types of speech representational structures including;
i. statistical state model based speech signals, and
ii. template based speech signals;
a speech segment selector for selecting from the speech segment database a sequence of statistical state model based and template based speech segment candidates corresponding to a target text;
a speech segment sequencer for generating from the speech segment candidates sequenced statistical state model based and template based speech segments corresponding to the target text; and
a speech segment synthesizer for combining the sequenced statistical state model based and template based speech segments to produce a synthesized speech signal output corresponding to the target text.
Dependent claims: 2-19
20. A method of speech synthesis comprising:
with a system implemented using at least one hardware implemented processor;
referencing in a speech segment database speech segments having a plurality of different types of speech representational structures including;
i. statistical state model based speech signals, and
ii. template based speech signals;
selecting from the speech segment database a sequence of statistical state model based and template based speech segment candidates corresponding to a target text;
generating from the speech segment candidates sequenced statistical state model based and template based speech segments corresponding to the target text; and
combining the sequenced statistical state model based and template based speech segments to produce a synthesized speech signal output corresponding to the target text.
Dependent claims: 21-38
Specification