Defining atom units between phone and syllable for TTS systems
First Claim
1. A method of developing a unit inventory for use by a text to speech system, comprising:
- identifying a list of phones for a target language;
receiving a lexicon containing phonetic transcriptions of a plurality of words having a plurality of syllables;
identifying a set of common multi-phone atom units for the lexicon; and
adding the set of common multi-phone atom units to the unit inventory for the target language.
2 Assignments
0 Petitions
Accused Products
Abstract
A method for identifying common multiphone units to add to a unit inventory for a text-to-speech generator is disclosed. The common multiphone units are units that are larger than a phone, but smaller than a syllable. The method slices each syllable into a plurality of slices. These slices are then sorted and the frequency of each slice is determined. Those slices whose frequencies exceed a threshold are added to the unit inventory. The remaining slices are decomposed according to a predetermined set of rules to determine if they contain slices that should be added to the unit inventory.
-
Citations
19 Claims
-
1. A method of developing a unit inventory for use by a text to speech system, comprising:
-
identifying a list of phones for a target language;
receiving a lexicon containing phonetic transcriptions of a plurality of words having a plurality of syllables;
identifying a set of common multi-phone atom units for the lexicon; and
adding the set of common multi-phone atom units to the unit inventory for the target language. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. An apparatus for generating speech from text, comprising:
-
a unit inventory for storing a set of phoneme based atom units for at least one target speaker;
a text analyzer for obtaining a string of phonetic symbols representative of a text to be converted to speech; and
a concatenation module for selecting stored phoneme-based atom units from the unit inventory based on the context of the phonetic symbols for the text; and
synthesizing the selected phoneme-based atom units to generate speech corresponding to the text. - View Dependent Claims (13, 14, 15)
-
-
16. A unit inventory for use in text-to-speech generation, comprising:
-
a set of monophone units for a target language; and
a set of atom units for the target language. - View Dependent Claims (17, 18, 19)
-
Specification