HMM-based text-to-phoneme parser and method for training same
Abstract
An HMM-based text-to-phoneme parser uses probability information within a probability database to generate one or more phoneme strings for a written input word. Techniques for training the text-to-phoneme parser are provided.
35 Claims
1. A method for training a text-to-phoneme parser system, comprising:
generating first information based on pronunciations within a phonetic dictionary, said first information identifying a plurality of potential diphones;
pruning said plurality of potential diphones based on frequency of occurrence information to produce pruned diphones;
forming an extended set of phonemes that includes said pruned diphones as legal phonemes; and
generating second information, based on said extended set of phonemes, for use in performing text-to-phoneme parsing.
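The first three steps of claim 1 can be illustrated with a minimal Python sketch. It is not taken from the specification: it assumes that every adjacent phoneme pair in a dictionary pronunciation counts as a potential diphone, that pruning keeps pairs seen at least `min_count` times, and that a diphone is named by joining its two phonemes with an underscore. The function names, the toy dictionary, and the threshold are all hypothetical, and the final step (generating second information from the extended set) is not shown.

```python
from collections import Counter


def count_diphones(dictionary):
    """Count every adjacent phoneme pair across all pronunciations."""
    counts = Counter()
    for phonemes in dictionary.values():
        for left, right in zip(phonemes, phonemes[1:]):
            counts[(left, right)] += 1
    return counts


def prune_diphones(counts, min_count):
    """Keep only the pairs that occur at least min_count times (assumed rule)."""
    return {pair for pair, n in counts.items() if n >= min_count}


def extended_phoneme_set(dictionary, min_count):
    """Base phonemes plus the surviving diphones, treated as single symbols."""
    base = {p for phonemes in dictionary.values() for p in phonemes}
    pruned = prune_diphones(count_diphones(dictionary), min_count)
    return base | {left + "_" + right for left, right in pruned}


# Hypothetical toy dictionary: word -> phoneme sequence.
toy_dictionary = {
    "box": ["b", "aa", "k", "s"],
    "fox": ["f", "aa", "k", "s"],
    "tax": ["t", "ae", "k", "s"],
    "cat": ["k", "ae", "t"],
}
extended = extended_phoneme_set(toy_dictionary, min_count=3)
```

In this toy data only the pair ("k", "s") reaches the threshold, so it is the only diphone added to the seven base phonemes.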
15. A method for use in training a text-to-phoneme parser system, comprising:
segmenting words based on known word pronunciations to generate segmentation results;
generating probability information using said segmentation results, said probability information including a plurality of probability values;
identifying probability values within said probability information that are below a first threshold value; and
changing said identified probability values to a predetermined value.
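The steps of claim 15 can be sketched in Python as follows. This is an illustration under stated assumptions, not the patented procedure: segmentation results are assumed to be lists of (phoneme, letter string) pairs, the probability values are assumed to be relative-frequency emission estimates, and the threshold and predetermined value are arbitrary toy numbers. The claim does not say whether values are renormalized afterwards, so the sketch leaves them as they are.

```python
from collections import Counter


def emission_probabilities(segmentations):
    """Estimate P(letters | phoneme) as a relative frequency over the segmentations."""
    pair_counts = Counter()
    phoneme_counts = Counter()
    for word in segmentations:
        for phoneme, letters in word:
            pair_counts[(phoneme, letters)] += 1
            phoneme_counts[phoneme] += 1
    return {
        (phoneme, letters): n / phoneme_counts[phoneme]
        for (phoneme, letters), n in pair_counts.items()
    }


def floor_probabilities(probabilities, threshold, floor_value):
    """Replace every value below threshold with floor_value; leave the rest unchanged."""
    return {
        key: (floor_value if p < threshold else p)
        for key, p in probabilities.items()
    }


# Hypothetical segmentation results for "cat", "kit", "back", "cab".
toy_segmentations = [
    [("k", "c"), ("ae", "a"), ("t", "t")],
    [("k", "k"), ("ih", "i"), ("t", "t")],
    [("b", "b"), ("ae", "a"), ("k", "ck")],
    [("k", "c"), ("ae", "a"), ("b", "b")],
]
probs = emission_probabilities(toy_segmentations)
# Toy threshold and predetermined value, chosen only to make the effect visible.
floored = floor_probabilities(probs, threshold=0.3, floor_value=0.01)
```

Here the phoneme "k" is emitted by "c" in two of four cases and by "k" and "ck" once each, so only the latter two values fall below the toy threshold and are replaced.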
27. A method for use in training a text-to-phoneme parser system, comprising:
segmenting words based on known word pronunciations to generate segmentation results; and
generating probability information using said segmentation results, said probability information including generalized transition probability information, said generalized transition probability information including a probability that a specific phoneme will be induced given a previous phoneme and a letter string emitted by said previous phoneme.
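One way to picture the generalized transition probability of claim 27 is as a table conditioned on two things at once: the previous phoneme and the letter string that phoneme emitted. The Python sketch below estimates such a table by relative frequency. The data layout (lists of (phoneme, letter string) pairs), the function name, and the toy segmentations are assumptions made for illustration and do not come from the specification.

```python
from collections import Counter


def generalized_transitions(segmentations):
    """Estimate P(next phoneme | previous phoneme, letters emitted by previous phoneme)."""
    context_counts = Counter()
    triple_counts = Counter()
    for word in segmentations:
        # Pair each segment with its successor inside the same word.
        for (prev_ph, prev_letters), (next_ph, _) in zip(word, word[1:]):
            context_counts[(prev_ph, prev_letters)] += 1
            triple_counts[(prev_ph, prev_letters, next_ph)] += 1
    return {
        (prev_ph, prev_letters, next_ph): n / context_counts[(prev_ph, prev_letters)]
        for (prev_ph, prev_letters, next_ph), n in triple_counts.items()
    }


# Hypothetical segmentation results for "cat", "kit", "back", "cab".
toy_segmentations = [
    [("k", "c"), ("ae", "a"), ("t", "t")],
    [("k", "k"), ("ih", "i"), ("t", "t")],
    [("b", "b"), ("ae", "a"), ("k", "ck")],
    [("k", "c"), ("ae", "a"), ("b", "b")],
]
trans = generalized_transitions(toy_segmentations)
```

In the toy data, "k" written as "c" is always followed by "ae", while "k" written as "k" is always followed by "ih". A transition table keyed on the previous phoneme alone would merge those two cases.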
32. A text-to-phoneme parsing system, comprising:
a probability database including generalized transition probability information, said generalized transition probability information including a probability that a specific phoneme will occur given a previous phoneme and a letter string emitted by said previous phoneme; and
a text-to-phoneme parser to generate at least one phoneme string for a written input word based on information within said probability database.
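As a rough picture of how a parser might consume such a probability database, the Python sketch below runs a memoized best-path search over all ways of splitting a word into letter strings of one or two letters. It is an assumption-laden illustration, not the claimed system: the start symbol `"<s>"`, the `max_letters` limit, the scoring as a plain product of transition and emission values, and every number in the toy database are hypothetical.

```python
from functools import lru_cache


def parse(word, emission, transition, max_letters=2):
    """Return (score, phonemes) for the highest-scoring segmentation of word.

    emission maps (phoneme, letters) to a probability; transition maps
    (previous phoneme, previous letters, next phoneme) to a probability.
    A score of 0.0 with an empty tuple means no parse was found.
    """
    phonemes = sorted({ph for ph, _ in emission})

    @lru_cache(maxsize=None)
    def best(i, prev_ph, prev_letters):
        if i == len(word):
            return 1.0, ()
        top = (0.0, ())
        for n in range(1, max_letters + 1):
            letters = word[i:i + n]
            if len(letters) < n:  # ran past the end of the word
                break
            for ph in phonemes:
                p = (transition.get((prev_ph, prev_letters, ph), 0.0)
                     * emission.get((ph, letters), 0.0))
                if p == 0.0:
                    continue
                rest_p, rest = best(i + n, ph, letters)
                if p * rest_p > top[0]:
                    top = (p * rest_p, (ph,) + rest)
        return top

    return best(0, "<s>", "")


# Hypothetical toy probability database.
emission = {
    ("k", "c"): 0.6, ("k", "k"): 0.3, ("k", "ck"): 0.1,
    ("s", "c"): 0.2, ("s", "s"): 0.8,
    ("ae", "a"): 1.0, ("t", "t"): 1.0,
}
transition = {
    ("<s>", "", "k"): 0.7, ("<s>", "", "s"): 0.3,
    ("k", "c", "ae"): 1.0, ("s", "c", "ae"): 0.2,
    ("ae", "a", "t"): 1.0,
}
score, phonemes = parse("cat", emission, transition)
```

With these toy values the search prefers reading "c" as "k" over reading it as "s". Returning several ranked phoneme strings rather than one would require keeping more than the single best candidate at each step.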
Specification