TECHNIQUE FOR TRAINING A PHONETIC DECISION TREE WITH LIMITED PHONETIC EXCEPTIONAL TERMS
First Claim
1. A semi-automated method for generating a phonetic decision tree with limited phonetic exceptions for a text-to-speech system comprising:
- selecting an initial subset of a set of input data;
creating an initial phonetic decision tree from the selected subset;
incorporating a predetermined set of terms from the input data to the selected subset;
testing the phonetic decision tree with the increased subset, wherein each term of the subset is phonetized using the phonetic decision tree;
categorizing a result of the testing step into a set of correctly phonetized terms and a set of incorrectly phonetized terms;
generating an exception-limited phonetic decision tree with the set of correctly phonetized terms;
determining if one or more termination conditions are satisfied; and
when the one or more termination conditions are unsatisfactorily met, automatically repeating the incorporating, testing, categorizing, generating, and determining steps.
8 Assignments
0 Petitions
Accused Products
Abstract
The present invention discloses a method for training an exception-limited phonetic decision tree. An initial subset of data can be selected and used for creating an initial phonetic decision tree. Additional terms can then be incorporated into the subset. The enlarged subset can be used to evaluate the phonetic decision tree with the results being categorized as either correctly or incorrectly phonetized. An exception-limited phonetic tree can be generated from the set of correctly phonetized terms. If the termination conditions for the method have been determined to be unsatisfactorily met, then steps of the method can be repeated.
231 Citations
20 Claims
-
1. A semi-automated method for generating a phonetic decision tree with limited phonetic exceptions for a text-to-speech system comprising:
-
selecting an initial subset of a set of input data; creating an initial phonetic decision tree from the selected subset; incorporating a predetermined set of terms from the input data to the selected subset; testing the phonetic decision tree with the increased subset, wherein each term of the subset is phonetized using the phonetic decision tree; categorizing a result of the testing step into a set of correctly phonetized terms and a set of incorrectly phonetized terms; generating an exception-limited phonetic decision tree with the set of correctly phonetized terms; determining if one or more termination conditions are satisfied; and when the one or more termination conditions are unsatisfactorily met, automatically repeating the incorporating, testing, categorizing, generating, and determining steps. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system for generating a phonetic decision tree with limited exceptions for text-to-speech processing comprising:
-
a training data set containing terms for evaluating a phonetic decision tree; a training engine configured to evaluate the phonetic decision tree using the training data set and a set of standard pronunciations, wherein the training engine categorizes the training data set into a set of correctly phonetized terms and a set of incorrectly phonetized terms; and a phonetic tree generation engine configured to create an exception-limited phonetic decision tree from the set of correctly phonetized terms. - View Dependent Claims (13, 14)
-
-
15. A method for creating a phonetic tree for speech synthesis comprising:
-
generating an initial phonetic tree from a training data set of words and corresponding word pronunciations; converting each word in the data set using the phonetic tree; comparing a text-to-speech converted word against a corresponding word pronunciation from the data set; removing from the training data set those words that were not correctly text-to-speech converted using the phonetic tree; and creating a new phonetic tree using the modified training data set resulting from the removing step, wherein the new phonetic tree is at least one of an intermediate tree used to produce a production tree and a production tree, wherein a production tree is a phonetic tree used by a speech synthesis engine to generate speech output from text input in a runtime environment. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification