Method and apparatus for generating decision tree questions for speech processing
Abstract
The present invention automatically builds question sets for a decision tree. Under the invention, mutual information is used to cluster tokens, representing either phones or letters. Each cluster is formed so as to limit the loss in mutual information in a set of training data caused by the formation of the cluster. The resulting sets of clusters represent questions that can be used at the nodes of the decision tree.
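The selection step described in the abstract can be sketched in a few lines of Python. This is an illustration only, not the patented implementation: it assumes the mutual information score is the average mutual information between adjacent tokens, estimated from pair counts, and the names `mutual_information`, `merge`, and `select_cluster` are hypothetical.

```python
import math
from collections import Counter


def mutual_information(pair_counts):
    """Average mutual information (bits) between the left and right
    members of adjacent token pairs, estimated from raw counts."""
    total = sum(pair_counts.values())
    left, right = Counter(), Counter()
    for (a, b), n in pair_counts.items():
        left[a] += n
        right[b] += n
    mi = 0.0
    for (a, b), n in pair_counts.items():
        # p(a,b) * log2( p(a,b) / (p(a) * p(b)) )
        mi += (n / total) * math.log2(n * total / (left[a] * right[b]))
    return mi


def merge(pair_counts, x, y):
    """Return new counts in which tokens x and y are replaced by one cluster."""
    name = f"{x}+{y}"  # assumed naming scheme for the merged cluster
    merged = Counter()
    for (a, b), n in pair_counts.items():
        a2 = name if a in (x, y) else a
        b2 = name if b in (x, y) else b
        merged[(a2, b2)] += n
    return merged


def select_cluster(pair_counts, candidates):
    """Among candidate clusters (pairs of tokens), pick the one whose
    formation loses the least mutual information."""
    base = mutual_information(pair_counts)
    scored = [(base - mutual_information(merge(pair_counts, x, y)), (x, y))
              for x, y in candidates]
    return min(scored, key=lambda item: item[0])


# Toy training counts (made up): "a" and "b" occur in identical contexts.
counts = Counter({("a", "x"): 2, ("b", "x"): 2, ("c", "y"): 4})
loss, cluster = select_cluster(counts, [("a", "c"), ("a", "b")])
```

With these toy counts, grouping "a" with "b" loses essentially no mutual information, so it is preferred over grouping "a" with "c". Repeating the step on the merged counts would yield progressively larger clusters, each a candidate question.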
21 Citations
19 Claims
1. A method of forming a decision tree used in speech processing, the method comprising:
grouping at least two tokens to form a first possible cluster;
determining a mutual information score based on the first possible cluster;
grouping at least two tokens to form a second possible cluster;
determining a mutual information score based on the second possible cluster;
selecting one of the first cluster and the second cluster based on the mutual information scores associated with the first cluster and the second cluster; and
using the selected cluster to form a question in the decision tree.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A computer-readable medium having computer-executable instructions for performing steps comprising:
using mutual information to form clusters of tokens found in training data;
building a decision tree by utilizing at least one of the clusters of tokens to form a question for a node in the decision tree; and
using the decision tree to identify a leaf node of the tree based on an input.
Dependent claims: 9, 10, 11, 12, 13, 14
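The last step of claim 8, using the tree to reach a leaf for a given input, might look like the following sketch. It assumes each internal node asks whether the token at some offset from the current position belongs to a cluster. The `Node` layout, the `find_leaf` helper, and the letter-to-sound example are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional, Sequence


@dataclass
class Node:
    # Internal node question: "is the token at `offset` from the focus
    # position a member of `cluster`?"
    cluster: Optional[FrozenSet[str]] = None
    offset: int = 0
    yes: Optional["Node"] = None
    no: Optional["Node"] = None
    label: Optional[str] = None  # set only on leaf nodes


def find_leaf(node: Node, tokens: Sequence[str], position: int) -> Node:
    """Walk from `node` to a leaf by answering one cluster question per level."""
    while node.label is None:
        i = position + node.offset
        # Positions outside the input answer "no" to every question.
        token = tokens[i] if 0 <= i < len(tokens) else None
        node = node.yes if token in node.cluster else node.no
    return node


# Illustrative one-question tree for the letter "c":
# is the next letter in the cluster {e, i, y}?
root = Node(
    cluster=frozenset({"e", "i", "y"}),
    offset=1,
    yes=Node(label="s"),
    no=Node(label="k"),
)

soft = find_leaf(root, list("city"), 0).label
hard = find_leaf(root, list("cat"), 0).label
```

Here `soft` is the "s" leaf and `hard` is the "k" leaf. In a full system the cluster at each node would come from the mutual information clustering rather than being written by hand.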
15. A method of forming a decision tree, the method comprising:
identifying at least two possible clusters of tokens in a set of training data;
using co-occurrence frequency counts of clusters to select one of the at least two possible clusters;
using the selected cluster as a question for a node in the decision tree.
Dependent claims: 16, 17, 18, 19
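Claim 15 relies on co-occurrence frequency counts of clusters. One plausible way to collect such counts is sketched below, under the assumption that "co-occurrence" means adjacency within a training sequence. The function name `cluster_pair_counts` and the toy data are hypothetical.

```python
from collections import Counter


def cluster_pair_counts(sequences, cluster_of):
    """Count how often each ordered pair of clusters occurs side by side.

    `cluster_of` maps a token to its cluster name; tokens not yet assigned
    to any cluster are counted under their own name.
    """
    counts = Counter()
    for seq in sequences:
        mapped = [cluster_of.get(token, token) for token in seq]
        # zip pairs each position with its right-hand neighbour.
        counts.update(zip(mapped, mapped[1:]))
    return counts


# Toy training data (made up): phone sequences with p, b, t grouped as STOP.
training = [["p", "a", "t"], ["b", "a", "t"]]
assignment = {"p": "STOP", "b": "STOP", "t": "STOP"}
pair_counts = cluster_pair_counts(training, assignment)
```

Counts of this form are enough to compare candidate clusters, because the joint and marginal frequencies needed for a selection score can all be derived from them.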
Specification