Unified clustering tree
Abstract
A unified clustering tree (500) generates phoneme clusters based on an input sequence of phonemes. The number of possible clusters is significantly smaller than the number of possible combinations of input phonemes. Nodes (510, 511) in the unified clustering tree are arranged into levels such that the clustering tree generates clusters for multiple speech recognition models. Models that correspond to higher levels in the unified clustering tree are coarse relative to the finer-grained models at lower levels of the clustering tree.
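As a rough illustration of the idea in the abstract, the following Python sketch builds a toy three-level tree and records, for one phoneme window, the cluster reached at the end of each level. The node layout, the phoneme sets, the three question functions, and the rule that a level's cluster is the node reached on leaving that level are all assumptions made for this example; the abstract does not specify them.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional, Tuple

# A context window: (left2, left1, center, right1, right2). Hypothetical layout.
Context = Tuple[str, str, str, str, str]
Question = Callable[[Context], bool]

# Illustrative phoneme classes, not taken from the patent.
VOWELS = {"aa", "ae", "ih", "iy", "uw"}
NASALS = {"m", "n", "ng"}


@dataclass
class Node:
    name: str
    level: int                            # 1 = coarsest level; 0 marks a leaf
    question: Optional[Question] = None   # None means no further split
    yes: Optional["Node"] = None
    no: Optional["Node"] = None


def l1_is_vowel(ctx: Context) -> bool:    # coarse: immediate left neighbor
    return ctx[1] in VOWELS


def l2_is_nasal(ctx: Context) -> bool:    # finer: outer left neighbor
    return ctx[0] in NASALS


def r2_is_vowel(ctx: Context) -> bool:    # finest: outer right neighbor
    return ctx[4] in VOWELS


def leaf(name: str) -> Node:
    return Node(name, level=0)


def level3(name: str) -> Node:
    return Node(name, 3, r2_is_vowel, leaf(name + "a"), leaf(name + "b"))


def level2(name: str) -> Node:
    return Node(name, 2, l2_is_nasal, level3(name + "1"), level3(name + "2"))


ROOT = Node("root", 1, l1_is_vowel, level2("A"), level2("B"))


def classify(root: Node, ctx: Context) -> Dict[int, str]:
    """Walk the tree once and return one cluster name per level."""
    clusters: Dict[int, str] = {}
    node = root
    while node.question is not None:
        child = node.yes if node.question(ctx) else node.no
        if child.level != node.level:     # leaving a level fixes its cluster
            clusters[node.level] = child.name
        node = child
    return clusters


print(classify(ROOT, ("n", "aa", "t", "ih", "m")))
```

A single traversal yields a coarse cluster for a level-1 model and progressively finer clusters for the level-2 and level-3 models. This toy tree has only eight finest clusters, whereas the number of distinct five-phoneme windows grows as the fifth power of the phoneme inventory size.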
14 Claims
1. A speech recognition system comprising:
- a clustering tree configured to classify a series of sounds into predefined clusters based on one of the sounds and on a predetermined number of neighboring sounds that surround the one of the sounds, where the clustering tree comprises:
  - a first level with a first hierarchical arrangement of decision nodes in which the decision nodes of the first hierarchical arrangement are associated with a first group of questions relating to the series of sounds,
  - a second level with a second hierarchical arrangement of decision nodes in which the decision nodes of the second hierarchical arrangement are associated with a second group of questions relating to the series of sounds, the second group of questions discriminating at a finer level of granularity within the series of sounds than the first group of questions, and
  - a third level with a third hierarchical arrangement of decision nodes in which the decision nodes of the third hierarchical arrangement are associated with a third group of questions discriminating at a finer level of granularity within the series of sounds than the second group of questions; and
- a plurality of speech recognition models trained to recognize speech based on the predefined clusters, the plurality of speech recognition models comprising:
  - a first model associated with the first level and including a triphone non-crossword speech recognition model,
  - a second model associated with the second level and including a quinphone non-crossword speech recognition model, and
  - a third model associated with the third level and including a quinphone crossword speech recognition model.

Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. A device comprising:
- means for classifying a series of sounds into predefined clusters using a clustering tree and based on one of the sounds and a predetermined number of neighboring sounds that surround the one of the sounds, where the clustering tree includes:
  - a first level with a first hierarchical arrangement of decision nodes in which the decision nodes of the first hierarchical arrangement are associated with a first group of questions relating to the series of sounds;
  - a second level with a second hierarchical arrangement of decision nodes in which the decision nodes of the second hierarchical arrangement are associated with a second group of questions relating to the series of sounds, the second group of questions discriminating at a finer level of granularity within the series of sounds than the first group of questions; and
  - a third level with a third hierarchical arrangement of decision nodes in which the decision nodes of the third hierarchical arrangement are associated with a third group of questions discriminating at a finer level of granularity within the series of sounds than the second group of questions; and
- means for training a plurality of speech recognition models to recognize speech based on the predefined clusters, the speech recognition models including:
  - a first model associated with the first level and including a triphone non-crossword speech recognition model,
  - a second model associated with the second level and including a quinphone non-crossword speech recognition model, and
  - a third model associated with the third level and including a quinphone crossword speech recognition model.
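The three model types named in the claims differ in how much neighboring context they see: a triphone window holds one neighbor on each side of the center sound, a quinphone window holds two, and a crossword model lets those neighbors come from adjacent words. A minimal Python sketch of building such windows follows. The function name `contexts`, the `<sil>` padding symbol, and the choice to pad at word edges in the non-crossword case are assumptions for illustration, not details taken from the claims.

```python
from typing import List, Tuple

PAD = "<sil>"  # hypothetical filler for positions with no neighbor


def contexts(words: List[List[str]], width: int,
             crossword: bool) -> List[Tuple[str, ...]]:
    """Return one window of 2*width+1 phonemes per input phoneme.

    width=1 gives triphone windows, width=2 gives quinphone windows.
    With crossword=False, neighbors never cross a word boundary;
    with crossword=True, the utterance is treated as one span.
    """
    if crossword:
        spans = [[p for word in words for p in word]]
    else:
        spans = words
    windows: List[Tuple[str, ...]] = []
    for span in spans:
        padded = [PAD] * width + list(span) + [PAD] * width
        for i in range(len(span)):
            # padded[i + width] is the center phoneme of this window
            windows.append(tuple(padded[i:i + 2 * width + 1]))
    return windows


utterance = [["k", "ae", "t"], ["s", "ih", "t"]]
print(contexts(utterance, width=1, crossword=False)[2])
print(contexts(utterance, width=2, crossword=False)[2])
print(contexts(utterance, width=2, crossword=True)[2])
```

For the final "t" of the first word, the non-crossword windows are padded on the right, while the crossword quinphone window extends into the following word.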
Specification