Technique for adaptation of hidden markov models for speech recognition
First Claim
1. Apparatus for recognizing selected speech based on acoustic models, the apparatus comprising:
- a processor responsive to selected data representing a sample of the selected speech for modifying the acoustic models; and
a mechanism for defining a hierarchical structure which includes a plurality of levels, each level having one or more nodes thereon, each level in the structure arranged higher than every other level having a smaller number of nodes thereon than the other level, each node being associated with a probability measure which is determined based on at least the selected data, each node on each level having two or more nodes thereon being connected to a node on a second level higher than the level having two or more nodes thereon, the probability measure associated with each node on each level having two or more nodes thereon being a function of at least the probability measure associated with the node connected thereto on the second level, the acoustic models being modified based on at least the probability measure associated with each node on a selected level.
3 Assignments
0 Petitions
Accused Products
Abstract
A speech recognition system learns characteristics of speech by a user during a learning phase to improve its performance. Adaptation data derived from the user'"'"'s speech and its recognized result is collected during the learning phase. Parameters characterizing hidden Markov Models (HMMs) used in the system for speech recognition are modified based on the adaptation data. To that end, a hierarchical structure is defined in an HMM parameter space. This structure may assume the form of a tree structure having multiple layers, each of which includes one or more nodes. Each node on each layer is connected to at least one node on another layer. The nodes on the lowest layer of the tree structure are referred to as "leaf nodes." Each node in the tree structure represents a subset of the HMM parameters, and is associated with a probability measure which is derived from the adaptation data. In particular, each leaf node represents a different one of the HMM parameters, which is derivable from the probability measure associated with the leaf node. This probability measure is a function of the probability measures which are associated with the nodes connected to the leaf node, and which represent "hierarchical priors" to such a probability measure.
-
Citations
38 Claims
-
1. Apparatus for recognizing selected speech based on acoustic models, the apparatus comprising:
-
a processor responsive to selected data representing a sample of the selected speech for modifying the acoustic models; and a mechanism for defining a hierarchical structure which includes a plurality of levels, each level having one or more nodes thereon, each level in the structure arranged higher than every other level having a smaller number of nodes thereon than the other level, each node being associated with a probability measure which is determined based on at least the selected data, each node on each level having two or more nodes thereon being connected to a node on a second level higher than the level having two or more nodes thereon, the probability measure associated with each node on each level having two or more nodes thereon being a function of at least the probability measure associated with the node connected thereto on the second level, the acoustic models being modified based on at least the probability measure associated with each node on a selected level. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for recognizing selected speech based on acoustic models, which are characterized by a plurality of parameters, the system comprising:
-
a device for providing selected data representing a sample of the selected speech; a processor for defining a structure which includes a plurality of levels, each level including one or more nodes, each node being associated with a respective probability measure, which is derived from at least the selected data; a mechanism for identifying at least one sequence of nodes from different levels; and an adaptor for modifying at least one of the parameters based on at least the probability measure associated with a selected node in the sequence, the probability measure associated with the selected node being a function of the probability measure associated with every other node in the sequence. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A method for use in an apparatus for recognizing selected speech based on acoustic models, the method comprising:
-
modifying the acoustic models in response to selected data representing a sample of the selected speech; and defining a hierarchical structure which includes a plurality of levels, each level having one or more nodes thereon, each level in the structure arranged higher than every other level having a smaller number of nodes thereon than the other level, each node being associated with a probability measure which is determined based on at least the selected data, each node on each level having two or more nodes thereon being connected to a node on a second level higher than the level having two or more nodes thereon, the probability measure associated with each node on each level having two or more nodes thereon being a function of at least the probability measure associated with the node connected thereto on the second level, the acoustic models being modified based on at least the probability measure associated with each node on a selected level. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29)
-
-
30. A method for use in a system for recognizing selected speech based on acoustic models, which are characterized by a plurality of parameters, the method comprising:
-
providing selected data representing a sample of the selected speech; defining a structure which includes a plurality of levels, each level including one or more nodes, each node being associated with a respective probability measure, which is derived from at least the selected data; identifying at least one sequence of nodes from different levels; and modifying at least one of the parameters based on at least the probability measure associated with a selected node in the sequence, the probability measure associated with the selected node being a function of the probability measure associated with every other node in the sequence. - View Dependent Claims (31, 32, 33, 34, 35, 36, 37, 38)
-
Specification