Speech Recognition Based on a Multilingual Acoustic Model
First Claim
1. A computer-implemented method for generating a multilingual acoustic model for use in a speech recognition system, comprising:
- providing to a processor from memory a main acoustic model including a set of probability distribution functions and a probabilistic state sequence model;
providing to the processor from memory at least one second acoustic model including a set of probability distribution functions and a probabilistic state sequence model;
in a computer process, replacing each of the probability distribution function of the at least second acoustic model by one of the probability distribution functions from the main codebook and/or each state of the probabilistic state sequence model from the second acoustic model by a state of the probabilistic state sequence model from the main acoustic model based upon a criteria set to form a modified second acoustic model; and
in a computer process, combining the main acoustic model and the at least one modified second acoustic model to form the multilingual acoustic model.
7 Assignments
0 Petitions
Accused Products
Abstract
Embodiments of the invention relate to methods for generating a multilingual acoustic model. A main acoustic model comprising a main acoustic model having probability distribution functions and a probabilistic state sequence model including first states is provided to a processor. At least one second acoustic model including probability distribution functions and a probabilistic state sequence model including states is also provided to the processor. The processor replaces each of the probability distribution functions of the at least one second acoustic model by one of the probability distribution functions and/or each of the states of the probabilistic state sequence model of the at least one second acoustic model with the state of the probabilistic state sequence model of the main acoustic model based on a criteria set to obtain at least one modified second acoustic model. The criteria set may be a distance measurement. The processor then combines the main acoustic model and the at least one modified second acoustic model to obtain the multilingual acoustic model.
30 Citations
40 Claims
-
1. A computer-implemented method for generating a multilingual acoustic model for use in a speech recognition system, comprising:
-
providing to a processor from memory a main acoustic model including a set of probability distribution functions and a probabilistic state sequence model; providing to the processor from memory at least one second acoustic model including a set of probability distribution functions and a probabilistic state sequence model; in a computer process, replacing each of the probability distribution function of the at least second acoustic model by one of the probability distribution functions from the main codebook and/or each state of the probabilistic state sequence model from the second acoustic model by a state of the probabilistic state sequence model from the main acoustic model based upon a criteria set to form a modified second acoustic model; and in a computer process, combining the main acoustic model and the at least one modified second acoustic model to form the multilingual acoustic model. - View Dependent Claims (2, 3, 4, 10, 14, 16, 17, 18, 37, 39)
-
-
5. A computer-implemented method for generating a speech recognizer comprising a multilingual acoustic model, comprising:
-
providing to a processor from memory a main acoustic model including a set of probability distribution functions and a probabilistic state sequence model; providing to the processor from memory at least one second acoustic model including a set of probability distribution functions and a probabilistic state sequence model; in a computer process, determining mean vectors of states for the main acoustic model; in a computer process, determining new probabilistic state sequence models for the main acoustic model based on the determined mean vectors of states; in a computer process, replacing the second probabilistic state sequence model of the at least one second acoustic model by the closest new probabilistic state sequence model of the main acoustic model to obtain at least one modified second acoustic model; and in a computer process combining the main acoustic model and the at least one modified second acoustic model to obtain the multilingual acoustic model. - View Dependent Claims (6, 7, 8, 9, 11, 12, 13, 15, 38, 40)
-
-
19. A computer program product comprising a tangible computer readable medium having executable computer code thereon for generating a multilingual acoustic model, the computer code comprising:
-
computer code for retrieving a main acoustic model including a plurality of probability distribution functions and a probabilistic state sequence model that includes first states; computer code for retrieving least one second acoustic model including a plurality of second probability distribution functions and a second probabilistic state sequence model that includes states; computer code for replacing each of the probability distribution functions of the at least one second acoustic model by one of the probability distribution functions of the main acoustic model and/or each of the states of the probabilistic state sequence model of the at least one second acoustic model with a state of the probabilistic state sequence model of the main acoustic model based on a criteria set to obtain at least one modified second acoustic model; and computer code for combining the main acoustic model and the at least one modified second acoustic model to obtain the multilingual acoustic model. - View Dependent Claims (20, 21, 22)
-
-
23. A computer program product comprising a tangible computer readable medium having executable computer code thereon for generating a speech recognizer comprising a multilingual acoustic model, the computer code comprising:
-
computer code for retrieving a main acoustic model including first probability distribution functions and a probabilistic state sequence model having states; computer code for retrieving at least one second acoustic model including probability distribution functions and a probabilistic state sequence model including states; computer code for determining mean vectors of states for the states of the probabilistic state sequence model of the main acoustic model; computer code for determining new probabilistic state sequence models for the main acoustic model based on the determined mean vectors of states; computer code for replacing the second probabilistic state sequence model of the at least one second acoustic model by a state sequence model of the main acoustic model based upon a criteria set to obtain at least one modified second acoustic model; and computer code for combining the main acoustic model and the at least one modified second acoustic model to obtain the multilingual acoustic model. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36)
-
Specification