Adaptation of compressed acoustic models
Abstract
The present invention is used to adapt acoustic models, quantized in subspaces, using adaptation training data (such as speaker-dependent training data). The acoustic model is compressed into multi-dimensional subspaces. A codebook is generated for each subspace. An adaptation transform is estimated, and it is applied to codewords in the codebooks, rather than to the means themselves.
20 Claims
1. A method of adapting an acoustic model for use in a speech recognition engine, comprising:
- subspace coding the acoustic model by a computer to obtain a plurality of codebooks each including a plurality of codewords, the plurality of codebooks including at least one codebook per subspace; and
- adapting the codewords in the codebooks based on adaptation training data, by applying an adaptation transform to the codewords, regardless of whether the acoustic model is recomputed based on the adaptation training data.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A computer implemented method of training an acoustic model in a speech recognizer, comprising:
- generating by the computer a subspace coded acoustic model having a plurality of codebooks, one codebook corresponding to each acoustic subspace into which the acoustic model is coded, each codebook having a plurality of codewords therein, each codeword representing at least one component of an acoustic characteristic of a modeled speech unit; and
- modifying the codewords based on adaptation training data without recomputing the acoustic model based on the adaptation training data.
Dependent claims: 12, 13, 14, 15, 16, 17, 18
19. A computer storage medium storing instructions which, when executed, cause a computer to perform steps of:
- receiving a subspace coded acoustic model including a codebook corresponding to each subspace and a plurality of codewords in each codebook;
- receiving training data; and
- adapting the codewords in the codebooks based on the training data, by grouping the codewords in each codebook into classes, and adapting the codewords differently depending on a class to which the codewords belong.
Dependent claims: 20
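The class-dependent adaptation in claim 19 can be sketched as follows. This is only an illustration under stated assumptions: the claim text here does not say how codewords are grouped or how the per-class transforms are obtained, so the threshold-based `class_of` rule, the class names, and the transform values below are hypothetical.

```python
def apply_affine(A, b, x):
    # Compute A @ x + b for a small dense matrix given as a list of rows.
    return tuple(sum(a * v for a, v in zip(row, x)) + off
                 for row, off in zip(A, b))

def adapt_by_class(codebook, class_of, class_transforms):
    # Adapt each codeword with the transform of the class it belongs to.
    adapted = []
    for cw in codebook:
        A, b = class_transforms[class_of(cw)]
        adapted.append(apply_affine(A, b, cw))
    return adapted

# One subspace codebook with three 2-dimensional codewords (toy values).
codebook = [(0.0, 0.0), (0.2, 0.1), (4.0, 4.0)]

def class_of(cw):
    # Hypothetical grouping rule: split on the first coordinate.
    return "low" if cw[0] < 2.0 else "high"

# Hypothetical per-class transforms: shift the "low" class, shrink the "high" class.
class_transforms = {
    "low": ([[1.0, 0.0], [0.0, 1.0]], [1.0, 1.0]),
    "high": ([[0.5, 0.0], [0.0, 0.5]], [0.0, 0.0]),
}

adapted_codebook = adapt_by_class(codebook, class_of, class_transforms)
```

Codewords in different classes move differently, while the codeword indices stored with the acoustic model stay unchanged.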
Specification