Adaptation of compressed acoustic models
First Claim
Patent Images
1. A method of adapting an acoustic model for use in a speech recognition engine, comprising:
- subspace coding the acoustic model to obtain a plurality of codebooks each including a plurality of codewords, the plurality of codebooks including at least one codebook per subspace; and
adapting the codewords in the codebooks based on adaptation training data.
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention is used to adapt acoustic models, quantized in subspaces, using adaptation training data (such as speaker-dependent training data). The acoustic model is compressed into multi-dimensional subspaces. A codebook is generated for each subspace. An adaptation transform is estimated, and it is applied to codewords in the codebooks, rather than to the means themselves.
43 Citations
21 Claims
-
1. A method of adapting an acoustic model for use in a speech recognition engine, comprising:
-
subspace coding the acoustic model to obtain a plurality of codebooks each including a plurality of codewords, the plurality of codebooks including at least one codebook per subspace; and
adapting the codewords in the codebooks based on adaptation training data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer implemented method of training an acoustic model in a speech recognizer, comprising:
-
generating a subspace coded acoustic model having a plurality of codebooks, one codebook corresponding to each acoustic subspace into which the acoustic model is coded, each codebook having a plurality of codewords therein, each codeword representing at least one component of an acoustic characteristic of a modeled speech unit; and
modifying the codewords based on adaptation training data. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer readable medium storing instructions which, when executed, cause a computer to perform steps of:
-
receiving a subspace coded acoustic model including a codebook corresponding to each subspace and a plurality of codewords in each codebook;
receiving training data; and
adapting the codewords in the codebooks based on the training data. - View Dependent Claims (20, 21)
-
Specification