Method of calculating HMM output probability and speech recognition apparatus
First Claim
1. ) a method of calculating an hmm output probability, in which a feature vector obtained by speech analysis of an input speech is vector quantized using vector quantization means and a state output probability in accordance with a result of quantization is obtained by referring to tables, wherein the vector quantization means used for vector quantization are vector quantization means associated with respective subword, and the tables are provided in association with the vector quantization means associated with the subwords, and the feature vector obtained by speech analysis of the input speech is vector quantized using one of the vector quantization means associated with subwords, a result of quantization is output, and a state output probability in accordance with the result of quantization is obtained by referring to a table associated with the vector quantization means that has output the result of quantization:
1 Assignment
0 Petitions
Accused Products
Abstract
In speech recognition based on HMM, in which speech recognition is performed by performing vector quantization and obtaining an output probability by table reference, the amount of computation and use of memory area are minimized while achieving a high ability of recognition.
Codebooks used for vector quantization are, for example, if phonemes are used as subwords, codebooks for respective phonemes, such that a codebook CB1 is a codebook for a phoneme /a/ and a codebook CB2 is a codebook for a phoneme /i/, and these codebooks are associated with respective phoneme HMMs 11 and 12. When a feature vector obtained by speech analysis is vector quantized based on, for example, the codebook CB1 and a code (label) is output, tables T1, T2, and T3 for respective states of the phoneme HMM 11 associated with the codebook CB1 are each referred to in order to obtain state output probabilities corresponding to the label, and speech recognition is performed using the state output probabilities as a parameter.
10 Citations
18 Claims
-
1. ) a method of calculating an hmm output probability, in which a feature vector obtained by speech analysis of an input speech is vector quantized using vector quantization means and a state output probability in accordance with a result of quantization is obtained by referring to tables,
wherein the vector quantization means used for vector quantization are vector quantization means associated with respective subword, and the tables are provided in association with the vector quantization means associated with the subwords, and the feature vector obtained by speech analysis of the input speech is vector quantized using one of the vector quantization means associated with subwords, a result of quantization is output, and a state output probability in accordance with the result of quantization is obtained by referring to a table associated with the vector quantization means that has output the result of quantization:
-
10. ) A speech recognition apparatus for vector quantizing a feature vector obtained by speech analysis of an input speech using vector quantization means, obtaining a state output probability in accordance with a result of quantization by referring to tables, and performing speech recognition using the state output probability as a parameter, said speech recognition apparatus comprising:
-
a speech analysis unit for performing speech feature analysis of an input speech to output a feature vector sequence;
a vector quantization means storage unit storing vector quantization means associated with respective subwords;
a vector quantization processing unit for vector quantizing the feature vector sequence output from the speech analysis unit using the vector quantization means associated with subwords, stored in the vector quantization storage means, and outputting a result of quantization;
a subword HMM storage unit including various parameters required for subword HMM calculation and tables for referring to state output probabilities, provided in association with the vector quantization means associated with subwords; and
a speech recognition processing unit for referring to the table associated with vector quantization means used for vector quantization upon receiving a result of quantization from the vector quantization means, obtaining a state output probability in accordance with the result of quantization, and performing speech recognition using the state output probability as a parameter to output a result of recognition. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification