Method of calculating HMM output probability and speech recognition apparatus
First Claim
1. A method of calculating an HMM output probability, comprising:
- performing vector quantization, using vector quantization devices, of a feature vector obtained by speech analysis of an input speech; and
obtaining a state output probability in accordance with a result of quantization by referring to tables;
the vector quantization devices being associated with respective subwords, and the tables being provided in association with the vector quantization devices associated with the subwords, andthe feature vector obtained by speech analysis of the input speech being vector quantized using one of the vector quantization devices associated with subwords, a result of quantization being output, and a state output probability in accordance with the result of quantization being obtained by referring to a table associated with the vector quantization devices that has output the result of quantization.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention relates to speech recognition based on HMM, in which speech recognition is performed by performing vector quantization and obtaining an output probability by table reference, and the amount of computation and use of memory area are minimized while achieving a high ability of recognition. Exemplary codebooks used for vector quantization can be provided as follows: if phonemes are used as subwords, codebooks for respective phonemes, such that a codebook CB1 is a codebook for a phoneme /a/ and a codebook CB2 is a codebook for a phoneme /i/, and these codebooks are associated with respective phoneme HMMs. When a feature vector obtained by speech analysis is vector quantized based on, for example, the codebook CB1 and a code (label) is output, tables for respective states of the phoneme HMM associated with the codebook CB1 are each referred to in order to obtain state output probabilities corresponding to the label, and speech recognition is performed using the state output probabilities as a parameter.
-
Citations
18 Claims
-
1. A method of calculating an HMM output probability, comprising:
-
performing vector quantization, using vector quantization devices, of a feature vector obtained by speech analysis of an input speech; and obtaining a state output probability in accordance with a result of quantization by referring to tables; the vector quantization devices being associated with respective subwords, and the tables being provided in association with the vector quantization devices associated with the subwords, and the feature vector obtained by speech analysis of the input speech being vector quantized using one of the vector quantization devices associated with subwords, a result of quantization being output, and a state output probability in accordance with the result of quantization being obtained by referring to a table associated with the vector quantization devices that has output the result of quantization. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A speech recognition apparatus for vector quantizing a feature vector obtained by speech analysis of an input speech using vector quantization devices, obtaining a state output probability in accordance with a result of quantization by referring to tables, and performing speech recognition using the state output probability as a parameter, the speech recognition apparatus comprising:
-
a speech analysis unit to perform speech feature analysis of an input speech to output a feature vector sequence; a vector quantization device storage unit to store vector quantization devices associated with respective subwords; a vector quantization processing unit to vector quantize the feature vector sequence output from the speech analysis unit using the vector quantization devices associated with subwords, stored in the vector quantization storage device storage unit, and to output a result of quantization; a subword HMM storage unit including various parameters required for subword HMM calculation and tables to refer to state output probabilities, provided in association with the vector quantization devices associated with subwords; and a speech recognition processing unit to refer to the table associated with vector quantization devices used for vector quantization upon receiving a result of quantization from the vector quantization devices, to obtain a state output probability in accordance with the result of quantization, and to perform speech recognition using the state output probability as a parameter to output a result of recognition. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification