METHOD AND APPARATUS FOR IMPROVING SPEECH RECOGNITION PROCESSING PERFORMANCE
First Claim
Patent Images
1. A method for improving computation time of speech recognition processing in an electronic device, the method comprising:
- by a processor;
obtaining a table value, from a plurality of table values each corresponding to a unique summation of vector element values of a first vector, via an index corresponding to an encoded form of a combination of quantized element values of a second vector; and
computing a dot product value of the first and second vectors using the table value obtained, the vector element values, and the quantized element values, the processor using fewer mathematical operations to compute the dot product value relative to a standard dot product computation of the first and second vectors, the speech recognition processing performing multiple dot product computations.
1 Assignment
0 Petitions
Accused Products
Abstract
Computing the feature Maximum Mutual Information (fMMI) method requires multiplication of vectors with a huge matrix. The huge matrix is subdivided into block sub-matrices. The sub-matrices are quantized into different values and compressed by replacing the quantized element values with 1 or 2 bit indices. Fast multiplication with those compressed matrices with far fewer multiply/accumulate operations compared to standard matrix computation is enabled and additionally obviates a de-compression method for decompressing the sub-matrices before use.
-
Citations
20 Claims
-
1. A method for improving computation time of speech recognition processing in an electronic device, the method comprising:
by a processor; obtaining a table value, from a plurality of table values each corresponding to a unique summation of vector element values of a first vector, via an index corresponding to an encoded form of a combination of quantized element values of a second vector; and computing a dot product value of the first and second vectors using the table value obtained, the vector element values, and the quantized element values, the processor using fewer mathematical operations to compute the dot product value relative to a standard dot product computation of the first and second vectors, the speech recognition processing performing multiple dot product computations. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
11. An apparatus configured to perform speech recognition processing, the apparatus comprising:
by a processor; an obtaining unit to obtain a table value, from a plurality of table values each corresponding to a unique summation of vector element values of a first vector, via an index corresponding to an encoded form of a combination of quantized element values of a second vector; and a computation unit to compute a dot product value of the first and second vectors using the table value obtained, the vector element values, and the quantized element values, the processor using fewer mathematical operations to compute the dot product value relative to a standard dot product computation of the first and second vectors, the speech recognition processing performing multiple dot product computations. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
20. A non-transitory computer-readable medium having encoded thereon a sequence of instructions which, when executed by a processor, causes the processor to:
-
obtain a table value, from a plurality of table values each corresponding to a unique summation of vector element values of a first vector, via an index corresponding to an encoded form of a combination of quantized element values of a second vector; and compute a dot product value of the first and second vectors using the table value obtained, the vector element values, and the quantized element values, the processor using fewer mathematical operations to compute the dot product value relative to a standard dot product computation of the first and second vectors, the processor performing multiple dot product computations.
-
Specification