Quantization using frequency and mean compensated frequency input data for robust speech recognition
First Claim
1. A signal recognition system comprising:
- a frequency parameter mean compensation module to receive frequency parameters of an input signal and to generate mean compensated frequency parameters from the received input signal frequency parameters;
a first quantizer to receive the input signal frequency parameters and to quantize the input signal frequency parameters;
a second quantizer to receive the input signal mean compensated frequency parameters and to quantize the input signal mean compensated frequency parameters; and
a backend processor to receive the quantized input signal frequency parameters and the input signal mean compensated input signal frequency parameters and to generate an input signal classification therefrom.
7 Assignments
0 Petitions
Accused Products
Abstract
A speech recognition system utilizes multiple quantizers to process frequency parameters and mean compensated frequency parameters derived from an input signal. The quantizers may be matrix and vector quantizer pairs, and such quantizer pairs may also function as front ends to a second stage speech classifiers such as hidden Markov models (HMMs) and/or utilizes neural network postprocessing to, for example, improve speech recognition performance. Mean compensating the frequency parameters can remove noise frequency components that remain approximately constant during the duration of the input signal. HMM initial state and state transition probabilities derived from common quantizer types and the same input signal may be consolidated to improve recognition system performance and efficiency. Matrix quantization exploits the “evolution” of the speech short-term spectral envelopes as well as frequency domain information, and vector quantization (VQ) primarily operates on frequency domain information. Time domain information may be substantially limited which may introduce error into the matrix quantization, and the VQ may provide error compensation. The matrix and vector quantizers may split spectral subbands to target selected frequencies for enhanced processing and may use fuzzy associations to develop fuzzy observation sequence data. A mixer may provide a variety of input data to the neural network for classification determination. Fuzzy operators may be utilized to reduce quantization error. Multiple codebooks may also be combined to form single respective codebooks for split matrix and split vector quantization to reduce processing resources demand.
-
Citations
28 Claims
-
1. A signal recognition system comprising:
-
a frequency parameter mean compensation module to receive frequency parameters of an input signal and to generate mean compensated frequency parameters from the received input signal frequency parameters;
a first quantizer to receive the input signal frequency parameters and to quantize the input signal frequency parameters;
a second quantizer to receive the input signal mean compensated frequency parameters and to quantize the input signal mean compensated frequency parameters; and
a backend processor to receive the quantized input signal frequency parameters and the input signal mean compensated input signal frequency parameters and to generate an input signal classification therefrom. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
a first stochastic module to receive quantized training output data from the first quantizer to determine the respective probabilities of each of the first group of hidden Markov models; and
a second stochastic module to receive quantized training output data from the second quantizer to determine the respective probabilities of each of the second group of hidden Markov models.
-
-
9. The signal recognition system as in claim 8 wherein the stochastic module comprises a Viterbi algorithm.
-
10. The signal recognition system as in claim 1 wherein the backend processor comprises a neural network to receive respective quantized output data from the first and second quantizers.
-
11. The signal recognition system as in claim 1 further comprising:
-
a memory having code to implement the frequency parameter compensation module, the first quantizer, the second quantizer, and the backend processor; and
a processor coupled to the memory to execute the code.
-
-
12. A method comprising the steps of:
-
sampling an input signal having a noise component;
characterizing the sampled input signal with frequency parameters;
generating mean compensated frequency parameters from the frequency parameters to substantially remove the noise component;
providing the frequency parameters to a first quantizer;
providing the mean compensated frequency parameters to a second quantizer; and
quantizing the frequency parameters with the first quantizer to generate first quantization data;
quantizing the mean compensated frequency parameters with the second quantizer to generate second quantization data; and
providing the first and second quantization data to a backend processor to classify the input signal. - View Dependent Claims (13, 14, 15, 16, 17)
mean compensating each of D frequency parameters for TO frames of the sampled input signal wherein the ith mean compensated frequency parameter, s(i)j,(m), i=1, 2, . . . , D, is generated in accordance with;
-
-
14. The method as in claim 12 wherein the step of quantizing the frequency parameters with the first quantizer comprises the steps of:
-
quantizing the frequency parameters with a matrix quantizer;
quantizing the frequency parameters with a vector quantizer; and
combining the frequency parameters quantized with the matrix quantizer with the frequency parameters quantized with the vector quantized data to generate the first quantization data.
-
-
15. The method as in claim 12 wherein the step of quantizing the mean compensated frequency parameters with the second quantizer comprises the steps of:
-
quantizing the mean compensated frequency parameters with a matrix quantizer;
quantizing the mean compensated frequency parameters with a vector quantizer; and
combining the mean compensated frequency parameters quantized with the matrix quantizer with the mean compensated frequency parameters quantized with the vector quantized data to generate the second quantization data.
-
-
16. The method as in claim 12 wherein the step of providing the first and second quantization data to a backend processor comprises the steps of:
-
providing the first and second quantization data to a stochastic module having access to data from a plurality of hidden Markov models; and
utilizing the stochastic module to determine classification probabilities from each of the respective hidden Markov models.
-
-
17. The method as in claim 16 wherein the step of providing the first and second quantization data to a backend processor further comprises the step of:
providing the classification probabilities from each of the respective hidden Markov models to a neural network.
-
18. A signal recognition system comprising:
-
a frequency parameter mean compensation module to receive frequency parameters of an input signal having a noise component and to generate mean compensated frequency parameters from the received input signal frequency parameters in order to substantially remove the noise component;
a first quantizer to receive the input signal frequency parameters and to quantize the input signal frequency parameters;
a second quantizer to receive the input signal mean compensated frequency parameters and to quantize the input signal mean compensated frequency parameters; and
a backend processor to receive the quantized input signal frequency parameters and the input signal mean compensated input signal frequency parameters and to generate an input signal classification therefrom. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
a first stochastic module to receive quantized training output data from the first quantizer to determine the respective probabilities of each of the first group of hidden Markov models; and
a second stochastic module to receive quantized training output data from the second quantizer to determine the respective probabilities of each of the second group of hidden Markov models.
-
-
26. The signal recognition system as in claim 25 wherein the stochastic module comprises a Viterbi algorithm.
-
27. The signal recognition system as in claim 18 wherein the backend processor comprises a neural network to receive respective quantized output data from the first and second quantizers.
-
28. The signal recognition system as in claim 18 further comprising:
-
a memory having code to implement the frequency parameter compensation module, the first quantizer, the second quantizer, and the backend processor; and
a processor coupled to the memory to execute the code.
-
Specification