Adaptive speech recognition method with noise compensation
First Claim
1. An adaptive speech recognition method with noise compensation capable of compensating noises of an input speech by adjusting parameters of a HMM (Hidden Markov Model) speech model, the input speech having a plurality of speech frames, the method comprising the steps of:
- (A) determining, based on the plurality of speech frames of the input speech and the speech model, optimal equalization factors for feature vectors of the plurality of speech frames corresponding to each probability density function in the speech model, wherein the optimal equalization factor is determined based on the parameters θ
ik=(μ
ik, Σ
ik) of the speech model, and is equivalent to a projection of the speech frame upon Σ
ik−
1μ
ik, where μ
ik and Σ
ik are respectively the mean vector and covariance matrix of the k-th mixture density function for a state ŝ
t=i in the speech model; and
(B) adapting the parameters of the speech model by the optimal equalization factor and a bias compensation vector corresponding to and retrieved by the optimal equalization factor, wherein the optimal equalization factor is provided to adjust a distance of the mean vector in the speech model, and the bias compensation vector is provided to adjust a direction change of the mean vector in the speech model.
1 Assignment
0 Petitions
Accused Products
Abstract
An adaptive speech recognition method with noise compensation is disclosed. In speech recognition, optimal equalization factors for feature vectors of a plurality of speech frames corresponding to each probability density function in a speech model are determined based on the plurality of speech frames of the input speech and the speech model. The parameters of the speech model are adapted by the optimal equalization factor and a bias compensation vector, which is corresponding to and retrieved by the optimal equalization factor. The optimal equalization factor is provided to adjust a distance of the mean vector in the speech model. The bias compensation vector is provided to adjust a direction change of the mean vector in the speech model.
-
Citations
9 Claims
-
1. An adaptive speech recognition method with noise compensation capable of compensating noises of an input speech by adjusting parameters of a HMM (Hidden Markov Model) speech model, the input speech having a plurality of speech frames, the method comprising the steps of:
-
(A) determining, based on the plurality of speech frames of the input speech and the speech model, optimal equalization factors for feature vectors of the plurality of speech frames corresponding to each probability density function in the speech model, wherein the optimal equalization factor is determined based on the parameters θ
ik=(μ
ik, Σ
ik) of the speech model, and is equivalent to a projection of the speech frame upon Σ
ik−
1μ
ik, where μ
ik and Σ
ik are respectively the mean vector and covariance matrix of the k-th mixture density function for a state ŝ
t=i in the speech model; and
(B) adapting the parameters of the speech model by the optimal equalization factor and a bias compensation vector corresponding to and retrieved by the optimal equalization factor, wherein the optimal equalization factor is provided to adjust a distance of the mean vector in the speech model, and the bias compensation vector is provided to adjust a direction change of the mean vector in the speech model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
calculating the optimal equalization factors for pairs of each speech frame and all parameters in the speech model based on the speech model and the noisy speech data. calculating adaptation bias vectors corresponding to the optimal equalization factors; and
piecewisely estimating, based on the relation between the optimal equalization factors and the adaptation bias vectors, the bias compensation ill vectors by averaging the adaptation bias vectors.
-
-
5. The method as claimed in claim 4, wherein, the reference function table can be modified on line by actual input speech in executing a recognition process.
-
6. The method as claimed in claim 1, wherein, in step (B), based on the determined optimal equalization factor λ
-
e and the retrieved bias compensation vector b(λ
e), a calculation of probability for speech recognition is adapted by λ
eμ
ik+b(λ
e), where μ
ik is the mean vector of the k-th mixture density function for a state ŝ
t=i in the speech model.
-
e and the retrieved bias compensation vector b(λ
-
7. The method as claimed in claim 1, further comprising a step, before step (A), for performing a feature analysis on the speech frames of the input speech.
-
8. The method as claimed in claim 1, further comprising a step, after step (A), for executing a Viterbi decoding algorithm.
-
9. An adaptive speech recognition method with noise compensation capable of compensating noises of an input speech by adjusting parameters of a HMM (Hidden Markov Model) speech model, the input speech having a plurality of speech frames, the method comprising the steps of:
-
(A) determining, based on the plurality of speech frames of the input speech and the speech model, optimal equalization factors for feature vectors of the plurality of speech frames corresponding to each probability density function in the speech model; and
(B) adapting the parameters of the speech model by the optimal equalization factor and a bias compensation vector corresponding to and retrieved by the optimal equalization factor, wherein the optimal equalization factor is provided to adjust a distance of the mean vector in the speech model, and the bias compensation vector is provided to adjust a direction change of the mean vector in the speech model, wherein the bias compensation vector is retrieved from a reference function table by using a corresponding optimal equalization factor as an index, so as to adjust the direction of the mean vector and remove projection bias.
-
Specification