Method of pattern recognition using noise reduction uncertainty
First Claim
1. A method of recognizing acoustic states from a noisy speech signal, the method comprising:
- removing noise from a representation of a portion of the noisy speech signal to produce a representation of a portion of a cleaned speech signal, wherein removing noise from a representation of a portion of a noisy speech signal comprises removing noise from a feature vector representing a frame of the noisy speech signal and wherein removing noise from a feature vector comprises;
identifying a mixture component based on the feature vector for the noisy speech signal;
identifying a correction vector and an error value associated with the correction vector based on the identified mixture component; and
using the correction vector, the error value, and the feature vector for the noisy speech signal to identify a feature vector for a frame of the cleaned speech signal;
identifying an uncertainty associated with removing the noise;
using the uncertainty to adjust a probability distribution associated with an acoustic state to form a modified probability distribution; and
applying the representation of a portion of the cleaned speech signal to the modified probability distribution to decode an acoustic state for speech recognition.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus are provided for using the uncertainty of a noise-removal process during pattern recognition. In particular, noise is removed from a representation of a portion of a noisy signal to produce a representation of a cleaned signal. In the meantime, an uncertainty associated with the noise removal is computed and is used with the representation of the cleaned signal to modify a probability for a phonetic state in the recognition system. In particular embodiments, the uncertainty is used to modify a probability distribution, by increasing the variance in each Gaussian distribution by the amount equal to the estimated variance of the cleaned signal, which is used in decoding the phonetic state sequence in a pattern recognition task.
-
Citations
10 Claims
-
1. A method of recognizing acoustic states from a noisy speech signal, the method comprising:
-
removing noise from a representation of a portion of the noisy speech signal to produce a representation of a portion of a cleaned speech signal, wherein removing noise from a representation of a portion of a noisy speech signal comprises removing noise from a feature vector representing a frame of the noisy speech signal and wherein removing noise from a feature vector comprises; identifying a mixture component based on the feature vector for the noisy speech signal; identifying a correction vector and an error value associated with the correction vector based on the identified mixture component; and using the correction vector, the error value, and the feature vector for the noisy speech signal to identify a feature vector for a frame of the cleaned speech signal; identifying an uncertainty associated with removing the noise; using the uncertainty to adjust a probability distribution associated with an acoustic state to form a modified probability distribution; and applying the representation of a portion of the cleaned speech signal to the modified probability distribution to decode an acoustic state for speech recognition. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer-readable medium having computer-executable instructions for performing steps comprising:
-
converting a frame of a noisy speech signal into a feature vector comprising at least two components; removing noise from a component of the feature vector for the noisy speech signal to produce a component of a feature vector for a cleaned speech signal, wherein removing noise comprises; identifying a correction vector based on the feature vector for the noisy speech signal; and using the correction vector and the feature vector for the noisy speech signal to form the feature vector for the cleaned speech signal; identifying an uncertainty associated with removing the noise from the component; determining a probability component of a probability for a phonetic state by applying the component for the cleaned speech signal to a distribution for the phonetic state defined in part by the uncertainty associated with removing the noise from the component by computing a variance for the distribution using the uncertainty as a term in the computation; and using the probability component to determine the probability of the phonetic state during speech recognition regardless of the value of the uncertainty. - View Dependent Claims (8, 9, 10)
-
Specification