Environmentally compensated speech processing
First Claim
1. A computerized method for processing speech signals, comprising:
storing first vectors representing clean speech signals in a vector codebook;
determining second vectors from dirty speech signals;
estimating environmental parameters from the second vectors;
predicting third vectors based on the estimated environmental parameters to correct the first vectors;
applying the third vectors to the second vectors to produce corrected vectors; and
comparing the corrected vectors and the first vectors to identify first vectors which resemble the corrected vectors;
wherein said method further comprises one of the following two steps:
(1) using a search algorithm to determine a hypothesis sequence of phonemes of said first vectors that is statistically closest to a sequence of said corrected vectors, and
(2) determining mean and covariance for predicted statistics of said dirty speech signals and measuring likelihood that an utterance was generated by a particular speaker based upon an expectation maximization process.
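For illustration only, the likelihood measurement in alternative (2) can be sketched as scoring an utterance against per-speaker statistics. The sketch below assumes each speaker is summarized by a single mean vector and a diagonal covariance, which the claim does not specify. It does not implement the expectation maximization process, and the names `gaussian_loglik` and `most_likely_speaker` are hypothetical.

```python
import math

def gaussian_loglik(frames, mean, var):
    """Sum of diagonal-Gaussian log densities over all frames.

    Assumption: one Gaussian per speaker with a diagonal covariance `var`.
    """
    total = 0.0
    for frame in frames:
        for x, m, v in zip(frame, mean, var):
            total += -0.5 * (math.log(2.0 * math.pi * v) + (x - m) ** 2 / v)
    return total

def most_likely_speaker(frames, speakers):
    """`speakers` maps a name to (mean, var); return the best-scoring name."""
    return max(speakers, key=lambda name: gaussian_loglik(frames, *speakers[name]))

# Hypothetical two-speaker example with two-dimensional feature vectors.
speakers = {
    "speaker_a": ([0.0, 0.0], [1.0, 1.0]),
    "speaker_b": ([3.0, 3.0], [1.0, 1.0]),
}
utterance = [[2.9, 3.1], [3.2, 2.8]]
best = most_likely_speaker(utterance, speakers)
```

In a fuller treatment the mean and covariance would be the predicted statistics of the dirty speech, re-estimated iteratively; here they are simply given.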
3 Assignments
0 Petitions
Abstract
In a computerized method for processing speech signals, first vectors representing clean speech signals are stored in a vector codebook. Second vectors are determined from dirty speech signals. Noise and distortion parameters are estimated from the second vectors. Third vectors are predicted based on the estimated noise and distortion parameters. The third vectors are used to correct the first vectors. The third vectors can then be applied to the second vectors to produce corrected vectors. The corrected vectors and the first vectors can be compared to identify first vectors which resemble the corrected vectors.
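The sequence of steps in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation. It assumes a log-spectral environment model z = x + q + log(1 + exp(n - x - q)), with clean vector x, noise n, and channel distortion q. It also uses a crude per-dimension minimum as a stand-in noise estimator. Neither choice is taken from the text above, and all function names are hypothetical.

```python
import math

def estimate_noise(dirty_frames):
    """Stand-in estimator (assumption): per-dimension minimum over all frames."""
    return [min(dim) for dim in zip(*dirty_frames)]

def predict_dirty(clean, noise, channel):
    """Assumed environment model: z = x + q + log(1 + exp(n - x - q))."""
    return [x + q + math.log1p(math.exp(n - x - q))
            for x, n, q in zip(clean, noise, channel)]

def nearest(vectors, target):
    """Index of the vector with the smallest squared Euclidean distance to target."""
    return min(range(len(vectors)),
               key=lambda k: sum((a - b) ** 2 for a, b in zip(vectors[k], target)))

def compensate(codebook, dirty, noise, channel):
    """Return (corrected vector, index of the closest clean codeword)."""
    # How each clean codeword would look after passing through the environment.
    predicted = [predict_dirty(cw, noise, channel) for cw in codebook]
    # "Third vectors": per-codeword corrections, clean minus predicted dirty.
    corrections = [[x - z for x, z in zip(cw, pz)]
                   for cw, pz in zip(codebook, predicted)]
    # Select the correction whose predicted dirty codeword best matches the input.
    k = nearest(predicted, dirty)
    corrected = [z + r for z, r in zip(dirty, corrections[k])]
    # Compare the corrected vector with the clean codebook.
    return corrected, nearest(codebook, corrected)

# Hypothetical two-entry codebook and environment parameters.
codebook = [[0.0, 0.0], [5.0, 5.0]]
noise, channel = [1.0, 1.0], [0.5, 0.5]
dirty = predict_dirty(codebook[1], noise, channel)
corrected, index = compensate(codebook, dirty, noise, channel)
```

Selecting a single best-matching correction keeps the sketch short; a weighted combination of corrections over several codewords would be an equally plausible reading of the abstract.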
108 Citations
12 Claims
Specification