System and method for speech recognition
Abstract
A speech recognition system and method use an initial noise model produced from pre-estimated noise of a service environment and an initial synthesized model of a voice containing noise. At recognition time, the system and method produce an utterance environment noise model from background noise of the service environment, as well as a sequence of feature vectors from noise-superimposed speech that includes an uttered voice and the background noise. The system and method also produce an adaptive model by adapting the initial synthesized model using the utterance environment noise model, the initial noise model, and a compensation model; the adaptive model is then checked against the sequence of feature vectors to perform speech recognition. The compensation model is created during speech recognition so that it reflects the signal-to-noise ratio between the uttered voice and the background noise present at the time of the actual utterance.
36 Citations
20 Claims
1. A speech recognition system having an initial noise model produced based on pre-estimated noise of a service environment, a clean speech model of noiseless speech, and an initial synthesized model produced by combining the initial noise model and the clean speech model, the system performing speech recognition by producing an utterance environment noise model from background noise of the service environment upon speech recognition, producing a sequence of feature vectors from noise-superimposed speech including an uttered voice and the background noise, producing an adaptive model by adapting the initial synthesized model using the utterance environment noise model and the initial noise model, and checking the adaptive model against the sequence of feature vectors, the speech recognition system comprising:
compensation means for providing compensation in accordance with the sequence of feature vectors upon producing the adaptive model.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A speech recognition method comprising the steps of:
providing an initial noise model produced based on pre-estimated noise of a service environment, a clean speech model of noiseless speech, and an initial synthesized model produced by combining the initial noise model and the clean speech model;
producing an utterance environment noise model from background noise of the service environment upon speech recognition;
producing a sequence of feature vectors from noise-superimposed speech including an uttered voice and the background noise;
producing an adaptive model by adapting the initial synthesized model using the utterance environment noise model and the initial noise model; and
checking the adaptive model against the sequence of feature vectors to perform speech recognition, wherein the step of producing the adaptive model includes the step of providing compensation in accordance with the sequence of feature vectors.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
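The steps of claim 11 can be illustrated with a minimal sketch. It is not the patented implementation: the claims do not say how the models are combined, adapted, or compensated, so everything below is an assumption made for illustration. Each model is reduced to a single log-spectral mean vector, combination is a per-bin log-add, adaptation is a first-order (Jacobian-style) update driven by the difference between the utterance environment noise and the initial noise, and `compensate` is a hypothetical stand-in that shifts the adapted model by one scalar level offset estimated from the feature vectors. All function names are invented for this example.

```python
import math

def combine(clean_log, noise_log):
    """Initial synthesized model: per-bin log-add of clean speech and noise (assumed)."""
    return [math.log(math.exp(s) + math.exp(n)) for s, n in zip(clean_log, noise_log)]

def noise_jacobian(clean_log, noise_log):
    """Per-bin derivative of the log-add with respect to the noise term: N / (S + N)."""
    return [math.exp(n) / (math.exp(s) + math.exp(n)) for s, n in zip(clean_log, noise_log)]

def adapt(synth_log, jacobian, initial_noise_log, utterance_noise_log):
    """First-order update using the utterance noise model and the initial noise model."""
    return [m + j * (nu - n0)
            for m, j, nu, n0 in zip(synth_log, jacobian, utterance_noise_log, initial_noise_log)]

def compensate(adapted_log, feature_vectors):
    """Hypothetical compensation: match the model's average level to the observed frames."""
    observed_level = sum(sum(f) / len(f) for f in feature_vectors) / len(feature_vectors)
    model_level = sum(adapted_log) / len(adapted_log)
    offset = observed_level - model_level
    return [m + offset for m in adapted_log]

def score(model_log, feature_vectors):
    """Negative squared distance between model and frames (higher is better)."""
    return -sum((x - m) ** 2 for f in feature_vectors for x, m in zip(f, model_log))

def recognize(clean_models, initial_noise_log, utterance_noise_log, feature_vectors):
    """Check each compensated adaptive model against the feature vectors; return the best word."""
    best_word, best_score = None, -math.inf
    for word, clean_log in clean_models.items():
        synth = combine(clean_log, initial_noise_log)
        jac = noise_jacobian(clean_log, initial_noise_log)
        adapted = adapt(synth, jac, initial_noise_log, utterance_noise_log)
        adapted = compensate(adapted, feature_vectors)
        current = score(adapted, feature_vectors)
        if current > best_score:
            best_word, best_score = word, current
    return best_word

# Toy data (made up): two word models, three frequency bins.
clean_models = {"yes": [2.0, 0.0, 1.0], "no": [0.0, 2.0, 1.0]}
initial_noise = [-1.0, -1.0, -1.0]      # pre-estimated noise of the service environment
utterance_noise = [-0.5, -0.5, -0.5]    # background noise measured at recognition time
# Observed frames: "yes" spoken 0.3 louder than the clean model, plus the utterance noise.
frames = [combine([2.3, 0.3, 1.3], utterance_noise) for _ in range(3)]
result = recognize(clean_models, initial_noise, utterance_noise, frames)
```

In this toy setup the speaker is louder than the clean model assumes, so the signal-to-noise ratio at utterance time differs from the one built into the initial synthesized model. The scalar offset in `compensate` is only one simple way to let the feature vectors influence the adaptive model; a real system would operate on full acoustic models rather than single mean vectors.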
Specification