MICROPHONE-ARRAY-BASED SPEECH RECOGNITION SYSTEM AND METHOD
Abstract
A microphone-array-based speech recognition system combines a noise cancelling technique for cancelling noise of input speech signals from an array of microphones, according to at least an inputted threshold. The system receives noise-cancelled speech signals outputted by a noise masking module through at least a speech model and at least a filler model, then computes a confidence measure score with the at least a speech model and the at least a filler model for each threshold and each noise-cancelled speech signal, and adjusts the threshold to continue the noise cancelling for achieving a maximum confidence measure score, thereby outputting a speech recognition result related to the maximum confidence measure score.
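The loop described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names (`recognize_with_best_threshold`, `toy_mask`, `toy_speech_model`, `toy_filler_model`), the toy masking rule, the template-matching scores, and the definition of the confidence measure as the speech-model score minus the filler-model score are all hypothetical. The source only states that the score is computed with both models for each threshold.

```python
from typing import Callable, Iterable, List, Optional, Sequence, Tuple

Signal = Sequence[float]


def confidence_measure(speech_score: float, filler_score: float) -> float:
    """Assumed form: speech-model score minus filler-model score."""
    return speech_score - filler_score


def recognize_with_best_threshold(
    signal: Signal,
    thresholds: Iterable[float],
    mask_noise: Callable[[Signal, float], List[float]],
    speech_model: Callable[[Signal], Tuple[str, float]],
    filler_model: Callable[[Signal], float],
) -> Tuple[str, float, float]:
    """Return (hypothesis, threshold, score) for the highest confidence score."""
    best: Optional[Tuple[str, float, float]] = None
    for t in thresholds:
        cleaned = mask_noise(signal, t)              # noise masking at threshold t
        hypothesis, s_score = speech_model(cleaned)  # speech model: result and score
        cm = confidence_measure(s_score, filler_model(cleaned))
        if best is None or cm > best[2]:
            best = (hypothesis, t, cm)
    if best is None:
        raise ValueError("at least one threshold is required")
    return best


# Toy stand-ins (hypothetical) so the sketch runs end to end.
TEMPLATE = [0.0, 1.0, 0.0, 1.0]


def toy_mask(signal: Signal, threshold: float) -> List[float]:
    # Zero out samples whose magnitude falls below the threshold.
    return [x if abs(x) >= threshold else 0.0 for x in signal]


def toy_speech_model(cleaned: Signal) -> Tuple[str, float]:
    # Score is the negative squared distance to a single word template.
    return "on", -sum((x - t) ** 2 for x, t in zip(cleaned, TEMPLATE))


def toy_filler_model(cleaned: Signal) -> float:
    # Score is the negative squared distance to a flat "generic sound" level.
    return -sum((x - 0.5) ** 2 for x in cleaned)


result = recognize_with_best_threshold(
    [0.1, 0.9, 0.2, 0.8],
    [0.0, 0.15, 0.5, 0.95],
    toy_mask,
    toy_speech_model,
    toy_filler_model,
)
print(result)
```

With these toy inputs the threshold 0.5 wins: it removes the two low-level samples without erasing the two strong ones, so the cleaned signal sits closest to the template relative to the filler score.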
46 Citations
20 Claims
1. A microphone-array-based speech recognition system, combining a noise masking module for cancelling noise of input speech signals from an array of microphones, according to an inputted threshold, and comprising:
at least a speech model and at least a filler model that receive respectively a noise-cancelled speech signal outputted by said noise masking module;
a confidence measure score computation module that computes a confidence measure score with said at least a speech model and said at least a filler model for said threshold and the noise-cancelled speech signal; and
a threshold adjustment module that adjusts said threshold and provides to said noise masking module to continue the noise cancelling for achieving a maximum confidence measure score computed by said confidence measure score computation module, thereby outputting a speech recognition result related to said maximum confidence measure score.
Dependent Claims: 2, 3, 4, 5, 6, 7, 8
9. A microphone-array-based speech recognition system combining a noise masking module for cancelling noise of input speech signals from an array of microphones, according to each of a plurality of given thresholds within a predetermined range, and comprising:
at least a speech model and at least a filler model that receive respectively a noise-cancelled speech signals after said cancelling noise;
a confidence measure score computation module that computes a confidence measure score with said at least a speech model and said at least a filler model for each given threshold within said predetermined range and said noise-cancelled speech signals; and
a maximum confidence measure score decision module that determines a maximum confidence measure score from all confidence measure scores computed by said confidence measure score computation module and obtains a threshold corresponding to said maximum confidence measure score, and outputs corresponding speech recognition result.
Dependent Claims: 10, 11, 12
13. A microphone-array-based speech recognition method implemented by a computer system, said method comprising following acts executed by said computer system:
executing noise cancelling of input speech signals from an array of microphones according to at least an inputted threshold, and transmitting a noise-cancelled speech signal to at least a speech model and at least a filler model respectively;
computing a corresponding confidence measure score based on score information for each of said at least a speech model and a score for said at least a filler model;
from each of said at least an inputted threshold, finding a threshold corresponding to a maximum confidence measure score among all computed confidence measure scores, and generating speech recognition result.
Dependent Claims: 14, 15, 16, 17, 18, 19, 20
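Claim 1 describes a threshold adjustment module that feeds adjusted thresholds back to the noise masking module, whereas claim 9 evaluates every given threshold in a predetermined range. The claims do not say how the adjustment is made, so the sketch below simply assumes a hill-climbing rule with step halving. The names `adjust_threshold` and `score_at` are hypothetical; `score_at` stands for one pass of noise masking plus confidence scoring at a given threshold.

```python
from typing import Callable, Tuple


def adjust_threshold(
    score_at: Callable[[float], float],
    start: float,
    step: float,
    lo: float,
    hi: float,
    min_step: float = 1e-3,
    max_iters: int = 50,
) -> Tuple[float, float]:
    """Hill-climb the threshold toward a higher confidence score.

    Assumed rule (not from the source): try one step down and one step up,
    move to whichever neighbour improves the score, and halve the step when
    neither does. Stops when the step falls below min_step.
    """
    t = min(max(start, lo), hi)   # clamp the starting threshold into range
    best = score_at(t)
    for _ in range(max_iters):
        # Score the in-range neighbours of the current threshold.
        scored = [(score_at(c), c) for c in (t - step, t + step) if lo <= c <= hi]
        if scored and max(scored)[0] > best:
            best, t = max(scored)
        else:
            step /= 2
            if step < min_step:
                break
    return t, best


# Toy confidence curve (hypothetical) with a single peak at threshold 0.6.
def toy_score(threshold: float) -> float:
    return -(threshold - 0.6) ** 2


found_t, found_score = adjust_threshold(toy_score, start=0.0, step=0.25, lo=0.0, hi=1.0)
print(found_t, found_score)
```

A local search like this needs fewer masking passes than a full sweep, but it can stop at a local maximum if the confidence curve has several peaks; an exhaustive sweep over the range avoids that at a higher cost.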
Specification