System and method for detecting the recognizability of input speech signals
First Claim
1. A system for detecting recognizability of an input signal, said system being a front stage of a speech recognition device or a dialog device, and comprising:
an environment parameter generator to generate at least one environment parameter from an input signal by using a voice activity detection (VAD) method and a missing feature imputation (MFI) method wherein the MFI method comprises a step of calculating a clean speech spectrum feature parameter, said at least one environment parameter including a confidence index of said system processing said input signal;
a signal recognition verifier to verify whether said input signal is recognizable in accordance with said at least one environment parameter, said signal recognition verifier being trained with environment parameters in advance; and
a strategy response processor;
wherein said confidence index is generated based on a probability distribution of a spectrum parameter of said input signal and the probability distribution of the spectrum parameter of a system model, and said input signal is passed to said speech recognition or dialog device when said input signal is verified as recognizable;
while said strategy response processor is triggered to respond with a plurality of strategies when said input signal is verified as unrecognizable.
Abstract
A system and method for detecting the recognizability of an input speech signal are provided. The system is designed as a pre-stage of a speech recognition or dialog system. The invention detects the user's environmental condition and verifies whether the input speech signal can be recognized. It mainly comprises an environment parameter generator, a signal recognition verifier, and a strategy response processor. Used in the pre-stage of a speech recognition or dialog system, the invention can precisely verify the recognizability of the input speech signal and, in a noisy environment, accept input speech signals that have a high recognition probability. This reduces the impact of receiving input speech signals that have a low recognition probability, and thus increases the recognition probability for a recognizer.
21 Citations
25 Claims
1. A system for detecting recognizability of an input signal, said system being a front stage of a speech recognition device or a dialog device, and comprising:
an environment parameter generator to generate at least one environment parameter from an input signal by using a voice activity detection (VAD) method and a missing feature imputation (MFI) method wherein the MFI method comprises a step of calculating a clean speech spectrum feature parameter, said at least one environment parameter including a confidence index of said system processing said input signal;
a signal recognition verifier to verify whether said input signal is recognizable in accordance with said at least one environment parameter, said signal recognition verifier being trained with environment parameters in advance; and
a strategy response processor;
wherein said confidence index is generated based on a probability distribution of a spectrum parameter of said input signal and the probability distribution of the spectrum parameter of a system model, and said input signal is passed to said speech recognition or dialog device when said input signal is verified as recognizable;
while said strategy response processor is triggered to respond with a plurality of strategies when said input signal is verified as unrecognizable.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A method for detecting the recognizability of an input signal, said method being implemented in a front stage of a speech recognition or dialog device, and comprising the steps of:
(a) generating at least one environment parameter for said input signal by using a voice activity detection (VAD) method and a missing feature imputation (MFI) method wherein the MFI method comprises a step of calculating a clean speech spectrum feature parameter, said at least one environment parameter including a confidence index of said system processing said input signal;
(b) using said at least one environment parameter to verify whether said input signal is recognizable according to verification training with environment parameters in advance; and
(c) passing said input signal to said speech recognition or dialog device when said input signal is verified as recognizable;
otherwise, triggering a strategy response processor to provide a plurality of strategies when said input signal is verified as unrecognizable;
wherein said confidence index is generated based on a probability distribution of a spectrum parameter of said input signal and the probability distribution of the spectrum parameter of a system model.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24
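As an illustration only, the control flow of steps (a) to (c) can be sketched in Python. Everything named here (FrontStage, toy_params, toy_verifier, the 0.5 threshold, the response strings) is a hypothetical stand-in and is not taken from the patent. In particular, toy_params uses mean signal energy in place of the VAD and MFI processing, and toy_verifier uses a fixed threshold in place of a verifier trained in advance.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

EnvParams = Dict[str, float]


@dataclass
class FrontStage:
    """Hypothetical front stage that gates a recognizer, following steps (a)-(c)."""
    generate_params: Callable[[List[float]], EnvParams]   # step (a)
    is_recognizable: Callable[[EnvParams], bool]          # step (b)
    recognizer: Callable[[List[float]], str]              # downstream device
    strategies: List[Callable[[EnvParams], str]]          # fallback responses

    def process(self, signal: List[float]) -> dict:
        params = self.generate_params(signal)
        if self.is_recognizable(params):
            # step (c): pass the signal on to the recognizer or dialog device
            return {"recognized": True, "result": self.recognizer(signal)}
        # otherwise: trigger every configured response strategy
        return {"recognized": False,
                "responses": [s(params) for s in self.strategies]}


def toy_params(signal: List[float]) -> EnvParams:
    # Assumption: mean energy stands in for the VAD/MFI-derived parameters.
    energy = sum(x * x for x in signal) / max(len(signal), 1)
    return {"confidence_index": min(1.0, energy)}


def toy_verifier(params: EnvParams) -> bool:
    # Assumption: a fixed threshold stands in for a verifier trained in advance.
    return params["confidence_index"] >= 0.5


front = FrontStage(
    generate_params=toy_params,
    is_recognizable=toy_verifier,
    recognizer=lambda s: "<transcript>",
    strategies=[lambda p: "Please repeat.",
                lambda p: "Please move to a quieter place."],
)
```

Calling front.process(signal) returns either the recognizer output or the list of strategy responses, so the recognizer is never invoked on a signal that the verifier rejects.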
25. A method for detecting the recognizability of an input signal, said method being implemented in a front stage of a speech recognition or dialog device, and comprising the steps of:
(a) generating at least one environment parameter for said input signal by using a voice activity detection (VAD) method and a missing feature imputation (MFI) method wherein the MFI method comprises a step of calculating a clean speech spectrum feature parameter, said at least one environment parameter including a confidence index of said system processing said input signal;
(b) using said at least one environment parameter to verify whether said input signal is recognizable according to verification training with environment parameters in advance; and
(c) passing said input signal to said speech recognition or dialog device when said input signal is verified as recognizable;
otherwise, triggering a strategy response processor to provide a plurality of strategies when said input signal is verified as unrecognizable;
wherein said confidence index is generated based on a probability distribution of a spectrum parameter of said input signal and the probability distribution of the spectrum parameter of a system model by using the steps of:
measuring the divergence between said input signal and a known system model distribution on frequency spectrum; and
using a sigmoid function to transform said divergence into a confidence index between 0 and 1.
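The two steps at the end of claim 25 (measure a divergence, then map it through a sigmoid) can be illustrated with a minimal Python sketch. The claim does not say which divergence is used, so the Kullback-Leibler divergence over discrete spectral distributions is an assumption made here. The function names and the slope and midpoint parameters are likewise hypothetical. The sigmoid is oriented so that a small divergence gives a confidence index near 1, which is also an assumption.

```python
import math
from typing import Sequence


def kl_divergence(p: Sequence[float], q: Sequence[float], eps: float = 1e-12) -> float:
    """KL divergence D(p || q) between two non-negative spectra, normalized to sum to 1."""
    if len(p) != len(q):
        raise ValueError("spectra must have the same number of bins")
    sum_p, sum_q = sum(p), sum(q)
    total = 0.0
    for a, b in zip(p, q):
        a, b = a / sum_p, b / sum_q
        if a > 0.0:
            # eps guards against log of a ratio with a zero model bin
            total += a * math.log(a / max(b, eps))
    return total


def confidence_index(input_spectrum: Sequence[float],
                     model_spectrum: Sequence[float],
                     slope: float = 4.0,
                     midpoint: float = 1.0) -> float:
    """Map the divergence to (0, 1) with a decreasing sigmoid.

    slope and midpoint are illustrative tuning values, not taken from the patent.
    """
    d = kl_divergence(input_spectrum, model_spectrum)
    return 1.0 / (1.0 + math.exp(slope * (d - midpoint)))


model = [0.7, 0.1, 0.1, 0.1]
matched = confidence_index([0.7, 0.1, 0.1, 0.1], model)      # zero divergence
mismatched = confidence_index([0.1, 0.1, 0.1, 0.7], model)   # large divergence
```

With these settings, an input whose spectral distribution equals the model's scores higher than a mismatched one, and both values lie strictly between 0 and 1.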
Specification