Speech recognition with plural confidence measures
Abstract
A speech recognition system is used to receive a speech signal and output an output language word with respect to the speech signal. The speech recognition system has preset quantities for a first threshold, a second threshold, and a third threshold. The speech recognition system includes a first speech recognition device that is used to receive the speech signal and generate a first candidate language word and a first confidence measurement of the first candidate language word, according to the speech signal. A second speech recognition device is used to receive the speech signal and generate a second candidate language word and a second confidence measurement of the second candidate language word, according to the speech signal. A confidence measurement judging unit is used to output the language word, by comparing the first confidence measurement and the second confidence measurement to the above thresholds.
16 Claims
1. A speech recognition system, used for receiving a speech signal and output an output language word with respect to the speech signal, wherein the speech recognition system has a first threshold, a second threshold, and a third threshold, the speech recognition system comprising:
a first speech recognition device, used to receive the speech signal and generate a first candidate language word and a first confidence measurement of the first candidate language word, according to the speech signal;
a second speech recognition device, used to receive the speech signal and generate a second candidate language word and a second confidence measurement of the second candidate language word, according to the speech signal; and
a confidence measurement judging unit, used to judge and output the output language word, according to the first confidence measurement and the second confidence measurement;
wherein when the first confidence measurement is less than the first threshold and the second confidence measurement is less than the second threshold, the first candidate language word is taken as the output language word,
when the first confidence measurement is greater than the first threshold and the second confidence measurement is less than the third threshold, the first candidate language word is set to be the output language word,
when the first confidence measurement is less than the first threshold and the second confidence measurement is greater than the second threshold, then the second candidate language word is set to be the output language word, and
when the second confidence measurement is greater than the third threshold, the second candidate language word is set to be the output language word.
Dependent claims: 2, 3, 4
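
The decision rule of the confidence measurement judging unit in claim 1 can be sketched in a few lines of Python. This is a minimal illustration, not the patentee's implementation: the function name `judge_output`, the example words, and the numeric thresholds are hypothetical. The claim does not say what happens when a confidence measurement exactly equals a threshold, so the sketch assumes such ties fall back to the first candidate, and it assumes the second threshold does not exceed the third.

```python
def judge_output(word1, conf1, word2, conf2, t1, t2, t3):
    """Pick the output word from two recognizer hypotheses (illustrative).

    Assumes t2 <= t3. Exact ties with a threshold fall back to word1,
    which is a choice the claim itself does not specify.
    """
    # The second recognizer is very confident: its word is output
    # regardless of the first confidence measurement.
    if conf2 > t3:
        return word2
    # The first recognizer is unsure and the second clears the second threshold.
    if conf1 < t1 and conf2 > t2:
        return word2
    # Remaining cases: both are unsure, or the first is confident and the
    # second does not exceed the third threshold.
    return word1


# Hypothetical thresholds and scores, chosen only for illustration.
T1, T2, T3 = 0.5, 0.4, 0.8
print(judge_output("call", 0.3, "fall", 0.2, T1, T2, T3))  # call
print(judge_output("call", 0.9, "fall", 0.6, T1, T2, T3))  # call
print(judge_output("call", 0.3, "fall", 0.6, T1, T2, T3))  # fall
print(judge_output("call", 0.9, "fall", 0.9, T1, T2, T3))  # fall
```

The four calls correspond, in order, to the four conditions recited in the wherein clause.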

5. A speech recognition system, used to receive a speech signal and output an output language word with respect to the speech signal, wherein the speech recognition system has preset quantities for a first threshold and a second threshold, the speech recognition system further includes a storage device, wherein the storage device is used to receive the speech signal and output the speech signal, the speech recognition system comprising:
a first speech recognition device, which is used to receive the speech signal and generate a first candidate language word and a first confidence measurement of the first candidate language word, according to the speech signal;
a confidence measurement judging unit, which is used to determine the output language word; and
a second speech recognition device, which is controlled by the confidence measurement judging unit and is used to receive an output of the speech signal output from the storage device and generate a second candidate language word and a second confidence measurement of the second candidate language word, according to the speech signal,
wherein the confidence measurement judging unit judges whether or not the first confidence measurement is greater than the first threshold,
if it being yes, then the first candidate language word is taken as the output language word,
if it being no, then the confidence measurement judging unit causes the second speech recognition device to generate a second language word and a second confidence measurement, and then judges whether or not the second confidence measurement is greater than the second threshold,
if it being yes, then the second candidate language word is taken as the output language word,
if it being no, then the first candidate language word is taken as the output language word.
Dependent claims: 6, 7, 8
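
The sequential arrangement of claim 5 differs from claim 1 in that the second speech recognition device runs only when the first result is not trusted, reading the signal back from the storage device. The sketch below is an assumption-laden illustration: `recognize_cascade`, the stub recognizers, and the threshold values are hypothetical, and each recognizer is modeled as a plain callable returning a `(word, confidence)` pair.

```python
def recognize_cascade(signal, first, second, t1, t2):
    """Run the second recognizer only if the first is not confident enough.

    `first` and `second` are hypothetical callables that take the signal
    and return a (word, confidence) pair.
    """
    stored = list(signal)  # stands in for the storage device holding the signal
    word1, conf1 = first(stored)
    if conf1 > t1:
        return word1  # first result is trusted; second recognizer never runs
    # The judging unit now triggers the second recognizer on the stored signal.
    word2, conf2 = second(stored)
    if conf2 > t2:
        return word2
    return word1  # neither result is trusted: fall back to the first candidate


calls = {"second": 0}

def confident_first(sig):
    return ("call", 0.9)

def unsure_first(sig):
    return ("call", 0.3)

def second_stub(sig):
    calls["second"] += 1  # count how often the second recognizer is used
    return ("fall", 0.7)

samples = [0.0, 0.1, -0.1]  # placeholder samples, not real audio
print(recognize_cascade(samples, confident_first, second_stub, 0.5, 0.6))  # call
print(calls["second"])                                                     # 0
print(recognize_cascade(samples, unsure_first, second_stub, 0.5, 0.6))     # fall
print(calls["second"])                                                     # 1
```

The call counter shows the practical difference from the parallel arrangement: the second recognizer is skipped whenever the first confidence measurement exceeds the first threshold.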

9. A speech recognition method, the method comprising the following steps:
feeding a speech signal into a first speech recognition device and a second speech recognition device;
the first speech recognition device generating a first candidate language word and a first confidence measurement, according to the speech signal, and the second speech recognition device generating a second candidate language word and a second confidence measurement, according to the speech signal; and
if the first confidence measurement being less than the first threshold and the second confidence measurement being less than the second threshold, then the first candidate language word being taken as the output language word,
if the first confidence measurement being greater than the first threshold and the second confidence measurement being less than a third threshold, then the first candidate language word being taken as the output language word,
if the first confidence measurement being less than the first threshold and the second confidence measurement being greater than the second threshold, then the second candidate language word being taken as the output language word, and
if the second confidence measurement being greater than the third threshold, then the second candidate language word being taken as the output language word.
Dependent claims: 10, 11, 12
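
The four conditions of claim 9 overlap in places, so it can help to check that they never point to different outputs for the same pair of confidence measurements. The sketch below evaluates the conditions exactly as written over a grid of scores. The helper names, the grid, and the threshold values are hypothetical; the observation that the rule is unambiguous when the second threshold is below the third is a property of this sketch's example values, not a statement made in the claims.

```python
def matching_outputs(c1, c2, t1, t2, t3):
    """Return every output selected by the four conditions as written."""
    outputs = set()
    if c1 < t1 and c2 < t2:
        outputs.add("first")
    if c1 > t1 and c2 < t3:
        outputs.add("first")
    if c1 < t1 and c2 > t2:
        outputs.add("second")
    if c2 > t3:
        outputs.add("second")
    return outputs


def is_unambiguous(t1, t2, t3, steps=20):
    """True if every sampled score pair selects exactly one output."""
    # Mid-cell sample points, so no score coincides with the example
    # thresholds used below (ties are left open by the claim).
    grid = [(k + 0.5) / steps for k in range(steps)]
    return all(
        len(matching_outputs(c1, c2, t1, t2, t3)) == 1
        for c1 in grid
        for c2 in grid
    )


print(is_unambiguous(0.5, 0.4, 0.8))  # True: second threshold below third
print(is_unambiguous(0.5, 0.8, 0.4))  # False: thresholds swapped
print(sorted(matching_outputs(0.3, 0.6, 0.5, 0.8, 0.4)))  # ['first', 'second']
```

With the swapped thresholds, a low first score combined with a second score between the two values satisfies both the first and the fourth condition, which is why an implementation would likely require the second threshold not to exceed the third.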

13. A speech recognition method, the method comprising the following steps:
(a) feeding a speech signal into a first speech recognition device;
(b) the first speech recognition device generating a first candidate language word and a first confidence measurement, according to the speech signal;
(c) judging whether or not the first confidence measurement is greater than the first threshold, if it being yes, then the first candidate language word being taken as the output language word and then the method goes to an end;
(d) feeding the speech signal into a second speech recognition device and the second speech recognition device generating a second candidate language word and a second confidence measurement, according to the input speech signal; and
(e) judging whether or not the second confidence measurement is greater than the second threshold, if it being yes, then the second candidate language word being taken as the output language word, if it being no, then the first candidate language word being taken as the output language word.
Dependent claims: 14, 15, 16

Specification