User adaptive speech recognition method and apparatus
Abstract
A user adaptive speech recognition method and apparatus are disclosed that control user confirmation of a recognition candidate using a new threshold value adapted to a user. The user adaptive speech recognition method includes calculating a confidence score of a recognition candidate according to the result of speech recognition, setting a new threshold value adapted to the user based on a result of user confirmation of the recognition candidate and the confidence score of the recognition candidate, and outputting a corresponding recognition candidate as a result of the speech recognition if the calculated confidence score is higher than the new threshold value. Thus, the need for user confirmation of the result of speech recognition is reduced and the probability of speech recognition success is increased.
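The decision step described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the names `Candidate` and `decide` and the `"output"`/`"confirm"` labels are hypothetical, and asking the user for confirmation when the score does not exceed the threshold is an assumed reading of how the threshold "controls user confirmation".

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    text: str          # recognized word or phrase (hypothetical field)
    confidence: float  # confidence score produced by the recognizer


def decide(candidate: Candidate, threshold: float) -> str:
    """Return "output" when the score is higher than the threshold.

    Otherwise return "confirm", meaning the candidate would be shown to the
    user for confirmation (an assumption about the below-threshold path).
    """
    return "output" if candidate.confidence > threshold else "confirm"
```

A threshold adapted to the user shifts the boundary between these two outcomes, which is how fewer confirmations would be needed for a user whose correct results score consistently high.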
21 Claims
1. A user adaptive speech recognition method comprising:
calculating confidence scores of respective recognition candidates according to results of speech recognition of respective user input speech;
setting a new threshold value adapted to one or more users based on results of respective user confirmations by the one or more users of the respective recognition candidates according to a previous threshold value, wherein the setting of the new threshold value includes selectively setting the new threshold value so as to have a value according to a select range, selected from plural ranges differently defining respective ranges, based upon a determination indicating that a first confidence score of a first recognition candidate, that should be rejected, is greater than a confidence score of a second recognition candidate, that should be accepted, or a confidence score of a third recognition candidate, that should be accepted, is less than a confidence score of a fourth recognition candidate, that should be rejected; and
outputting a corresponding recognition candidate as a result of the speech recognition when a calculated confidence score of the corresponding recognition candidate is determined to meet the new threshold value.
Dependent claims: 2, 3, 4
5. A user adaptive speech recognition method comprising:
calculating a confidence score of a recognition candidate according to a result of speech recognition of speech input from a user;
setting a new threshold value adapted to the user based on a result of user confirmation of the recognition candidate and the confidence score of the recognition candidate; and
outputting a corresponding recognition candidate as the result of the speech recognition when the calculated confidence score is higher than the new threshold value,
wherein the setting a new threshold value comprises:
classifying recognition candidates selected as correct answers by the user into a first group and classifying recognition candidates selected as incorrect answers by the user into a second group; and
calculating the new threshold value having a value in a range that is higher than the highest confidence score of the recognition candidates classified into the second group and that is lower than the lowest confidence score of the recognition candidates classified into the first group, when a first confidence score section being the confidence scores of the recognition candidates classified into the first group and a second confidence score section being the confidence scores of the recognition candidates classified into the second group do not overlap.
Dependent claims: 6, 7, 8, 9
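The non-overlapping case in claim 5 can be illustrated with a short sketch. The function name `threshold_non_overlapping` is hypothetical, and the midpoint is an assumption: the claim only requires a value above the highest score in the second (incorrect) group and below the lowest score in the first (correct) group, without saying where in that gap the threshold falls.

```python
from typing import List, Optional


def threshold_non_overlapping(correct_scores: List[float],
                              incorrect_scores: List[float]) -> Optional[float]:
    """Pick a threshold strictly between the two groups.

    Returns None when either group is empty or when the two score sections
    overlap, since this sketch only covers the non-overlapping case.
    """
    if not correct_scores or not incorrect_scores:
        return None  # assumption: both groups are needed to bound the gap
    highest_incorrect = max(incorrect_scores)
    lowest_correct = min(correct_scores)
    if highest_incorrect >= lowest_correct:
        return None  # score sections overlap (or touch)
    # Midpoint of the gap: an illustrative choice, not stated in the claim.
    return (highest_incorrect + lowest_correct) / 2.0
```

For example, with confirmed-correct scores of 0.8 and 0.9 and confirmed-incorrect scores of 0.3 and 0.5, any value between 0.5 and 0.8 would satisfy the claimed range; the sketch returns the midpoint of that gap.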
10. A user adaptive speech recognition apparatus including at least one processing device, the apparatus comprising:
a confidence score calculation unit, using the at least one processing device, calculating confidence scores of respective recognition candidates according to results of speech recognition of respective user input speech;
a threshold value setup unit setting a new threshold value adapted to one or more users based on results of respective user confirmations by the one or more users of the respective recognition candidates according to a previous threshold value, wherein the setting of the new threshold value includes selectively setting the new threshold value so as to have a value according to a select range, selected from plural ranges differently defining respective ranges, based upon a determination indicating that a first confidence score of a first recognition candidate, that should be rejected, is greater than a confidence score of a second recognition candidate, that should be accepted, or a confidence score of a third recognition candidate, that should be accepted, is less than a confidence score of a fourth recognition candidate, that should be rejected; and
a control unit outputting a corresponding recognition candidate as a result of the speech recognition when a calculated confidence score of the corresponding recognition candidate is determined to meet the new threshold value.
Dependent claims: 11, 12
13. A user adaptive speech recognition apparatus, including at least one processing device, the apparatus comprising:
a confidence score calculation unit, using the at least one processing device, calculating a confidence score of a recognition candidate according to the result of speech recognition of speech input by a user;
a threshold value setup unit setting a new threshold value adapted to the user based on a result of user confirmation of the recognition candidate and the confidence score of the recognition candidate; and
a control unit outputting a corresponding recognition candidate as the result of the speech recognition when the calculated confidence score is higher than the new threshold value,
wherein the threshold value setup unit comprises:
a classification unit classifying recognition candidates selected as correct answers by the user into a first group and classifying recognition candidates selected as incorrect answers by the user into a second group; and
a threshold value calculation unit calculating the new threshold value having a value in a range that is higher than the highest confidence score of the recognition candidates classified into the second group and that is lower than the lowest confidence score of the recognition candidates classified into the first group, when a first confidence score section being the confidence scores of the recognition candidates classified into the first group and a second confidence score section being the confidence scores of the recognition candidates classified into the second group do not overlap.
Dependent claims: 14, 15, 16, 17
18. A non-transitory computer-readable recording medium encoded with processing instructions for causing a processor to execute a user adaptive speech recognition method, the method comprising:
calculating confidence scores of respective recognition candidates according to results of speech recognition of respective user input speech;
setting a new threshold value adapted to one or more users based on results of respective user confirmations by the one or more users of the respective recognition candidates according to a previous threshold value, wherein the setting of the new threshold value includes selectively setting the new threshold value so as to have a value according to a select range, selected from plural ranges differently defining respective ranges, based upon a determination indicating that a first confidence score of a first recognition candidate, that should be rejected, is greater than a confidence score of a second recognition candidate, that should be accepted, or a confidence score of a third recognition candidate, that should be accepted, is less than a confidence score of a fourth recognition candidate, that should be rejected; and
outputting a corresponding recognition candidate as a result of the speech recognition when a calculated confidence score of the corresponding recognition candidate is determined to meet the new threshold value.
Dependent claims: 19
20. A method of setting a new threshold value adapted to a user of a speech recognition apparatus, the method comprising:
determining whether a recognition candidate presented to the user is indicated by the user as correct;
classifying, using at least one processing device, the recognition candidate based on a result of the determining, the recognition candidate being classified in a first group when the recognition candidate is correct, the recognition candidate being classified in a second group when the recognition candidate is incorrect;
determining whether a range of probabilities of validity of the recognition candidates of the first group and a range of probabilities of validity of the recognition candidates of the second group overlap;
calculating the new threshold value in a range between the range of the probabilities of validity of the recognition candidates of the first group and the range of the probabilities of validity of the recognition candidates of the second group, when the range of the first group and the range of the second group do not overlap; and
calculating the new threshold value in a range where the range of the probabilities of validity of the recognition candidates of the first group and the range of the probabilities of validity of the recognition candidates of the second group overlap, when the range of the first group and the range of the second group overlap.
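The two branches of claim 20 can be sketched as a function that reports which interval the new threshold would be drawn from. The name `threshold_range` and the `"gap"`/`"overlap"` labels are hypothetical, both groups are assumed to be non-empty, and treating touching ranges as overlapping is an assumption. The claim does not state how a value is chosen inside the interval, so the sketch stops at the interval itself.

```python
from typing import List, Tuple


def threshold_range(correct_scores: List[float],
                    incorrect_scores: List[float]) -> Tuple[str, float, float]:
    """Return (case, low, high) for the interval that holds the new threshold.

    case is "gap" when the two score ranges do not overlap (the interval lies
    between them) and "overlap" when they do (the interval is their
    intersection). Both input lists are assumed to be non-empty.
    """
    lowest_correct, highest_correct = min(correct_scores), max(correct_scores)
    lowest_incorrect, highest_incorrect = min(incorrect_scores), max(incorrect_scores)

    upper_of_lows = max(lowest_correct, lowest_incorrect)
    lower_of_highs = min(highest_correct, highest_incorrect)

    if upper_of_lows <= lower_of_highs:
        # The ranges share values: the threshold goes in the shared region.
        return ("overlap", upper_of_lows, lower_of_highs)
    # The ranges are disjoint: the threshold goes in the gap between them.
    return ("gap", lower_of_highs, upper_of_lows)
```

With correct scores 0.8 and 0.9 and incorrect scores 0.3 and 0.5, the ranges are disjoint and the interval is 0.5 to 0.8. With correct scores 0.6 and 0.9 and incorrect scores 0.3 and 0.7, the ranges overlap and the interval is 0.6 to 0.7.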
21. A user adaptive speech recognition method comprising:
calculating confidence scores of respective recognition candidates according to results of speech recognition of respective user input speech;
setting a new threshold value adapted to one or more users based on results of respective user confirmations by the one or more users of the respective recognition candidates classified according to a previous threshold value, wherein the setting of the new threshold value includes selectively setting the new threshold value so as to have a value according to one of at least a first range and a second range, with the first range including values not greater than a lesser of a lowest confidence score of available recognition candidates incorrectly classified as being rejected and a lowest confidence score of recognition candidates correctly classified as being accepted and not less than a greater of a confidence score of available first recognition candidates incorrectly classified as being accepted and a greatest confidence score of recognition candidates correctly classified as being rejected, and with the second range including values not greater than a greater of a greatest confidence score of the available recognition candidates incorrectly classified as being accepted and a confidence score of the recognition candidate correctly classified as being rejected, greater than a greatest confidence score of the available recognition candidates incorrectly classified as being rejected, and not less than a lesser of the lowest confidence score of the available recognition candidates incorrectly classified as being rejected and a confidence score of the recognition candidates correctly classified as being accepted, less than a lowest confidence score of the available recognition candidates incorrectly classified as being accepted; and
outputting a corresponding recognition candidate as a result of speech recognition when a calculated confidence score of the corresponding recognition candidate is determined to meet the new threshold value.
Specification