Apparatus and method for decoding to recognize speech using a third speech recognizer based on first and second recognizer results
First Claim
1. A voice recognition apparatus which performs recognition of voice to be outputted from an output unit, the voice recognition apparatus comprising:
- a processor configured to control;
first, second and third voice recognizers which each recognize an input voice and obtain a recognition result including a candidate character string corresponding to the input voice, each of said first, second and third voice recognizers include a memory that stores a dictionary; and
a controller which, when it is decided based on said recognition result obtained by each of said first and second voice recognizers to cause said third voice recognizer to recognize said input voice, causes said third voice recognizer to recognize said input voice by using the dictionary included in said third voice recognizer including said candidate character string obtained by at least one of said first and second voice recognizers, and causes said output unit to output said recognition result obtained by said third voice recognizer, whereinthe recognition results obtained by each of said first and second voice recognizers further include score values indicating accuracy of said candidate character strings, andwhether or not to cause the third voice recognizer to recognize said input voice is decided based on an index including at least one of said score values which are obtained by said first and second voice recognizers and are a maximum, a similarity indicating a degree that said candidate character strings obtained by said first and second voice recognizers match each other, and an order distance indicating a degree of difference in an order of said candidate character strings aligned in order of said score values obtained by said first and second voice recognizers.
1 Assignment
0 Petitions
Accused Products
Abstract
An object is to provide a technique which can provide a highly valid recognition result while preventing unnecessary processing. A voice recognition device includes first to third voice recognition units, and a control unit. When it is decided based on recognition results obtained by the first and second voice recognition units to cause the third voice recognition unit to recognize an input voice, the control unit causes the third voice recognition unit to recognize the input voice by using a dictionary including a candidate character string obtained by at least one of the first and second voice recognition units.
-
Citations
9 Claims
-
1. A voice recognition apparatus which performs recognition of voice to be outputted from an output unit, the voice recognition apparatus comprising:
-
a processor configured to control; first, second and third voice recognizers which each recognize an input voice and obtain a recognition result including a candidate character string corresponding to the input voice, each of said first, second and third voice recognizers include a memory that stores a dictionary; and a controller which, when it is decided based on said recognition result obtained by each of said first and second voice recognizers to cause said third voice recognizer to recognize said input voice, causes said third voice recognizer to recognize said input voice by using the dictionary included in said third voice recognizer including said candidate character string obtained by at least one of said first and second voice recognizers, and causes said output unit to output said recognition result obtained by said third voice recognizer, wherein the recognition results obtained by each of said first and second voice recognizers further include score values indicating accuracy of said candidate character strings, and whether or not to cause the third voice recognizer to recognize said input voice is decided based on an index including at least one of said score values which are obtained by said first and second voice recognizers and are a maximum, a similarity indicating a degree that said candidate character strings obtained by said first and second voice recognizers match each other, and an order distance indicating a degree of difference in an order of said candidate character strings aligned in order of said score values obtained by said first and second voice recognizers. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A voice recognition method for performing recognition of voice to be outputted from an output unit, the voice recognition method comprising,
when it is decided based on a recognition result obtained by each of first and second voice recognizers among first, second and third voice recognizers which each recognize an input voice and obtain said recognition result including a candidate character string corresponding to the input voice, to cause said third voice recognizer to recognize said input voice, causing said third voice recognizer to recognize said input voice by using a dictionary including said candidate character string obtained by at least one of said first and second voice recognizers, and causing said output unit to output said recognition result obtained by said third voice recognizer, wherein the recognition results obtained by each of said first and second voice recognizers include score values indicating accuracy of said candidate character strings, and whether or not to cause the third voice recognizer to recognize said input voice is decided based on an index including at least one of said score values which are obtained by said first and second voice recognizers and are a maximum, a similarity indicating a degree that said candidate character strings obtained by said first and second voice recognizers match each other, and an order distance indicating a degree of difference in an order of said candidate character strings aligned in order of said score values obtained by said first and second voice recognizers.
Specification