HANDWRITING-BASED USER INTERFACE FOR CORRECTION OF SPEECH RECOGNITION ERRORS
First Claim
Patent Images
1. A method of correcting speech recognition result output by a speech recognizer, comprising:
- displaying the speech recognition result as a sequence of tokens on a user interface display;
receiving editing marks on the displayed speech recognition result, input by a user, through the user interface display;
identifying an error type and error position within the speech recognition result based on the editing marks; and
replacing tokens in the speech recognition result, marked by the editing marks as being incorrect, with alternative tokens, based on the error type and error position identified, to obtain a revised speech recognition result; and
outputting the revised speech recognition result for display on the user interface display.
2 Assignments
0 Petitions
Accused Products
Abstract
A speech recognition result is displayed for review by a user. If it is incorrect, the user provides pen-based editing marks. An error type and location (within the speech recognition result) are identified based on the pen-based editing marks. An alternative result template is generated, and an N-best alternative list is also generated by applying the template to intermediate recognition results from an automatic speech recognizer. The N-best alternative list is output for use in correcting the speech recognition results.
-
Citations
20 Claims
-
1. A method of correcting speech recognition result output by a speech recognizer, comprising:
-
displaying the speech recognition result as a sequence of tokens on a user interface display; receiving editing marks on the displayed speech recognition result, input by a user, through the user interface display; identifying an error type and error position within the speech recognition result based on the editing marks; and replacing tokens in the speech recognition result, marked by the editing marks as being incorrect, with alternative tokens, based on the error type and error position identified, to obtain a revised speech recognition result; and outputting the revised speech recognition result for display on the user interface display. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A user interface system used for performing correction of speech recognition results generated by a speech recognizer, comprising:
-
a user interface display displaying a speech recognition result; a user interface component configured to receive through the user interface display, handwritten editing marks on the speech recognition result and being indicative of an error type of an error located at an error position in the speech recognition result where the handwritten editing mark is made; a template generator generating a template indicative of alternative speech recognition results based on the error type and error position; an N-best alternative generator configured to identify intermediate speech recognition results output by the speech recognizer that match the template and to score each matching intermediate speech recognition result to obtain an N-best list of alternatives comprising the N-best scoring intermediate speech recognition results that match the template; and an error correction component configured to generate a revised speech recognition result by revising the speech recognition result with one of the N-best alternatives and to display the revised speech recognition result on the user interface display. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A method of correcting a speech recognition result displayed on a touch sensitive user interface display, comprising:
-
receiving a handwritten input identifying an error type and error position of an error in the speech recognition result, through the touch sensitive user interface display; generating a list of alternatives for the speech recognition result at the error position; and performing error correction by; automatically generating a revised speech recognition result using a first alternative in the list and displaying the revised speech recognition result; displaying the list of alternatives, and, if the revised speech recognition result is incorrect, receiving a user selection of a correct one of the alternatives and displaying the revised speech recognition result using the selected correct alternative, and if a user input is received indicative of there being no correct alternative in the list, receiving a user handwriting input indicative of a user written correction of the error, performing handwriting recognition on the user handwriting input to generate a handwriting recognition result and displaying the revised speech recognition result using the handwriting recognition result. - View Dependent Claims (17, 18, 19, 20)
-
Specification