System and method for tuning and testing in a speech recognition system
Abstract
Systems and methods for improving the performance of a speech recognition system. In some embodiments, a tuner module and/or a tester module are configured to cooperate with a speech recognition system. The tester and tuner modules can be configured to cooperate with each other. In one embodiment, the tuner module may include a module for playing back a selected portion of a digital data audio file, a module for creating and/or editing a transcript of the selected portion, and/or a module for displaying information associated with a decoding of the selected portion, the decoding generated by a speech recognition engine. In other embodiments, the tester module can include an editor for creating and/or modifying a grammar, a module for receiving a selected portion of a digital audio file and its corresponding transcript, and a scoring module for producing scoring statistics of the decoding based at least in part on the transcript.
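The abstract does not say which scoring statistics the tester module produces. As an illustration only, the sketch below assumes the statistic is a word error rate (WER): the word-level edit distance between the human transcript and the recognizer's decoding, divided by the transcript length. The function name `word_error_rate` and the whitespace tokenization are assumptions for this sketch, not details taken from the patent.

```python
def word_error_rate(transcript: str, decoding: str) -> float:
    """Word-level edit distance between transcript and decoding,
    divided by the number of words in the transcript."""
    ref = transcript.split()
    hyp = decoding.split()
    if not ref:
        raise ValueError("transcript must contain at least one word")

    # prev[j] is the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from the decoding
            insertion = curr[j - 1] + 1  # extra word in the decoding
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)
```

A decoding that matches the transcript exactly scores 0.0. One substituted word in a three-word transcript scores one third.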
61 Citations
28 Claims
1. A method of tuning a speech recognizer, the method comprising:
playing a selected portion of a digital audio data file with a digital audio player;
creating and/or modifying a digital transcript of the selected audio portion;
displaying information associated with a decode of the selected audio portion on an electronic display, wherein the displayed information includes a menu of selectable noise tags for identifying noise events in the transcript;
receiving an input selected from the displayed menu, the input identifying at least one noise event in the transcript;
modifying, by a computing device, the transcript based on the input; and
utilizing the modified transcript to improve the performance of the speech recognizer.
Dependent claims: 2-19.
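The receiving and modifying steps of claim 1 can be pictured with a minimal sketch. It assumes the transcript is held as a list of words and that a noise tag is inserted as a bracketed token at a chosen position. The tag names in `NOISE_TAGS`, the `Transcript` class, and the `insert_noise_tag` method are hypothetical. The claim itself does not specify any tag vocabulary or data structure.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical menu of selectable noise tags; the claim names no specific tags.
NOISE_TAGS = ("[cough]", "[background_speech]", "[line_noise]")


@dataclass
class Transcript:
    """Transcript of a selected audio portion, stored as a list of tokens."""
    words: List[str] = field(default_factory=list)

    def insert_noise_tag(self, position: int, tag: str) -> None:
        """Insert a menu-selected noise tag before the word at `position`."""
        if tag not in NOISE_TAGS:
            raise ValueError(f"unknown noise tag: {tag}")
        if not 0 <= position <= len(self.words):
            raise IndexError("position is outside the transcript")
        self.words.insert(position, tag)

    def text(self) -> str:
        """Return the transcript as a single space-separated string."""
        return " ".join(self.words)


# Usage: the user picks "[cough]" from the menu to mark a noise event
# heard between the first and second words.
transcript = Transcript("check my balance".split())
transcript.insert_noise_tag(1, "[cough]")
print(transcript.text())
```

Restricting input to the fixed menu, as in this sketch, keeps tag spelling consistent across transcripts, which matters if the tagged transcripts are later fed back to the recognizer.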
20. A system for facilitating the tuning of a speech recognizer, the system comprising:
a processor;
a memory;
a playback module configured to play a selected portion of a digital audio data file;
a user interface configured to provide a menu of selectable noise tags for identifying noise events in a transcript;
an editor module configured to receive input modifying the transcript or the notes, wherein the input includes noise tags, selected from the menu of selectable noise tags, attaching markers to the transcript; and
a detail viewing module configured to display information associated with a decoding of the selected portion by the speech recognizer, the information including the noise tags identifying noise events in the transcript.
Dependent claims: 21-28.
Specification