System and method for tuning and testing in a speech recognition system

US 7,962,331 B2
Filed: 10/21/2008
Issued: 06/14/2011
Est. Priority Date: 12/01/2003
Status: Active Grant

Abstract

Systems and methods for improving the performance of a speech recognition system. In some embodiments a tuner module and/or a tester module are configured to cooperate with a speech recognition system. The tester and tuner modules can be configured to cooperate with each other. In one embodiment, the tuner module may include a module for playing back a selected portion of a digital data audio file, a module for creating and/or editing a transcript of the selected portion, and/or a module for displaying information associated with a decoding of the selected portion, the decoding generated by a speech recognition engine. In other embodiments, the tester module can include an editor for creating and/or modifying a grammar, a module for receiving a selected portion of a digital audio file and its corresponding transcript, and a scoring module for producing scoring statistics of the decoding based at least in part on the transcript.
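
The abstract does not say which scoring statistics the scoring module produces. As a minimal sketch, assuming word error rate is one of them, the function below compares a recognizer decode with a human transcript. The name `word_error_rate` and the sample phrases are hypothetical and do not come from the patent.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance between transcript and decode, divided by transcript length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] is the edit distance between the reference words processed
    # so far and the first j words of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from the decode
                curr[j - 1] + 1,     # extra word in the decode
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    transcript = "call customer service please"  # human transcript (assumed sample)
    decode = "call customs service"              # recognizer output (assumed sample)
    print(word_error_rate(transcript, decode))
```

A tester built along these lines could run the same audio and transcript pairs before and after a grammar change and compare the two rates.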

28 Claims

1. A method of tuning a speech recognizer, the method comprising:
- playing a selected portion of a digital audio data file with a digital audio player;
- creating and/or modifying a digital transcript of the selected audio portion;
- displaying information associated with a decode of the selected audio portion on an electronic display, wherein the displayed information includes a menu of selectable noise tags for identifying noise events in the transcript;
- receiving an input selected from the displayed menu, the input identifying at least one noise event in the transcript;
- modifying, by a computing device, the transcript based on the input; and
- utilizing the modified transcript to improve the performance of the speech recognizer.
  - 2. The method of claim 1, wherein the menu comprises a graphical user interface having elements for allowing selection, input, and command entry related to the selectable noise tags.
  - 3. The method of claim 1, wherein the information comprises a confidence score.
  - 4. The method of claim 1, wherein the information comprises an acoustic model score.
  - 5. The method of claim 1, wherein the at least one noise event is a cough.
  - 6. The method of claim 1, wherein the at least one noise event is a sneeze.
  - 7. The method of claim 1, wherein the at least one noise event is a laugh.
  - 8. The method of claim 1, wherein the at least one noise event is a breath.
  - 9. The method of claim 1, wherein speech recognizer performance is improved by training new acoustic models.
  - 10. The method of claim 1, wherein improving the performance of the speech recognizer includes tuning parameters in the speech recognizer that are not acoustic models.
  - 11. The method of claim 1, wherein improving the performance of the speech recognizer comprises using the identified at least one noise event to train the speech recognizer to interpret or ignore acoustic phenomena characterized as noise.
  - 12. The method of claim 1, wherein improving the performance of the speech recognizer comprises building a new acoustic model using the identified at least one noise event.
  - 13. The method of claim 1, further comprising determining, based at least in part on the transcript and the information associated with the decode, a modification of the speech recognizer to improve its performance.
  - 14. The method of claim 13, wherein the modification comprises modifying a grammar of the speech recognizer.
  - 15. The method of claim 14, wherein the modification comprises adding a concept, phrase, word, or phoneme to the grammar.
  - 16. The method of claim 13, wherein the modification comprises modifying a word pronunciation, dictionary, or acoustic model of the speech recognizer.
  - 17. The method of claim 13, wherein the modification comprises modifying a call flow.
  - 18. The method of claim 13, further comprising making a modification to the speech recognizer.
  - 19. The method of claim 18, further comprising iteratively performing the recited steps.
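
The receiving and modifying steps of claim 1 can be illustrated with a small data structure. This is only a sketch under stated assumptions: the four noise events are those named in claims 5 to 8, while the bracketed tag syntax, the `Transcript` class, and its method names are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass, field

# Hypothetical menu of selectable noise tags. The events come from
# claims 5-8; the bracketed spelling is an assumption.
NOISE_TAGS = {
    "cough": "[COUGH]",
    "sneeze": "[SNEEZE]",
    "laugh": "[LAUGH]",
    "breath": "[BREATH]",
}


@dataclass
class Transcript:
    """A transcript held as a list of word and noise-tag tokens."""
    tokens: list = field(default_factory=list)

    def insert_noise_tag(self, position: int, selection: str) -> None:
        """Modify the transcript based on a selection from the tag menu."""
        if selection not in NOISE_TAGS:
            raise KeyError(f"unknown noise tag: {selection}")
        if not 0 <= position <= len(self.tokens):
            raise IndexError("position is outside the transcript")
        self.tokens.insert(position, NOISE_TAGS[selection])

    def text(self) -> str:
        """Return the transcript with noise tags inline."""
        return " ".join(self.tokens)

    def words_only(self) -> list:
        """Return the spoken words with noise tags filtered out."""
        tags = set(NOISE_TAGS.values())
        return [token for token in self.tokens if token not in tags]


if __name__ == "__main__":
    transcript = Transcript("i want to check my balance".split())
    transcript.insert_noise_tag(2, "cough")  # user picks "cough" from the menu
    print(transcript.text())
```

Keeping tags as ordinary tokens lets the annotated transcript be passed to training with the noise events marked, or reduced to plain words with `words_only()` when only the speech is needed.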

20. A system for facilitating the tuning of a speech recognizer, the system comprising:
- a processor;
- a memory;
- a playback module configured to play a selected portion of a digital audio data file;
- a user interface configured to provide a menu of selectable noise tags for identifying noise events in a transcript;
- an editor module configured to receive input modifying the transcript or the notes, wherein the input includes noise tags, selected from the menu of selectable noise tags, attaching markers to the transcript; and
- a detail viewing module configured to display information associated with a decoding of the selected portion by the speech recognizer, the information including the noise tags identifying noise events in the transcript.
  - 21. The system of claim 20, further comprising a user interface that includes the menu of selectable noise tags.
  - 22. The system of claim 20, wherein the user interface comprises a graphical user interface.
  - 23. The system of claim 20, wherein the information associated with the decoding comprises a grammar associated with the selected portions.
  - 24. The system of claim 23, wherein the grammar comprises a set of responses expected to occur in the selected portions.
  - 25. The system of claim 24, wherein the set of responses comprises phrases, words, and/or phonemes.
  - 26. The system of claim 20, wherein the information associated with the decoding comprises a confidence score.
  - 27. The system of claim 20, wherein the information associated with the decoding comprises an identification of an acoustic model.
  - 28. The system of claim 20, wherein the information associated with the decoding comprises phonemes used by the speech recognizer to decode the selected portions.

Current Assignee: LumenVox, LLC.
Original Assignee: LumenVox, LLC.
Inventors: Auckland, Alexandra L., Herold, Keith C., Bergman, Michael D., Blake, James F. II, Miller, Edward S., Danielson, Kyle N.
Primary Examiner(s): Vo; Huyen X.
Application Number: US12/255,564
Publication Number: US 20090043576A1
Time in Patent Office: 966 Days
Field of Search: 704/215, 704/210, 704/208, 704/214, 704/231, 704/235, 704/278, 704/276, 704/257, 704/270, 704/270.1, 704/3, 704/4, 704/7, 704/272, 379/266.1
US Class Current: 704/215
CPC Class Codes:
G10L 15/01   Assessment or evaluation of...
G10L 15/063   Training
G10L 15/193   Formal grammars, e.g. finit...
G10L 2015/0631   Creating reference template...
