System and method for improving the accuracy of a speech recognition program through repetitive training
Abstract
A system and method for quickly improving the accuracy of a speech recognition program. The system is based on a speech recognition program that automatically converts a pre-recorded audio file into a written text. The system parses the written text into segments, each of which is corrected by the system and saved in an individually retrievable manner in association with the computer. The standard speech files are saved towards improving accuracy in speech-to-text conversion by the speech recognition program. The system further includes facilities to repetitively establish an independent instance of the written text from the pre-recorded audio file using the speech recognition program. This independent instance can then be broken into segments, and each segment in the independent instance replaced with the individually retrievable saved corrected segment associated with that segment. In this manner, repetitive instruction of a speech recognition program can be facilitated.
7 Claims
1. A system for improving the accuracy of a speech recognition program operating on a computer, said system comprising:
means for automatically converting a pre-recorded audio file into a written text;
means for parsing said written text into segments;
means for correcting each and every segment of said written text;
means for saving said corrected segment in an individually retrievable manner in association with said computer;
means for saving speech files associated with a substantially corrected written text and used by said speech recognition program towards improving accuracy in speech-to-text conversion by said speech recognition program; and
means for repetitively establishing an independent instance of said written text from said pre-recorded audio file using said speech recognition program and for automatically replacing each segment in said independent instance of said written text with said individually retrievable saved corrected segment associated therewith.
means for sequentially comparing a copy of said written text with a second written text resulting in a sequential list of unmatched words culled from said copy of said written text, said sequential list having a beginning, an end and a current unmatched word, said current unmatched word pointer being successively advanced from said beginning to said end;
means for incrementally searching for said current unmatched word contemporaneously within a first buffer associated with the speech recognition program containing said written text and a second buffer associated with said sequential list; and
means for correcting said current unmatched word in said second buffer, said correcting means including means for displaying said current unmatched word in a manner substantially visually isolated from other text in said copy of said written text and means for playing a portion of said synchronized voice dictation recording from said first buffer associated with said current unmatched word.
4. The invention according to claim 3 wherein said second written text is established by a second speech recognition program having at least one conversion variable different from said speech recognition program.
5. The invention according to claim 3 wherein said second written text is established by one or more human beings.
6. The invention according to claim 3 wherein said correcting means further includes means for alternatively viewing said current unmatched word in context within said copy of said written text.
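For illustration only, the sequential comparison that claims 4 through 6 build on (two versions of the same text compared word by word, yielding an ordered list of unmatched words from the first copy) might be sketched as follows. This is a minimal sketch, not the patented implementation: the function name `unmatched_words`, the whitespace tokenization, and the use of Python's `difflib` alignment are all assumptions introduced here.

```python
from difflib import SequenceMatcher


def unmatched_words(copy_text, second_text):
    """Return (position, word) pairs for words in copy_text that do not
    align with second_text, in the order they appear in copy_text.

    Assumption: words are whitespace-separated tokens, and alignment is
    done with difflib rather than any method named in the claims.
    """
    a = copy_text.split()
    b = second_text.split()
    matcher = SequenceMatcher(a=a, b=b, autojunk=False)
    unmatched = []
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        # "replace" and "delete" mark words of the first copy with no
        # counterpart in the second text; "insert" only concerns words
        # of the second text, so it is ignored here.
        if tag in ("replace", "delete"):
            unmatched.extend((i, a[i]) for i in range(i1, i2))
    return unmatched


# Hypothetical example texts: one recognizer output and one reference.
first = "the patient was seen four a follow up visit"
second = "the patient was seen for a follow up visit"

# Walk the list from beginning to end, one current unmatched word at a time.
for position, word in unmatched_words(first, second):
    print(position, word)
```

The second text could come from a differently configured recognizer or from a human transcriber; the sketch treats both cases identically because it only sees the resulting strings.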
7. A method for improving the accuracy of a speech recognition program operating on a computer comprising:
(a) automatically converting a pre-recorded audio file into a written text;
(b) parsing the written text into segments;
(c) correcting each and every segment of the written text;
(d) saving the corrected segment in an individually retrievable manner;
(e) saving speech files associated with a substantially corrected written text and used by the speech recognition program towards improving accuracy in speech-to-text conversion by the speech recognition program;
(f) establishing an independent instance of the written text from the pre-recorded audio file using the speech recognition program;
(g) automatically replacing each segment in the independent instance of the written text with the individually retrievable saved corrected segment associated therewith;
(h) saving speech files associated with the independent instance of the written text used by the speech recognition program towards improving accuracy in speech-to-text conversion by the speech recognition program; and
(i) repeating steps (f) through (i) a predetermined number of times.
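The steps of the method claim can be read as a loop, sketched below under stated assumptions. `StubRecognizer`, its `transcribe` and `save_speech_files` methods, the sentence-based segmentation rule, and the pairing of segments by position are all hypothetical; the claims do not specify how a real speech recognition program exposes these operations or how segments are associated with one another.

```python
import re


def parse_segments(text):
    """Step (b): split text into sentence-like segments (assumed rule)."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


class StubRecognizer:
    """Hypothetical stand-in for a speech recognition program."""

    def __init__(self):
        self.training_log = []

    def transcribe(self, audio_path):
        # A real program would decode the audio; this stub returns fixed text.
        return "The whether is fine. Sea you soon."

    def save_speech_files(self, audio_path, corrected_text):
        # A real program would update its speech files; the stub just records.
        self.training_log.append((audio_path, corrected_text))


def train_repetitively(recognizer, audio_path, correct, rounds):
    # Steps (a) through (d): convert, parse, correct, and save each segment
    # so that it can be retrieved individually (here, by position).
    first_text = recognizer.transcribe(audio_path)
    corrected = {i: correct(seg) for i, seg in enumerate(parse_segments(first_text))}
    # Step (e): save speech files for the corrected text.
    full_text = " ".join(corrected[i] for i in sorted(corrected))
    recognizer.save_speech_files(audio_path, full_text)
    for _ in range(rounds):
        # Step (f): establish an independent instance of the written text.
        instance = parse_segments(recognizer.transcribe(audio_path))
        # Step (g): replace each segment with its saved corrected segment.
        replaced = [corrected.get(i, seg) for i, seg in enumerate(instance)]
        # Step (h): save speech files for this instance.
        recognizer.save_speech_files(audio_path, " ".join(replaced))
    return corrected


# Hypothetical corrections supplied by an editor.
fixes = {
    "The whether is fine.": "The weather is fine.",
    "Sea you soon.": "See you soon.",
}
recognizer = StubRecognizer()
saved = train_repetitively(recognizer, "dictation.wav", lambda s: fixes.get(s, s), rounds=3)
```

Keying saved segments by position is the simplest choice for a sketch; it breaks down if a later transcription segments the audio differently, in which case some other association (for example, audio time offsets) would be needed.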
Specification