System and method for post processing speech recognition output
First Claim
1. A computer implemented method for altering output from a speech recognition engine, the method comprising:
- inputting the output from a speech recognition engine into a post processor configured to convert the output, comprising speech-recognized dictation in the form of unformatted raw text or data, from the speech recognition engine to a formatted text document that is a transcription of the dictation, the post processor operatively configured to:
convert said output to a list of tokens;
identify a set of tokens from said list of tokens matching a first set of predetermined patterns; and
perform a set of rewrite rules based on said identified set of tokens, the rewrite rules used to transform the output of the speech recognition engine to the formatted text document that corresponds to the dictation, wherein the set of rewrite rules comprise:
identify section headings in the speech-recognized dictation;
format the section headings according to a standard practice of a specific site;
identify and transform number clusters adjacent to keywords in the speech-recognized dictation;
identify and transform number clusters not adjacent to keywords in the speech-recognized dictation;
perform consolidation or suppression of filled pauses and silences in the speech-recognized dictation;
perform attachment and format punctuation in the speech-recognized dictation; and
perform capitalization of words following punctuation in the speech-recognized dictation to thereby provide the formatted text document for writing to a storage device.
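The tokenize, match, and rewrite steps recited above can be illustrated with a minimal sketch. It is not the patented implementation: the filler vocabulary, the spoken-punctuation table, and every function name below are assumptions chosen for illustration, and only three of the recited rules (filled-pause suppression, punctuation attachment, capitalization after punctuation) are shown.

```python
# Hypothetical filler vocabulary; a real engine would emit its own pause/silence markers.
FILLED_PAUSES = {"uh", "um", "er", "<silence>"}
# Hypothetical mapping from dictated punctuation words to symbols.
SPOKEN_PUNCTUATION = {"period": ".", "comma": ",", "colon": ":"}
SENTENCE_END = {".", "?", "!"}


def tokenize(raw):
    """Convert raw recognizer output to a list of lowercase tokens."""
    return raw.lower().split()


def suppress_filled_pauses(tokens):
    """Drop tokens that match the filled-pause and silence patterns."""
    return [t for t in tokens if t not in FILLED_PAUSES]


def attach_punctuation(tokens):
    """Replace spoken punctuation with its symbol, attached to the previous token.

    A punctuation word with no preceding token is dropped.
    """
    out = []
    for t in tokens:
        symbol = SPOKEN_PUNCTUATION.get(t)
        if symbol is None:
            out.append(t)
        elif out:
            out[-1] += symbol
    return out


def capitalize_after_punctuation(tokens):
    """Capitalize the first token and every token following sentence-final punctuation."""
    out = []
    at_sentence_start = True
    for t in tokens:
        out.append(t.capitalize() if at_sentence_start else t)
        at_sentence_start = t[-1] in SENTENCE_END
    return out


def post_process(raw):
    """Run the rewrite rules in a fixed order and join the tokens into text."""
    tokens = tokenize(raw)
    for rule in (suppress_filled_pauses, attach_punctuation, capitalize_after_punctuation):
        tokens = rule(tokens)
    return " ".join(tokens)


print(post_process("uh the patient is stable period um follow up in two weeks period"))
```

The rule order matters in this sketch: fillers are removed first so that punctuation attaches to a real word, and capitalization runs last so that it sees the attached sentence-final symbols. Treating every occurrence of "period" as punctuation is a simplification; a production system would need context to tell the spoken command from the ordinary word.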
Abstract
A system and method may be disclosed for facilitating the conversion of dictation into usable and formatted documents by providing a method of post processing speech recognition output. In particular, the post processing system may be configured to implement rewrite rules and process raw speech recognition output or other raw data according to those rewrite rules. The application of the rewrite rules may format and/or normalize the raw speech recognition output into formatted or finalized documents and reports. The system may thereby reduce or eliminate the need for post processing by transcriptionists or dictation authors.
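One way to picture rewrite rules that a post processor "implements" and then applies to raw output is as an ordered table of pattern and replacement pairs. The sketch below is an assumption-laden illustration, not the disclosed system: the heading names, the upper-case-with-colon house style, and the names `SITE_HEADINGS`, `format_heading`, `REWRITE_RULES`, and `apply_rules` are all hypothetical.

```python
import re

# Hypothetical site configuration: the section headings this site dictates.
SITE_HEADINGS = ("history of present illness", "physical examination", "assessment", "plan")


def format_heading(name):
    """Hypothetical house style: heading on a new line, upper case, trailing colon."""
    return "\n" + name.upper() + ":"


# Each rule is (compiled pattern, replacement callable); rules run in list order.
REWRITE_RULES = [
    (
        re.compile(r"\s*\b(" + "|".join(re.escape(h) for h in SITE_HEADINGS) + r")\b"),
        lambda m: format_heading(m.group(1)),
    ),
]


def apply_rules(text, rules=REWRITE_RULES):
    """Apply every rewrite rule to the raw text, then trim surrounding whitespace."""
    for pattern, replacement in rules:
        text = pattern.sub(replacement, text)
    return text.strip()


print(apply_rules("assessment stable plan discharge home"))
```

Keeping the rules as data means a different site could swap in its own heading list and style without touching `apply_rules`. The sketch matches heading words wherever they occur, so an ordinary use of "plan" in running text would also be rewritten; distinguishing the two would need additional context.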
11 Claims
11. A computer implemented method for converting output from a speech recognition engine, the method comprising:
- inputting the output of the speech recognition engine to a post processor configured to convert the output, comprising speech-recognized dictation in the form of unformatted raw text or data, from the speech recognition engine to a formatted text document that is a transcription of the dictation;
converting, by the post processor, the output to a list of tokens;
identifying, by the post processor, a set of tokens from the list of tokens matching a first set of predetermined patterns; and
performing, by the post processor, a set of rewrite rules based on the identified set of tokens, the rewrite rules used to transform the output of the speech recognition engine to the formatted text document that corresponds to the dictation, wherein performing the set of rewrite rules comprises
identifying and transforming number clusters adjacent to keywords in the speech-recognized dictation,
identifying and transforming number clusters not adjacent to keywords in the speech-recognized dictation,
performing consolidation or suppression of filled pauses and silences in the speech-recognized dictation, and
performing attachment and formatting punctuation in the speech-recognized dictation to thereby provide the formatted text document for writing to a storage device.
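The distinction between number clusters adjacent to keywords and those not adjacent to keywords can be sketched as follows. This is an illustration under stated assumptions rather than the claimed method: the unit-keyword table, the restriction to values below 100, and the names `UNIT_KEYWORDS`, `cluster_value`, and `rewrite_number_clusters` are hypothetical.

```python
UNITS = {"zero": 0, "one": 1, "two": 2, "three": 3, "four": 4,
         "five": 5, "six": 6, "seven": 7, "eight": 8, "nine": 9}
TENS = {"twenty": 20, "thirty": 30, "forty": 40, "fifty": 50,
        "sixty": 60, "seventy": 70, "eighty": 80, "ninety": 90}
NUMBER_WORDS = {**UNITS, **TENS}

# Hypothetical keywords that change how an adjacent number cluster is rendered.
UNIT_KEYWORDS = {"milligrams": "mg", "milliliters": "mL"}


def cluster_value(words):
    """Return the integer for a one- or two-word cluster, or None if unsupported."""
    if len(words) == 1:
        return NUMBER_WORDS[words[0]]
    if (len(words) == 2 and words[0] in TENS
            and words[1] in UNITS and UNITS[words[1]] > 0):
        return TENS[words[0]] + UNITS[words[1]]
    return None


def rewrite_number_clusters(tokens):
    """Rewrite runs of number words, using a keyword-specific format when one follows."""
    out = []
    i = 0
    while i < len(tokens):
        # Find the maximal run of number words starting at position i.
        j = i
        while j < len(tokens) and tokens[j] in NUMBER_WORDS:
            j += 1
        value = cluster_value(tokens[i:j])
        if value is None:
            # Either not a number word or an unsupported cluster: copy unchanged.
            end = max(j, i + 1)
            out.extend(tokens[i:end])
            i = end
            continue
        keyword = tokens[j] if j < len(tokens) else None
        if keyword in UNIT_KEYWORDS:
            # Cluster adjacent to a keyword: emit digits plus the abbreviated unit.
            out.append(f"{value} {UNIT_KEYWORDS[keyword]}")
            j += 1
        else:
            # Cluster not adjacent to a keyword: emit plain digits.
            out.append(str(value))
        i = j
    return out


print(rewrite_number_clusters("give twenty five milligrams for three days".split()))
```

Clusters the sketch cannot parse, such as "one twenty", are passed through unchanged rather than guessed at, which leaves them for a later rule or a human reviewer.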
Specification