System and method for post processing speech recognition output
Abstract
A system and method may be disclosed for facilitating the conversion of dictation into usable and formatted documents by providing a method of post processing speech recognition output. In particular, the post processing system may be configured to implement rewrite rules and process raw speech recognition output or other raw data according to those rewrite rules. The application of the rewrite rules may format and/or normalize the raw speech recognition output into formatted or finalized documents and reports. The system may thereby reduce or eliminate the need for post processing by transcriptionists or dictation authors.
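As an illustration of what applying rewrite rules to raw recognizer output could look like, the following is a minimal sketch, not the patented implementation. The rule table, the `tokenize` helper, and the sample phrases are all hypothetical; it assumes the raw output is plain whitespace-separated text and that a rule maps one token sequence to another.

```python
from typing import List, Tuple

# A rule maps a sequence of tokens to its replacement sequence.
Rule = Tuple[Tuple[str, ...], Tuple[str, ...]]

# Hypothetical rule table; the source does not list concrete rules.
RULES: List[Rule] = [
    (("milligrams",), ("mg",)),
    (("blood", "pressure"), ("BP",)),
]


def tokenize(raw: str) -> List[str]:
    """Convert raw output to a list of lowercase tokens."""
    return raw.lower().split()


def apply_rules(tokens: List[str], rules: List[Rule]) -> List[str]:
    """Scan left to right; the first rule whose pattern matches at the
    current position is applied, otherwise the token is copied as-is."""
    out: List[str] = []
    i = 0
    while i < len(tokens):
        for pattern, replacement in rules:
            n = len(pattern)
            if tuple(tokens[i:i + n]) == pattern:
                out.extend(replacement)
                i += n
                break
        else:
            out.append(tokens[i])
            i += 1
    return out


print(apply_rules(tokenize("Blood pressure stable on ten milligrams"), RULES))
```

Rules are tried in table order, so a more specific multi-token pattern should be listed ahead of a shorter pattern that shares its first token.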
20 Claims
1. A method for converting output from a speech recognition engine, the method comprising the steps of:
collecting an output from the speech recognition engine;
converting the output to a list of tokens;
identifying a first set of tokens matching a first set of predetermined patterns;
performing a first set of rewrite rules, where the first set of rewrite rules is performed to transform text that matches a first set of predetermined patterns;
identifying and transforming number clusters adjacent to keywords;
identifying and transforming number clusters not adjacent to keywords;
performing a second set of rewrite rules;
performing a third set of rewrite rules;
performing consolidation or suppression of filled pauses and silences;
performing attachment and formatting of punctuation;
performing capitalization of words following punctuation.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
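The final steps of claim 1 (pause handling, punctuation attachment, capitalization) can be sketched roughly as follows. This is an illustrative assumption, not the claimed method: the pause labels (`uh`, `um`, `<sil>`), the spoken-punctuation vocabulary, and every function name are hypothetical, and the sketch only suppresses pauses rather than consolidating them.

```python
from typing import List

# Hypothetical labels; a real engine's pause and silence tokens may differ.
FILLED_PAUSES = {"uh", "um", "<sil>"}
SPOKEN_PUNCTUATION = {"period": ".", "comma": ",", "colon": ":"}


def suppress_pauses(tokens: List[str]) -> List[str]:
    """Drop filled pauses and silence markers."""
    return [t for t in tokens if t not in FILLED_PAUSES]


def attach_punctuation(tokens: List[str]) -> List[str]:
    """Replace spoken punctuation with its mark, attached to the previous
    token. A mark with no preceding token is dropped."""
    out: List[str] = []
    for t in tokens:
        mark = SPOKEN_PUNCTUATION.get(t)
        if mark is None:
            out.append(t)
        elif out:
            out[-1] += mark
    return out


def capitalize_after_punctuation(tokens: List[str]) -> List[str]:
    """Capitalize the first token and any token after ., ? or !"""
    out: List[str] = []
    sentence_start = True
    for t in tokens:
        out.append(t[:1].upper() + t[1:] if sentence_start else t)
        sentence_start = t.endswith((".", "?", "!"))
    return out


def finalize(raw: str) -> str:
    tokens = raw.split()
    tokens = suppress_pauses(tokens)
    tokens = attach_punctuation(tokens)
    return " ".join(capitalize_after_punctuation(tokens))


print(finalize("the patient is um stable period uh follow up in two weeks period"))
```

Running the stages in this order matters in the sketch: pauses are removed first so that a punctuation mark attaches to the last real word, and capitalization runs last so it can see the attached marks.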
11. A method for converting output from a speech recognition engine, the method comprising the steps of:
collecting output from a speech recognition engine whereby the output includes text having at least one of a token and a number cluster;
converting automatic speech recognition engine output to a list of tokens;
performing at least a first rule interpretation, where the rule interpretation is performed to transform text that matches a first set of predetermined patterns;
wherein the output includes a first number cluster adjacent to a keyword context, converting said first number cluster to a first format based upon the keyword context;
wherein the output includes a second number cluster not adjacent to the keyword, converting said second number cluster to a second format based upon a different set of rules;
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
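To make the distinction in claim 11 concrete, here is a hedged sketch of formatting a number cluster one way when a keyword is adjacent and another way when it is not. The keyword table, the "spell out values below ten" fallback, and all names are assumptions chosen for illustration; the number parser is deliberately naive and only handles simple values below one thousand.

```python
from typing import Dict, List

UNITS: Dict[str, int] = {w: i for i, w in enumerate(
    "zero one two three four five six seven eight nine ten eleven twelve "
    "thirteen fourteen fifteen sixteen seventeen eighteen nineteen".split())}
TENS: Dict[str, int] = {w: (i + 2) * 10 for i, w in enumerate(
    "twenty thirty forty fifty sixty seventy eighty ninety".split())}
NUMBER_WORDS = set(UNITS) | set(TENS) | {"hundred"}

# Hypothetical keyword contexts: a keyword right after the cluster picks the format.
KEYWORD_FORMATS: Dict[str, str] = {"milligrams": "{n} mg", "percent": "{n}%"}


def cluster_value(words: List[str]) -> int:
    """Naive parse of a number-word cluster such as 'two hundred fifty'."""
    total = 0
    for w in words:
        if w == "hundred":
            total = max(total, 1) * 100
        else:
            total += UNITS.get(w, 0) + TENS.get(w, 0)
    return total


def format_number_clusters(tokens: List[str]) -> List[str]:
    out: List[str] = []
    i = 0
    while i < len(tokens):
        j = i
        while j < len(tokens) and tokens[j] in NUMBER_WORDS:
            j += 1
        if j == i:  # not a number word, copy through
            out.append(tokens[i])
            i += 1
            continue
        value = cluster_value(tokens[i:j])
        keyword = tokens[j] if j < len(tokens) else None
        if keyword in KEYWORD_FORMATS:
            # Cluster adjacent to a keyword: format from the keyword context.
            out.append(KEYWORD_FORMATS[keyword].format(n=value))
            i = j + 1
        else:
            # No adjacent keyword: assumed fallback, digits from ten upward.
            out.append(str(value) if value >= 10 else " ".join(tokens[i:j]))
            i = j
    return out


print(format_number_clusters(
    "give twenty five milligrams for three days then recheck in fourteen days".split()))
```

The sketch only looks at the token immediately after a cluster; a keyword preceding the cluster (for example a label spoken before the value) would need a symmetric check.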
Specification