System and method for post processing speech recognition output

US 7,996,223 B2
Filed: 09/29/2004
Issued: 08/09/2011
Est. Priority Date: 10/01/2003
Status: Expired due to Fees

First Claim

Patent Images

1. A computer implemented method for altering output from a speech recognition engine, the method comprising:

inputting the output from a speech recognition engine into a post processor configured to convert the output, comprising speech-recognized dictation in the form of unformatted raw text or data, from the speech recognition engine to a formatted text document that is a transcription of the dictation, the post processor operatively configured to;

convert said output to a list of tokens;

identify a set of tokens from said list of tokens matching a first set of predetermined patterns; and

perform a set of rewrite rules based on said identified set of tokens, the rewrite rules used to transform the output of the speech recognition engine to the formatted text document that corresponds to the dictation, wherein the set of rewrite rules comprise;

identify section headings in the speech-recognized dictation;

format the section headings according to a standard practice of a specific site;

identify and transform number clusters adjacent to keywords in the speech-recognized dictation;

identify and transform number clusters not adjacent to keywords in the speech-recognized dictation;

perform consolidation or suppression of filled pauses and silences in the speech-recognized dictation;

perform attachment and format punctuation in the speech-recognized dictation; and

perform capitalization of words following punctuation in the speech-recognized dictation to thereby provide the formatted text document for writing to a storage device.

View all claims

7 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A system and method may be disclosed for facilitating the conversion of dictation into usable and formatted documents by providing a method of post processing speech recognition output. In particular, the post processing system may be configured to implement rewrite rules and process raw speech recognition output or other raw data according to those rewrite rules. The application of the rewrite rules may format and/or normalize the raw speech recognition output into formatted or finalized documents and reports. The system may thereby reduce or eliminate the need for post processing by transcriptionists or dictation authors.

Citations

11 Claims

1. A computer implemented method for altering output from a speech recognition engine, the method comprising:
- inputting the output from a speech recognition engine into a post processor configured to convert the output, comprising speech-recognized dictation in the form of unformatted raw text or data, from the speech recognition engine to a formatted text document that is a transcription of the dictation, the post processor operatively configured to;
  
  convert said output to a list of tokens;
  
  identify a set of tokens from said list of tokens matching a first set of predetermined patterns; and
  
  perform a set of rewrite rules based on said identified set of tokens, the rewrite rules used to transform the output of the speech recognition engine to the formatted text document that corresponds to the dictation, wherein the set of rewrite rules comprise;
  
  identify section headings in the speech-recognized dictation;
  
  format the section headings according to a standard practice of a specific site;
  
  identify and transform number clusters adjacent to keywords in the speech-recognized dictation;
  
  identify and transform number clusters not adjacent to keywords in the speech-recognized dictation;
  
  perform consolidation or suppression of filled pauses and silences in the speech-recognized dictation;
  
  perform attachment and format punctuation in the speech-recognized dictation; and
  
  perform capitalization of words following punctuation in the speech-recognized dictation to thereby provide the formatted text document for writing to a storage device.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
- - 2. The method according to claim 1, wherein a second set of rewrite rules may operate on the result of the first set of rewrite rules as well as the transformed number clusters.
  - 3. The method according to claim 2, further comprising performing at least a second rule interpretation performed to transform text that matches a second set of predetermined patterns.
  - 4. The method according to claim 3, further comprising converting punctuation tokens into symbols that cling to an adjacent word.
  - 5. The method according to claim 4, where the adjacent word is capitalized.
  - 6. The method according to claim 5, where filled pauses and silences are consolidated or suppressed and the boundaries between them and adjacent words are adjusted.
  - 7. The method according to claim 6, further comprising removing text used according to a set of post processor rules.
  - 8. The method according to claim 7, further comprising writing a file containing post-processed output to a target location.
  - 9. The method according to claim 8, where the target location is a buffer.
  - 10. The method according to claim 9, where the target location is a file.

11. A computer implemented method for converting output from a speech recognition engine, the method comprising:
- inputting the output of the speech recognition engine to a post processor configured to convert the output, comprising speech-recognized dictation in the form of unformatted raw text or data, from the speech recognition engine to a formatted text document that is a transcription of the dictation;
  
  converting, by the post processor, the output to a list of tokens;
  
  identifying, by the post processor, a set of tokens from the list of tokens matching a first set of predetermined patterns; and
  
  performing, by the post processor, a set of rewrite rules based on the identified set of tokens, the rewrite rules used to transform the output of the speech recognition engine to the formatted text document that corresponds to the dictation, wherein performing the set of rewrite rules comprises identifying and transforming number clusters adjacent to keywords in the speech-recognized dictation, identifying and transforming number clusters not adjacent to keywords in the speech-recognized dictation, performing consolidation or suppression of filled pauses and silences in the speech-recognized dictation, and performing attachment and formatting punctuation in the speech-recognized dictation to thereby provide the formatted text document for writing to a storage device.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Nuance Communications, Inc. (Microsoft Corporation)
Original Assignee
Dictaphone Corporation (Microsoft Corporation)
Inventors
Santisteban, Ana, Frankel, Alan
Primary Examiner(s)
Saint Cyr; Leonard

Application Number

US10/953,474
Publication Number

US 20050108010A1
Time in Patent Office

2,505 Days
Field of Search

None
US Class Current

704/252
CPC Class Codes

G06F 40/103 Formatting, i.e. changing o...

G10L 15/26 Speech to text systems G10L...

System and method for post processing speech recognition output

First Claim

7 Assignments

0 Petitions

Accused Products

Abstract

Citations

11 Claims

Specification

Solutions

Use Cases

Quick Links

System and method for post processing speech recognition output

First Claim

7 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

11 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links