Speech recognition system distinguishing dictation from commands by arbitration between continuous speech and isolated word modules

US 5,794,196 A
Filed: 06/24/1996
Issued: 08/11/1998
Est. Priority Date: 06/30/1995
Status: Expired due to Term

Abstract

In the speech recognition system disclosed herein, an input utterance is submitted to both a large vocabulary isolated word speech recognition module and a small vocabulary continuous speech recognition module. The small vocabulary contains command words which can be combined in sequences to define commands to an application program. The two recognition modules generate respective scores for identified large vocabulary models and for sequences of small vocabulary models. The score provided by the continuous speech recognizer is normalized on the basis of the length of the speech input utterance and an arbitration algorithm selects among the candidates identified by the recognition modules. Without requiring the user to switch modes, text is output if a score from the isolated word recognizer is selected and a command is output if a score from the continuous speech recognizer is selected.
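The arbitration step described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation: the names `Candidate` and `arbitrate`, the lower-is-better score convention, the per-frame form of the length normalization, and the assumption that the isolated word scores are already on a comparable scale are all assumptions introduced here.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Candidate:
    source: str    # "isr" (text word) or "csr" (command word sequence)
    payload: str   # the word, or the space-joined command words
    score: float   # assumed convention: lower means a better match


def arbitrate(isr_scores: List[Tuple[str, float]],
              csr_sequence: Tuple[List[str], float],
              num_frames: int,
              scale: float = 1.0) -> Tuple[str, str]:
    """Return ("text", word) or ("command", words) for one utterance."""
    # One candidate per scored text word from the isolated word recognizer.
    candidates = [Candidate("isr", word, score) for word, score in isr_scores]

    # The continuous recognizer contributes one scored sequence. Its raw score
    # is divided by the utterance length (assumed normalization), then scaled.
    words, raw_score = csr_sequence
    normalized = scale * raw_score / max(num_frames, 1)
    candidates.append(Candidate("csr", " ".join(words), normalized))

    # Select the best-scoring candidate; its origin decides the output type.
    best = min(candidates, key=lambda c: c.score)
    kind = "text" if best.source == "isr" else "command"
    return kind, best.payload
```

Under these assumptions the caller never sets a mode: the output type follows from whichever recognizer produced the winning candidate.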

10 Claims

1. A speech recognition system which separately outputs text and commands comprising:
- an isolated word speech recognizer;
  
  accessible by said isolated word speech recognizer, a first vocabulary of respective text word models, said isolated word speech recognizer operating to compare speech input with at least a selected portion of said first vocabulary and to provide a plurality of scores indicating the degree of match of said speech input with an identified one or more of said models;
  
  a continuous speech recognizer;
  
  accessible by said continuous speech recognizer, a second vocabulary of respective command word models, said continuous speech recognizer operating to compare speech input to said second vocabulary of command word models and to provide a score indicating the degree of match of said speech input with at least one identified sequence of the respective models;
  
  an arbitration algorithm for selecting from among the models identified by said isolated word speech recognizer and the sequence of models identified by said continuous speech recognizer and for outputting corresponding text if a score from said isolated word recognizer is selected and outputting a respective command if a score from said continuous speech recognizer is selected.
  - 2. A recognition system as set forth in claim 1 further comprising means for applying a relative scaling to said recognizer scores by a factor empirically trained to minimize incursions by each of said vocabularies on correct results from the other vocabulary.
  - 3. A recognition system as set forth in claim 2 wherein said scaling factor is determined by applying, with a given factor,
    A. isolated word test data to both recognizers and counting the intrusions by CSR models on correct ISR translations, and
    B. continuous word test data to both recognizers and counting intrusions by ISR models on correct CSR translations,
    and then adjusting said factor to minimize intrusions.
  - 4. A recognition system as set forth in claim 1 further comprising means for testing a sequence of models scored by said continuous speech recognizer to determine if the sequence parses into an executable command.
  - 5. A recognition system as set forth in said claim 1 wherein said text word vocabulary includes in excess of 5000 models and said command word vocabulary includes fewer than 2000 models.
  - 6. A recognition system as set forth in claim 1 further comprising means for normalizing the score provided by said continuous speech recognizer on the basis of the length of the speech input.
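
The empirical training of the scaling factor in claims 2 and 3 might look something like the sketch below. It is a simplified illustration under stated assumptions: `decide` is a hypothetical hook that runs both recognizers on one test utterance and reports which one wins at a given factor, every test utterance is assumed to be correctly recognized by its own recognizer, and the grid search is a stand-in, since the claims say only that the factor is adjusted to minimize intrusions.

```python
from typing import Callable, Iterable, Sequence, TypeVar

U = TypeVar("U")  # whatever type represents one test utterance

# Hypothetical hook: returns "isr" or "csr", the recognizer whose candidate
# wins arbitration for this utterance at the given scaling factor.
Decide = Callable[[U, float], str]


def count_intrusions(decide: Decide,
                     isolated_data: Sequence[U],
                     continuous_data: Sequence[U],
                     scale: float) -> int:
    """Total intrusions in both directions for one candidate factor."""
    # A. CSR candidates winning on isolated word (dictation) test data.
    csr_on_isr = sum(1 for u in isolated_data if decide(u, scale) == "csr")
    # B. ISR candidates winning on continuous (command) test data.
    isr_on_csr = sum(1 for u in continuous_data if decide(u, scale) == "isr")
    return csr_on_isr + isr_on_csr


def tune_scale(decide: Decide,
               isolated_data: Sequence[U],
               continuous_data: Sequence[U],
               grid: Iterable[float]) -> float:
    """Pick the factor on the grid with the fewest intrusions (assumed search)."""
    return min(grid, key=lambda s: count_intrusions(
        decide, isolated_data, continuous_data, s))
```

Summing the two counts weights both kinds of error equally; a real system could weight them differently, which the claims do not specify.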

7. A speech recognition system which separately outputs text and commands comprising:
- an isolated word speech recognizer;
  
  accessible by said isolated word speech recognizer, a first vocabulary of respective text word models numbering in excess of 5000, said isolated word speech recognizer operating to compare speech input with at least a selected portion of said first vocabulary and to provide a plurality of scores indicating the degree of match of said speech input with identified ones of said models;
  
  a continuous speech recognizer;
  
  accessible by said continuous speech recognizer, a second vocabulary of respective command word models numbering less than 2000, said continuous speech recognizer operating to compare speech input to said second vocabulary of models and to provide a score indicating the degree of match of said speech input with an identified sequence of the respective models;
  
  means for normalizing the score provided by said continuous speech recognizer on the basis of the length of the speech input;
  
  an arbitration algorithm for selecting from among the models identified by said isolated word speech recognizer and the sequence of models identified by said continuous speech recognizer and for outputting corresponding text if a score from said isolated word recognizer is selected and outputting a respective command if a score from said continuous speech recognizer is selected.
  - 8. A recognition system as set forth in claim 7 further comprising means for applying a relative scaling to said recognizer scores by a factor empirically trained to minimize incursions by each of said vocabularies on correct results from the other vocabulary.

9. A speech recognition system which separately outputs text and commands comprising:
- an isolated word speech recognizer;
  
  accessible by said isolated word speech recognizer, a vocabulary of respective text word models numbering in excess of 5000, said isolated word speech recognizer operating to compare speech input with at least a selected portion of said first vocabulary and to provide a plurality of scores indicating the degree of match of said speech input with identified ones of said models;
  
  a continuous speech recognizer;
  
  accessible by said continuous speech recognizer, a second vocabulary of respective command word models numbering less than 2000, said continuous speech recognizer operating to compare speech input to said second vocabulary of models and to provide a score indicating the degree of match of said speech input with an identified sequence of the respective models;
  
  means for normalizing the score provided by said continuous speech recognizer on the basis of the length of the speech input;
  
  means for applying a relative scaling to said recognizer scores by a factor empirically trained to minimize incursions by each of said vocabularies on correct results from the other vocabulary; and
  
  an arbitration algorithm for selecting from among the models identified by said isolated word speech recognizer and the sequence of models identified by said continuous speech recognizer and for outputting corresponding text if a score from said isolated word recognizer is selected and outputting a respective command if a score from said continuous speech recognizer is selected.
  - 10. A recognition system as set forth in claim 9 wherein said scaling factor is determined by applying, with a given factor,
    A. isolated word test data to both recognizers and counting the intrusions by CSR models on correct ISR translations, and
    B. continuous word test data to both recognizers and counting intrusions by ISR models on correct CSR translations,
    and then adjusting said factor to minimize intrusions.

Current Assignee: Nuance Communications, Inc. (Microsoft Corporation)
Original Assignee: Kurzweil Applied Intelligence, Inc. (Intel Corporation)
Inventors: Yegnanarayanan, Girija; Hsu, Dong; Armstrong, John III
Primary Examiner(s): Hudspeth, David R.
Assistant Examiner(s): Smits, Talivaldis Ivars
Application Number: US08/669,242
Time in Patent Office: 778 Days
Field of Search: 395/2.48, 395/2.6, 395/2.62, 395/2.64, 704/239, 704/251, 704/253, 704/255
US Class Current: 704/255
CPC Class Codes:
G10L 15/26 Speech to text systems G10L...
G10L 2015/223 Execution procedure of a sp...
