Speech recognition using re-utterance recognition

US 7,444,286 B2
Filed: 12/05/2004
Issued: 10/28/2008
Est. Priority Date: 09/05/2001
Status: Expired due to Term

Abstract

The present invention relates to speech recognition that enables a user to perform re-utterance recognition, in which speech recognition is performed upon both a second saying of a sequence of one or more words and upon an earlier saying of the same sequence to help the speech recognition better select one or more best scoring text sequences for the utterances.
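As a rough illustration of scoring candidates against both sayings, the following minimal sketch sums a per-candidate log score from each utterance and ranks the results. The function name `combine_scores`, the `floor` penalty for candidates missing from one list, and all score values are assumptions made for illustration; the patent text here does not specify how the two scores are combined.

```python
from typing import Dict, List, Tuple


def combine_scores(
    original: Dict[str, float],
    reutterance: Dict[str, float],
    floor: float = -50.0,
) -> List[Tuple[str, float]]:
    """Rank candidates by the sum of their log scores against both utterances.

    A candidate missing from one score table receives the (assumed) floor score.
    """
    candidates = set(original) | set(reutterance)
    combined = {
        text: original.get(text, floor) + reutterance.get(text, floor)
        for text in candidates
    }
    # Highest combined score first.
    return sorted(combined.items(), key=lambda item: item[1], reverse=True)


# Hypothetical log scores for the same candidates against each utterance.
original_scores = {"wreck a nice beach": -10.0, "recognize speech": -11.0}
reutterance_scores = {"recognize speech": -9.0, "wreck a nice beach": -14.0}

ranked = combine_scores(original_scores, reutterance_scores)
best_text, best_score = ranked[0]
```

In this made-up example the original recognition alone would prefer "wreck a nice beach", while the combined score prefers "recognize speech".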

5 Claims

1. A method of speech recognition comprising:
- receiving an original utterance of one or more words;
  
  performing an original speech recognition upon the original utterance;
  
  producing a user perceivable output representing one or more sequences of one or more words selected by the recognition as most likely corresponding to the utterance;
  
  providing a user interface that allows a user to select to perform a re-utterance recognition upon a part of the original utterance corresponding to all or a selected part of the user perceivable output; and
  
  responding to a user selection to perform a re-utterance recognition upon all or a part of the original utterance by;
  
  treating a second utterance received in association with the selection as a re-utterance of the selected portion of the original utterance; and
  
  performing speech recognition upon the re-utterance to select one or more sequences of one or more words considered to most likely match the re-utterance based on the scoring of the one or more words against both the re-utterance and the selected portion of the original utterance;
  
  wherein;
  
  the original recognition of the original utterance is by continuous speech recognition;
  
  the re-utterance is recognized by discrete speech recognition; and
  
  the number of utterances detected with a re-utterance recognized by discrete recognition is used to determine the number of words allowable in sequences of one or more words recognized for the original utterance after the re-utterance.
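The final limitation of claim 1 can be sketched as a word-count constraint: if the discretely spoken re-utterance is detected as N separate utterances, candidate sequences for the original utterance are limited accordingly. The sketch below assumes the simplest reading, that allowable candidates contain exactly N words; the function name `filter_by_word_count` and the candidate strings are hypothetical.

```python
from typing import List


def filter_by_word_count(candidates: List[str], detected_utterances: int) -> List[str]:
    """Keep only candidates whose word count equals the number of discrete
    utterances detected in the re-utterance (an assumed exact-match rule)."""
    return [c for c in candidates if len(c.split()) == detected_utterances]


# Hypothetical candidates from the continuous recognition of the original utterance.
original_candidates = ["recognize speech", "wreck a nice beach", "recognized peach"]

# Suppose the discrete re-utterance was detected as two separate utterances.
allowed = filter_by_word_count(original_candidates, detected_utterances=2)
```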

2. A method of speech recognition comprising:
- receiving an original utterance of one or more words;
  
  performing an original speech recognition upon the original utterance;
  
  producing a user perceivable output representing one or more sequences of one or more words selected by the recognition as most likely corresponding to the utterance;
  
  providing a user interface that allows a user to select to perform a re-utterance recognition upon a part of the original utterance corresponding to all or a selected part of the user perceivable output; and
  
  responding to a user selection to perform a re-utterance recognition upon all or a part of the original utterance by;
  
  treating a second utterance received in association with the selection as a re-utterance of the selected portion of the original utterance; and
  
  performing speech recognition upon the re-utterance to select one or more sequences of one or more words considered to most likely match the re-utterance based on the scoring of the one or more words against both the re-utterance and the selected portion of the original utterance;
  
  wherein the selection of a sequences of one or more words considered to most likely match both the re-utterance and the selected portion of the original utterance is used to update acoustic models with data from the selected portion of the original utterance.

3. A method as in claim 2 wherein both the original utterance and the re-utterance are recognized by discrete speech recognition.

4. A method as in claim 2 wherein both the original utterance and the re-utterance are recognized by continuous speech recognition.
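The model update recited in claim 2 might look something like the following sketch, in which each word's acoustic model is reduced to a running mean of feature vectors and the confirmed word is updated with a frame taken from the original utterance. This is a deliberately simplified stand-in: the `Model` type, the `update_model` function, and the two-dimensional feature vectors are all assumptions, and the claim does not state what form the acoustic models or the update take.

```python
from typing import Dict, List, Tuple

# Hypothetical model: word -> (mean feature vector, number of frames seen).
Model = Dict[str, Tuple[List[float], int]]


def update_model(model: Model, word: str, frame: List[float]) -> None:
    """Fold one feature frame into the running mean stored for `word`."""
    mean, count = model.get(word, ([0.0] * len(frame), 0))
    new_count = count + 1
    # Incremental mean: m_new = m_old + (x - m_old) / n
    new_mean = [m + (x - m) / new_count for m, x in zip(mean, frame)]
    model[word] = (new_mean, new_count)


model: Model = {}
# Once the re-utterance recognition selects "speech", frames from the
# selected portion of the original utterance are used as training data.
update_model(model, "speech", [1.0, 3.0])
update_model(model, "speech", [3.0, 5.0])
```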

5. A method of speech recognition comprising:
- receiving an original utterance of one or more words;
  
  performing an original speech recognition upon the original utterance;
  
  producing a user perceivable output representing one or more sequences of one or more words selected by the recognition as most likely corresponding to the utterance;
  
  providing a user interface that allows a user to select to perform a re-utterance recognition upon a part of the original utterance corresponding to all or a selected part of the user perceivable output; and
  
  responding to a user selection to perform a re-utterance recognition upon all or a part of the original utterance by;
  
  treating a second utterance received in association with the selection as a re-utterance of the selected portion of the original utterance; and
  
  performing speech recognition upon the re-utterance to select one or more sequences of one or more words considered to most likely match the re-utterance based on the scoring of the one or more words against both the re-utterance and the selected portion of the original utterance;
  
  wherein;
  
  the user interface allows a user to select one or more word filtering inputs, each indicating that the desired output has certain characteristics, to be used in conjunction with the re-utterance recognition;
  
  the process of selecting of one or more sequences as most likely matching both the re-utterance and the original utterance also uses the selected filtering inputs to favor the selection of any recognition candidates having the selected characteristics; and
  
  the user interface allows a user to select as said word filtering inputs alphabetic filtering inputs comprised of a partial spelling of one or more words indicating that the desired output contains a sequence of one or more words that start with the sequence of one or more letters contained in said partial spelling.
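The alphabetic filtering of claim 5 can be sketched as a score bonus for candidates that begin with the partial spelling the user has entered. The claim says the filtering inputs are used to "favor" matching candidates, so this sketch adds a bonus rather than discarding non-matching candidates; the function name `favor_prefix`, the bonus value, and the scores are assumptions for illustration.

```python
from typing import List, Tuple


def favor_prefix(
    scored: List[Tuple[str, float]],
    partial_spelling: str,
    bonus: float = 5.0,
) -> List[Tuple[str, float]]:
    """Add an (assumed) bonus to candidates that start with the partial
    spelling, then rank all candidates by their adjusted score."""
    prefix = partial_spelling.lower()
    rescored = [
        (text, score + bonus if text.lower().startswith(prefix) else score)
        for text, score in scored
    ]
    return sorted(rescored, key=lambda item: item[1], reverse=True)


# Hypothetical combined scores before filtering.
candidates = [("wreck a nice beach", -20.0), ("recognize speech", -22.0)]

# The user types "rec" as a partial spelling of the desired output.
ranked = favor_prefix(candidates, "rec")
```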

Current Assignee: Cerence Operating Company (Cerence Inc.)
Original Assignee: Nuance Communications, Inc. (Microsoft Corporation)
Inventors: Roth, Daniel L., Cohen, Jordan R.
Primary Examiner(s): Hudspeth; David R.
Assistant Examiner(s): ALBERTALLI, BRIAN LOUIS
Application Number: US11/005,567
Publication Number: US 20050159950A1
Time in Patent Office: 1,423 Days
Field of Search: None
US Class Current: 704/270
CPC Class Codes: G10L 15/22 Procedures used during a sp...
