Speech recognition using re-utterance recognition
First Claim
Patent Images
1. A method of speech recognition comprising:
- receiving an original utterance of one or more words;
performing an original speech recognition upon the original utterance;
producing a user perceivable output representing one or more sequences of one or more words selected by the recognition as most likely corresponding to the utterance;
providing a user interface that allows a user to select to perform a re-utterance recognition upon a part of the original utterance corresponding to all or a selected part of the user perceivable output; and
responding to a user selection to perform a re-utterance recognition upon all or a part of the original utterance by;
treating a second utterance received in association with the selection as a re-utterance of the selected portion of the original utterance; and
performing speech recognition upon the re-utterance to select one or more sequences of one or more words considered to most likely match the re-utterance based on the scoring of the one or more words against both the re-utterance and the selected portion of the original utterance;
wherein;
the original recognition of the original utterance is by continuous speech recognition;
the re-utterance is recognized by discrete speech recognition; and
the number of utterances detected with a re-utterance recognized by discrete recognition is used to determine the number of words allowable in sequences of one or more words recognized for the original utterance after the re-utterance.
7 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to speech recognition that enables a user to perform re-utterance recognition, in which speech recognition is performed upon both a second saying of a sequence of one or more words and upon an earlier saying of the same sequence to help the speech recognition better select one or more best scoring text sequences for the utterances.
-
Citations
5 Claims
-
1. A method of speech recognition comprising:
-
receiving an original utterance of one or more words; performing an original speech recognition upon the original utterance; producing a user perceivable output representing one or more sequences of one or more words selected by the recognition as most likely corresponding to the utterance; providing a user interface that allows a user to select to perform a re-utterance recognition upon a part of the original utterance corresponding to all or a selected part of the user perceivable output; and responding to a user selection to perform a re-utterance recognition upon all or a part of the original utterance by; treating a second utterance received in association with the selection as a re-utterance of the selected portion of the original utterance; and performing speech recognition upon the re-utterance to select one or more sequences of one or more words considered to most likely match the re-utterance based on the scoring of the one or more words against both the re-utterance and the selected portion of the original utterance; wherein; the original recognition of the original utterance is by continuous speech recognition; the re-utterance is recognized by discrete speech recognition; and the number of utterances detected with a re-utterance recognized by discrete recognition is used to determine the number of words allowable in sequences of one or more words recognized for the original utterance after the re-utterance.
-
-
2. A method of speech recognition comprising:
-
receiving an original utterance of one or more words; performing an original speech recognition upon the original utterance; producing a user perceivable output representing one or more sequences of one or more words selected by the recognition as most likely corresponding to the utterance; providing a user interface that allows a user to select to perform a re-utterance recognition upon a part of the original utterance corresponding to all or a selected part of the user perceivable output; and responding to a user selection to perform a re-utterance recognition upon all or a part of the original utterance by; treating a second utterance received in association with the selection as a re-utterance of the selected portion of the original utterance; and performing speech recognition upon the re-utterance to select one or more sequences of one or more words considered to most likely match the re-utterance based on the scoring of the one or more words against both the re-utterance and the selected portion of the original utterance; wherein the selection of a sequences of one or more words considered to most likely match both the re-utterance and the selected portion of the original utterance is used to update acoustic models with data from the selected portion of the original utterance. - View Dependent Claims (3, 4)
-
-
5. A method of speech recognition comprising:
-
receiving an original utterance of one or more words; performing an original speech recognition upon the original utterance; producing a user perceivable output representing one or more sequences of one or more words selected by the recognition as most likely corresponding to the utterance; providing a user interface that allows a user to select to perform a re-utterance recognition upon a part of the original utterance corresponding to all or a selected part of the user perceivable output; and responding to a user selection to perform a re-utterance recognition upon all or a part of the original utterance by; treating a second utterance received in association with the selection as a re-utterance of the selected portion of the original utterance; and performing speech recognition upon the re-utterance to select one or more sequences of one or more words considered to most likely match the re-utterance based on the scoring of the one or more words against both the re-utterance and the selected portion of the original utterance; wherein; the user interface allows a user to select one or more word filtering inputs, each indicating that the desired output has certain characteristics, to be used in conjunction with the re-utterance recognition; the process of selecting of one or more sequences as most likely matching both the re-utterance and the original utterance also uses the selected filtering inputs to favor the selection of any recognition candidates having the selected characteristics; and the user interface allows a user to select as said word filtering inputs alphabetic filtering inputs comprised of a partial spelling of one or more words indicating that the desired output contains a sequence of one or more words that start with the sequence of one or more letters contained in said partial spelling.
-
Specification