Speech recognition application or server using iterative recognition constraints
Abstract
A speech recognition application including a recognition module configured to receive input utterances and an application module configured to select a recognition from the speech recognition module using output from a first iteration to select a recognition result for a second iteration. In one embodiment, the application module eliminates a previous rejected recognition result or results from the N-Best list for recognition. In another embodiment, the application module rescores N-Best entries based upon N-Best lists or information from another iteration. In another illustrated embodiment, the application module uses a limited grammar from a current N-Best list for subsequent recognition, for example for rerecognition using a recorded input from a previous iteration.
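The first two embodiments in the abstract (dropping previously rejected results, and rescoring N-Best entries with information from another iteration) can be sketched as follows. This is a minimal illustration, not the patent's implementation: the `Hypothesis` type, the `drop_rejected` and `rescore` helpers, the linear blending rule, and all scores are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hypothesis:
    text: str
    score: float  # hypothetical confidence in [0, 1]; higher is better


def drop_rejected(n_best, rejected):
    """Remove hypotheses the user already rejected in an earlier iteration."""
    return [h for h in n_best if h.text not in rejected]


def rescore(current, previous, weight=0.5):
    """Blend each current score with the score the same text received earlier.

    The linear blend is an assumed rule chosen for illustration only.
    """
    prev_scores = {h.text: h.score for h in previous}
    blended = [
        Hypothesis(h.text, (1 - weight) * h.score + weight * prev_scores.get(h.text, 0.0))
        for h in current
    ]
    return sorted(blended, key=lambda h: h.score, reverse=True)


# Made-up N-Best lists from two iterations over the same request.
first = [Hypothesis("John Smith", 0.62), Hypothesis("Jon Smyth", 0.58), Hypothesis("Joan Smith", 0.40)]
second = [Hypothesis("John Smith", 0.55), Hypothesis("Jon Smyth", 0.60), Hypothesis("Joan Smith", 0.35)]
rejected = {"John Smith"}  # the user said "no" to this result in the first iteration

candidates = rescore(drop_rejected(second, rejected), first)
best = candidates[0]
print(best.text)
```

With these made-up scores, the rejected entry never reappears, and the remaining entries are ordered by evidence from both iterations rather than from the second one alone.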
18 Claims
1. A speech recognition system comprising:
a computer storage media and a processing unit to implement instructions stored on the non-transitory computer storage media;
a contacts list or directory including contacts having a plurality of contact attributes stored on the non-transitory computer storage media;
a recognition module including instructions stored on the non-transitory computer storage media and executable by the processing unit to receive a first input utterance corresponding to a first contact attribute and provide a first N-Best list of one or more data entries in a first iteration for the first input utterance using a first grammar comprising grammars associated with contact records of the contacts list or directory;
an application module including instructions stored on the non-transitory computer storage media and executable by the processing unit to utilize the contact records associated with the first N-Best list to provide a second subset grammar or grammars for a second contact attribute limited to only the contacts corresponding to the N-Best entries of the first N-Best list and the recognition module is configured to receive a second input utterance corresponding to the second contact attribute and process the second input utterance using the second subset grammar or grammars to recognize the second contact attribute to provide a second N-Best list; and
an application component including instructions stored on the non-transitory computer storage media and executable by the processing unit to select or order the one or more N-Best entries from the first or second N-Best lists using information associated with the first or second iterations.
Dependent claims: 2, 3, 4, 5, 6, 7
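The grammar-narrowing step recited in claim 1 can be illustrated with a short sketch. It assumes a grammar is simply the set of phrases the recognizer may return and that the two contact attributes are last name and first name; the `Contact` type, the helper names, and the sample directory are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Contact:
    contact_id: int
    first_name: str
    last_name: str
    city: str


def full_grammar(contacts, attribute):
    """First-iteration grammar: every distinct value of `attribute` in the directory."""
    return {getattr(c, attribute) for c in contacts}


def subset_grammar(contacts, first_n_best, first_attribute, second_attribute):
    """Second-iteration grammar, limited to contacts matching the first N-Best list."""
    matched = [c for c in contacts if getattr(c, first_attribute) in first_n_best]
    return {getattr(c, second_attribute) for c in matched}


# Hypothetical directory and a made-up first N-Best list for a spoken last name.
directory = [
    Contact(1, "John", "Smith", "Seattle"),
    Contact(2, "Jon", "Smyth", "Redmond"),
    Contact(3, "Mary", "Smith", "Tacoma"),
    Contact(4, "Alice", "Jones", "Seattle"),
]
first_n_best = ["Smith", "Smyth"]

second = subset_grammar(directory, first_n_best, "last_name", "first_name")
print(sorted(second))
```

Because "Jones" is not in the first N-Best list, "Alice" is left out of the second grammar, so the second recognition pass only has to distinguish among first names of the surviving contacts.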
8. A method for retrieving a contact from a contact list or directory comprising the steps of:
receiving a first input utterance from an audio input device coupled to a computing device;
recognizing the first input utterance using a processing unit of the computing device and outputting a first N-best list corresponding to the first input utterance using a first grammar or grammars comprising grammars associated with contacts in the contact list or directory during one recognition iteration;
creating a second grammar or grammars for a second input utterance using the processing unit of the computing device and a subset of grammars limited to a subset of the contacts corresponding to one or more entries of the first N-Best list;
receiving the second input utterance from the audio input device;
recognizing and outputting a second N-Best list corresponding to the second input utterance using the processing unit of the computing device and the second grammar or grammars during a different recognition iteration; and
selecting a recognition result for the different iteration using the processing unit of the computing device and the information for the one recognition iteration and the second N-Best list from the different recognition iteration.
Dependent claims: 9, 10, 11, 12, 13, 14, 15
16. A method comprising:
receiving a first input utterance from an audio input device coupled to a computing device to retrieve a contact from a contact list or directory;
processing the first input utterance using a processing unit of the computing device and using a first grammar comprising grammars associated with contacts of the contact list or directory for a first contact attribute to recognize the first input utterance during a first recognition iteration;
generating a first N-best list for the first contact attribute;
storing information from the first recognition iteration on a non-transitory data storage media;
processing data for one or more entities of the first N-best list associated with the first input utterance using the processing unit of the computing device and generating a second grammar comprising a subset grammar or grammars limited to contact records associated with the one or more entries of the first N-best list;
processing a second input utterance from the audio input device using the processing unit and the second grammar for a second contact attribute to recognize the second input utterance during a second recognition iteration wherein the first and second attributes correspond to different contact attributes of the contact records of the contact list or directory;
generating a second N-Best list corresponding to the second input utterance for the second contact attribute; and
selecting a recognition result for the second iteration using the second N-Best list to provide the contact from the contact list or directory corresponding to the second input utterances.
Dependent claims: 17, 18
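The sequence of steps in claim 16 can be traced end to end with a toy example. This is only a sketch under strong assumptions: utterances are represented as text strings rather than audio, the `recognize` function ranks grammar phrases by string similarity using Python's `difflib.SequenceMatcher` as a stand-in for a real speech recognizer, and the `CONTACTS` data and `lookup` helper are hypothetical.

```python
from difflib import SequenceMatcher

# Hypothetical contact records with two attributes each.
CONTACTS = [
    {"last_name": "Smith", "city": "Seattle"},
    {"last_name": "Smyth", "city": "Redmond"},
    {"last_name": "Smits", "city": "Seaside"},
    {"last_name": "Jones", "city": "Seattle"},
]


def recognize(utterance, grammar, n=3):
    """Toy recognizer: rank grammar phrases by string similarity to the utterance."""
    scored = [
        (SequenceMatcher(None, utterance.lower(), phrase.lower()).ratio(), phrase)
        for phrase in grammar
    ]
    scored.sort(reverse=True)
    return [(phrase, score) for score, phrase in scored[:n]]


def lookup(first_utterance, second_utterance, n=3):
    # Iteration 1: recognize the first attribute against the full directory grammar.
    first_grammar = {c["last_name"] for c in CONTACTS}
    first_n_best = recognize(first_utterance, first_grammar, n)

    # Keep information from iteration 1: the names on the first N-Best list.
    stored = {name for name, _ in first_n_best}

    # Build the second grammar only from contacts on the first N-Best list.
    candidates = [c for c in CONTACTS if c["last_name"] in stored]
    second_grammar = {c["city"] for c in candidates}

    # Iteration 2: recognize a different attribute against the subset grammar.
    second_n_best = recognize(second_utterance, second_grammar, n)

    # Select the contact(s) matching the top entry of the second N-Best list.
    best_city = second_n_best[0][0]
    return [c for c in candidates if c["city"] == best_city]


result = lookup("smith", "seattle")
print(result)
```

In this toy data two contacts share the city "Seattle", but the "Jones" record is not among the candidates carried over from the first iteration, so the second pass cannot select it.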
Specification