Post processing of natural language automatic speech recognition
First Claim
1. A post-processing automated speech recognition system comprising:
- a natural language-based speech recognition system that compares a spoken utterance to a natural language vocabulary comprising words in one or more active grammars to generate a natural language speech recognition result and a natural language confidence score;
a master conversation module engine that post processes the natural language speech recognition result by comparing the natural language speech recognition result generated by the natural language-based speech processing system to lists of words and phrases from a plurality of active sub-grammars that are mapped to a plurality of domain specific words and phrases, and generates a post processed natural language speech recognition result confidence score for each of the listed words and phrases from the plurality of active sub-grammars based on a likelihood that the respective listed words and phrases from the plurality of active sub-grammars match the natural language speech recognition result;
a voting module engine that is interfaced to the master conversation module engine and that is operable to select a word or a phrase from the list of words and phrases from the plurality of active sub-grammars, based on the post processed natural language speech recognition result confidence scores;
wherein:
when the voting module engine selects a word or phrase from the plurality of active sub-grammars based on the post processed natural language speech recognition result confidence scores, the selected word or phrase is transmitted to an application control system that transmits control signals used to control an internal or an external device or an internal or an external process;
or when both of:
the voting module does not select a word or phrase from the plurality of active sub-grammars based on the post processed natural language speech recognition result confidence scores, and
the natural language confidence score exceeds a specified threshold,
then the master conversation module engine selects the natural language speech recognition result to be transmitted to the application control system;
where the natural language-based speech recognition system, the master conversation module engine, and the voting module engine are executed by one or more processors.
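The selection logic recited in this claim can be illustrated with a minimal sketch. It is not the patented implementation: the string-similarity metric (`difflib.SequenceMatcher`), the names `NLResult`, `score_sub_grammars`, `vote`, and `select_result`, and both threshold values are assumptions chosen only for illustration.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import Dict, List, Optional


@dataclass
class NLResult:
    text: str          # natural language speech recognition result
    confidence: float  # natural language confidence score, assumed to lie in [0, 1]


def score_sub_grammars(nl_text: str, sub_grammars: Dict[str, List[str]]) -> Dict[str, float]:
    """Give every listed phrase a post-processing score.

    Assumption: the 'likelihood of a match' is approximated by plain
    string similarity; the claim does not prescribe a metric.
    """
    scores: Dict[str, float] = {}
    for phrases in sub_grammars.values():
        for phrase in phrases:
            scores[phrase] = SequenceMatcher(None, nl_text.lower(), phrase.lower()).ratio()
    return scores


def vote(scores: Dict[str, float], min_score: float) -> Optional[str]:
    """Return the best-scoring phrase, or None if nothing reaches min_score."""
    if not scores:
        return None
    best = max(scores, key=scores.get)
    return best if scores[best] >= min_score else None


def select_result(nl: NLResult,
                  sub_grammars: Dict[str, List[str]],
                  vote_threshold: float = 0.8,   # hypothetical value
                  nl_threshold: float = 0.6      # hypothetical value
                  ) -> Optional[str]:
    """Prefer a sub-grammar phrase; otherwise fall back to a confident NL result."""
    chosen = vote(score_sub_grammars(nl.text, sub_grammars), vote_threshold)
    if chosen is not None:
        return chosen                 # first branch of the "wherein" clause
    if nl.confidence > nl_threshold:
        return nl.text                # second branch: no vote, confident NL result
    return None                       # nothing is forwarded


sub_grammars = {
    "phone": ["call home", "redial"],
    "media": ["play music", "next track"],
}
print(select_result(NLResult("call home", 0.9), sub_grammars))
print(select_result(NLResult("what is the weather tomorrow", 0.9), sub_grammars))
```

In this sketch a `None` return stands for the case the claim leaves open, where neither branch applies and no result is sent to the application control system.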
Abstract
A post-processing speech system includes a natural language-based speech recognition system that compares a spoken utterance to a natural language vocabulary that includes words used to generate a natural language speech recognition result. A master conversation module engine compares the natural language speech recognition result to domain specific words and phrases. A voting engine selects a word or a phrase from the domain specific words and phrases that is transmitted to an application control system. The application control system transmits one or more control signals that are used to control an internal or an external device or an internal or an external process.
20 Claims
1. A post-processing automated speech recognition system comprising:
a natural language-based speech recognition system that compares a spoken utterance to a natural language vocabulary comprising words in one or more active grammars to generate a natural language speech recognition result and a natural language confidence score;
a master conversation module engine that post processes the natural language speech recognition result by comparing the natural language speech recognition result generated by the natural language-based speech processing system to lists of words and phrases from a plurality of active sub-grammars that are mapped to a plurality of domain specific words and phrases, and generates a post processed natural language speech recognition result confidence score for each of the listed words and phrases from the plurality of active sub-grammars based on a likelihood that the respective listed words and phrases from the plurality of active sub-grammars match the natural language speech recognition result;
a voting module engine that is interfaced to the master conversation module engine and that is operable to select a word or a phrase from the list of words and phrases from the plurality of active sub-grammars, based on the post processed natural language speech recognition result confidence scores;
wherein:
when the voting module engine selects a word or phrase from the plurality of active sub-grammars based on the post processed natural language speech recognition result confidence scores, the selected word or phrase is transmitted to an application control system that transmits control signals used to control an internal or an external device or an internal or an external process;
or when both of:
the voting module does not select a word or phrase from the plurality of active sub-grammars based on the post processed natural language speech recognition result confidence scores, and
the natural language confidence score exceeds a specified threshold,
then the master conversation module engine selects the natural language speech recognition result to be transmitted to the application control system;
where the natural language-based speech recognition system, the master conversation module engine, and the voting module engine are executed by one or more processors.
Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
16. A computer implemented method of automatically recognizing speech comprising:
capturing speech utterances and converting the speech utterances into frames of speech;
recognizing speech utterances by comparing the frames of speech to a list of words in active grammars;
generating a natural language-based speech recognition result and a natural language confidence score;
post processing the natural language-based speech recognition result by comparing the natural language-based speech recognition result to a domain specific vocabulary that comprises lists of words and phrases from sub-grammars that are mapped to domain specific words or phrases and generating a post process confidence score for each of the words and phrases from the sub-grammars, based on a likelihood that the words and phrases match the natural language-based recognition result; and
selecting a speech recognition result based on a voting scheme;
wherein:
a word or phrase from the sub-grammars that best matches the natural language speech recognition result based on the post process confidence scores, is selected as the speech recognition result;
or the natural language speech recognition result is selected as the speech recognition result when both of:
a word or phrase from the sub-grammars is not selected as the speech recognition result based on the post process confidence scores, and
the natural language confidence score exceeds a specified threshold.
Dependent Claims (17, 18)
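The first step of claim 16, converting captured speech into frames of speech, might be sketched as below. The claim does not state a frame length or overlap; the function name `frame_signal` and the defaults (400 samples with a 160-sample hop, which correspond to 25 ms and 10 ms at an assumed 16 kHz sampling rate) are illustrative assumptions only.

```python
from typing import List, Sequence


def frame_signal(samples: Sequence[float],
                 frame_len: int = 400,   # assumed: 25 ms at 16 kHz
                 hop: int = 160          # assumed: 10 ms at 16 kHz
                 ) -> List[List[float]]:
    """Split a sampled utterance into overlapping, fixed-length frames.

    Trailing samples that do not fill a whole frame are dropped.
    """
    if frame_len <= 0 or hop <= 0:
        raise ValueError("frame_len and hop must be positive")
    last_start = len(samples) - frame_len
    return [list(samples[start:start + frame_len])
            for start in range(0, last_start + 1, hop)]


# Usage: one second of placeholder samples at the assumed 16 kHz rate.
utterance = [0.0] * 16000
frames = frame_signal(utterance)
print(len(frames), len(frames[0]))
```

Each frame would then be passed to the recognizer that compares frames against the words in the active grammars; that comparison is outside the scope of this sketch.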
19. A method of automatically recognizing speech comprising:
capturing a speech utterance;
recognizing the speech utterance by comparing the speech utterance to a list of words in active grammars by a natural language-based speech recognition system;
generating automated speech recognition results and a confidence score from the natural language-based speech recognition system;
post processing the natural language-based speech recognition results by comparing the natural language speech recognition results to a domain specific vocabulary that comprises lists of words and phrases from sub-grammars mapped to one or more application control signals that control an internal or external device or process through an application engine; and
selecting a speech recognition post processing result based on a voting scheme, wherein:
a word or phrase from the sub-grammars that best matches the natural language speech recognition result based on the post processing comparisons is selected as the speech recognition post processing result;
or the natural language speech recognition result is selected as the speech recognition post processing result when both of:
a word or phrase from the sub-grammars is not selected as the speech recognition post processing result based on the post processing comparisons, and
the confidence score from the natural language-based speech recognition system exceeds a specified threshold.
Dependent Claims (20)
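Claim 19 maps sub-grammar words and phrases to application control signals. One hedged way to picture that mapping is a lookup table from phrase to signal. The `ControlSignal` type, the table contents, and the function `to_control_signal` are hypothetical names introduced here; the claim does not define a data format for control signals.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class ControlSignal:
    target: str   # hypothetical device or process identifier
    command: str  # hypothetical command name


# Assumed example table: sub-grammar phrases mapped to control signals.
CONTROL_MAP: Dict[str, ControlSignal] = {
    "call home": ControlSignal("phone", "DIAL_HOME"),
    "play music": ControlSignal("media_player", "PLAY"),
    "next track": ControlSignal("media_player", "NEXT"),
}


def to_control_signal(selected: Optional[str],
                      control_map: Dict[str, ControlSignal] = CONTROL_MAP
                      ) -> Optional[ControlSignal]:
    """Look up the control signal for a selected phrase.

    Returns None when nothing was selected or the phrase has no mapping.
    """
    if selected is None:
        return None
    return control_map.get(selected.strip().lower())


print(to_control_signal("Call Home"))
print(to_control_signal("open sunroof"))
```

A result that is the raw natural language recognition result rather than a sub-grammar phrase would generally have no entry in such a table, so a real application engine would need its own handling for that case.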
Specification