Allocation of speech recognition tasks and combination of results thereof
First Claim
1. A system for using multiple speech recognizers, the system comprising:
- an allocation determination mechanism to determine an allocation of speech recognition tasks among multiple speech recognizers based on a complexity of a speech, wherein the multiple speech recognizers include a mobile-based speech recognizer on a mobile device and a server-based speech recognizer on a server, wherein said allocation determination mechanism is to use a threshold set on a vocabulary size to determine the complexity level of the speech,
- a task allocation mechanism to allocate the speech recognition tasks to both the mobile-device-based speech recognizer and the server-based speech recognizer based on a determination by the allocation determination mechanism; and
- a combination mechanism to receive results from the multiple speech recognizers and combine the results into a single result, wherein the results from each of the multiple speech recognizers include recognized words and a confidence score for each of the recognized words, and wherein, to combine the results, the combination mechanism is to compare the results from the multiple speech recognizers on a word-to-word basis and select a word from one of the multiple speech recognizers as a recognized word for the single result based on the confidence score of that word.
Abstract
A system, method, computer-readable medium, and computer-implemented system for optimizing allocation of speech recognition tasks among multiple speech recognizers and combining recognizer results are described. An allocation determination is performed to allocate speech recognition among multiple speech recognizers using at least one of an accuracy-based allocation mechanism, a complexity-based allocation mechanism, and an availability-based allocation mechanism. The speech recognition is allocated among the speech recognizers based on the determined allocation. Recognizer results received from multiple speech recognizers in accordance with the speech recognition task allocation are combined.
19 Claims
1. A system for using multiple speech recognizers, the system comprising:
- an allocation determination mechanism to determine an allocation of speech recognition tasks among multiple speech recognizers based on a complexity of a speech, wherein the multiple speech recognizers include a mobile-based speech recognizer on a mobile device and a server-based speech recognizer on a server, wherein said allocation determination mechanism is to use a threshold set on a vocabulary size to determine the complexity level of the speech,
- a task allocation mechanism to allocate the speech recognition tasks to both the mobile-device-based speech recognizer and the server-based speech recognizer based on a determination by the allocation determination mechanism; and
- a combination mechanism to receive results from the multiple speech recognizers and combine the results into a single result, wherein the results from each of the multiple speech recognizers include recognized words and a confidence score for each of the recognized words, and wherein, to combine the results, the combination mechanism is to compare the results from the multiple speech recognizers on a word-to-word basis and select a word from one of the multiple speech recognizers as a recognized word for the single result based on the confidence score of that word.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
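As a rough illustration of the allocation step recited in claim 1, the sketch below compares each task's vocabulary size with a threshold and routes the task accordingly. It is not the patented implementation. The names `RecognitionTask`, `is_complex`, and `allocate_tasks` are hypothetical, and the routing rule (large vocabularies to the server, small ones to the mobile device) is an assumption; the claim only says that a threshold on vocabulary size is used to determine the complexity level.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class RecognitionTask:
    """Hypothetical task record; the claim does not define this structure."""
    name: str
    vocabulary_size: int  # number of words the task must distinguish


def is_complex(task: RecognitionTask, vocabulary_threshold: int) -> bool:
    """Treat a task as complex when its vocabulary exceeds the threshold."""
    return task.vocabulary_size > vocabulary_threshold


def allocate_tasks(
    tasks: List[RecognitionTask], vocabulary_threshold: int
) -> Dict[str, List[RecognitionTask]]:
    """Split tasks between the two recognizers.

    Assumption: complex tasks go to the server-based recognizer and the
    rest go to the mobile-device-based recognizer.
    """
    allocation: Dict[str, List[RecognitionTask]] = {"mobile": [], "server": []}
    for task in tasks:
        target = "server" if is_complex(task, vocabulary_threshold) else "mobile"
        allocation[target].append(task)
    return allocation


tasks = [RecognitionTask("digits", 10), RecognitionTask("dictation", 50000)]
plan = allocate_tasks(tasks, vocabulary_threshold=1000)
print({name: [t.name for t in assigned] for name, assigned in plan.items()})
```

With a mix of small and large vocabularies, both recognizers receive work, which is one possible reading of allocating tasks "to both" recognizers.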
10. A method of using multiple speech recognizers, said method comprising:
- determining an allocation of speech recognition tasks among the multiple speech recognizers based on a complexity level of a speech with respect to a threshold, wherein the threshold is based on a vocabulary size, and wherein the multiple speech recognizers include a mobile-device-based speech recognizer on a mobile device and a server-based speech recognizer on a server;
- allocating the speech recognition tasks to both the mobile-device-based speech recognizer and the server-based speech recognizer based on the determined allocation;
- receiving results from the mobile-device-based speech recognizer and the server-based speech recognizer, wherein the results from each of the speech recognizers include recognized words and a confidence score for each of the recognized words; and
- combining the results to generate a single result, including comparing the results from the mobile-device-based speech recognizer and the results from the server-based speech recognizer on a word-to-word basis, and selecting a word from the mobile-device-based speech recognizer or a word from the server-based speech recognizer as a recognized word for the single result based on the confidence score of that word.
Dependent claims: 11, 12, 13
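The combining step of claim 10 can be sketched as a word-by-word comparison that keeps whichever candidate carries the higher confidence score. This is only a minimal illustration under stated assumptions: `RecognizedWord` and `combine_results` are hypothetical names, the two result lists are assumed to be already aligned position by position, confidence is assumed to be a float where higher is better, and ties are resolved in favor of the mobile result. None of these details is specified in the claim.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RecognizedWord:
    """Hypothetical result item: a recognized word and its confidence score."""
    text: str
    confidence: float  # assumed scale: higher means more confident


def combine_results(
    mobile: List[RecognizedWord], server: List[RecognizedWord]
) -> List[str]:
    """Compare aligned results word by word and keep the more confident word.

    Assumption: both lists have the same length and position i in one list
    corresponds to position i in the other.
    """
    if len(mobile) != len(server):
        raise ValueError("results must be aligned word for word")
    combined: List[str] = []
    for mobile_word, server_word in zip(mobile, server):
        # Ties go to the mobile recognizer (an arbitrary choice for this sketch).
        if mobile_word.confidence >= server_word.confidence:
            combined.append(mobile_word.text)
        else:
            combined.append(server_word.text)
    return combined


mobile_result = [
    RecognizedWord("call", 0.9),
    RecognizedWord("john", 0.4),
    RecognizedWord("now", 0.8),
]
server_result = [
    RecognizedWord("tall", 0.5),
    RecognizedWord("joan", 0.7),
    RecognizedWord("now", 0.9),
]
print(" ".join(combine_results(mobile_result, server_result)))  # call joan now
```

Real recognizer outputs may differ in length, in which case an alignment step would be needed before the comparison; that step is outside what this sketch covers.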
14. A non-transitory computer-readable medium, on which is stored machine executable instructions which when executed by a processor cause the processor to:
- determine an allocation of speech recognition tasks among multiple speech recognizers based on a complexity of a speech with respect to a threshold, wherein the threshold is based on a vocabulary size and wherein the multiple speech recognizers include a mobile-device-based speech recognizer on a mobile device and a server-based speech recognizer on a server;
- allocate the speech recognition tasks to both the mobile-device-based speech recognizer and the server-based speech recognizer based on the determined allocation;
- receive results from the mobile-device-based speech recognizer and the server-based speech recognizer, wherein the results from each of the speech recognizers include recognized words and a confidence score for each of the recognized words; and
- combine the results to generate a single result, including compare the results from the mobile-device-based speech recognizer and the results from the server-based speech recognizer on a word-to-word basis, and select a word from the mobile-device-based speech recognizer or a word from the server-based speech recognizer as a recognized word for the single result based on the confidence score of that word.
Dependent claims: 15, 16
17. A computer-implemented system for allocating speech recognition tasks among multiple speech recognizers, the system comprising:
- a processor; and
- a memory coupled to the processor, the memory having stored therein instructions causing the processor to;
- determine an allocation of the speech recognition tasks among multiple speech recognizers based on a complexity of a speech with respect to a threshold, wherein the threshold is based on a vocabulary size, and wherein the multiple speech recognizers include a mobile-based speech recognizer on a mobile device and a server-based speech recognizer on a server;
- allocate the speech recognition tasks to both the mobile-device-based speech recognizer and the server-based speech recognizer based on the determined allocation, and
- receive results from the mobile-device-based speech recognizer and the server-based speech recognizer, wherein the results from each of the speech recognizers include recognized words and a confidence score for each of the recognized words;
- combine the results to generate a single result, including compare the results from the mobile-device-based speech recognizer and the results from the server-based speech recognizer on a word-to-word basis, and select a word from the mobile-device-based speech recognizer or a word from the server-based speech recognizer as a recognized word for the single result based on the confidence score of that word.
Dependent claims: 18, 19
Specification