Methods and apparatus for hybrid speech recognition processing
First Claim
1. A mobile electronic device for use in a hybrid speech processing system comprising the mobile electronic device and a network-connected server remotely located from the mobile electronic device, the mobile electronic device comprising:
- an input interface configured to receive input audio comprising speech;
an embedded speech recognizer configured to perform speech recognition in a first language and process at least a portion of the input audio to produce first recognized text;
a controller configured to determine whether to send information to the server for speech processing, wherein the information includes the at least a portion of the input audio and/or at least a portion of the first recognized text, wherein;
the determination of whether to send the information to the server for speech processing is based, at least in part, on a semantic category associated with the first recognized text or on an analysis of the first recognized text, the analysis of the first recognized text comprising;
detecting at least one language that the speech may include based at least in part on the analysis of the first recognized text,determining whether the detected at least one language includes a second language different from the first language for which the embedded speech recognizer is configured to perform speech recognition, andin response to determining that the detected at least one language includes the second language different from the first language for which the embedded speech recognizer is configured to perform speech recognition, determining to send at least a portion of the speech to the server; and
a network interface configured to send the information to the server in response to determining that the information should be sent to the server.
3 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus for selectively performing speech processing in a hybrid speech processing system. The hybrid speech processing system includes at least one mobile electronic device and a network-connected server remotely located from the at least one mobile electronic device. The mobile electronic device is configured to use an embedded speech recognizer to process at least a portion of input audio to produce recognized text. A controller on the mobile electronic device determines whether to send information from the mobile electronic device to the server for speech processing. The determination of whether to send the information is based, at least in part, on an analysis of the input audio, the recognized text, or a semantic category associated with the recognized text.
194 Citations
14 Claims
-
1. A mobile electronic device for use in a hybrid speech processing system comprising the mobile electronic device and a network-connected server remotely located from the mobile electronic device, the mobile electronic device comprising:
-
an input interface configured to receive input audio comprising speech; an embedded speech recognizer configured to perform speech recognition in a first language and process at least a portion of the input audio to produce first recognized text; a controller configured to determine whether to send information to the server for speech processing, wherein the information includes the at least a portion of the input audio and/or at least a portion of the first recognized text, wherein; the determination of whether to send the information to the server for speech processing is based, at least in part, on a semantic category associated with the first recognized text or on an analysis of the first recognized text, the analysis of the first recognized text comprising; detecting at least one language that the speech may include based at least in part on the analysis of the first recognized text, determining whether the detected at least one language includes a second language different from the first language for which the embedded speech recognizer is configured to perform speech recognition, and in response to determining that the detected at least one language includes the second language different from the first language for which the embedded speech recognizer is configured to perform speech recognition, determining to send at least a portion of the speech to the server; and a network interface configured to send the information to the server in response to determining that the information should be sent to the server. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for use in a hybrid speech processing system comprising a mobile electronic device and a network-connected server remotely located from the mobile electronic device, the method comprising:
-
processing, by an embedded speech recognizer on the mobile electronic device and configured to perform speech recognition in a first language, at least a portion of input audio to produce recognized text; determining, by a controller, whether to send information from the mobile electronic device to the server for speech processing, wherein the information includes the at least a portion of the input audio and/or at least a portion of the recognized text, wherein; the determination of whether to send the information to the server for speech processing is based, at least in part, on a semantic category associated with the recognized text or on an analysis of the recognized text, the analysis of the recognized text comprising; detecting at least one language that the speech may include based at least in part on the analysis of the recognized text, determining whether the detected at least one language includes a second language different from the first language for which the embedded speech recognizer is configured to perform speech recognition, and in response to determining that the detected at least one language includes the second language different from the first language for which the embedded speech recognizer is configured to perform speech recognition, determining to send at least a portion of the speech to the server; and sending the information from the mobile electronic device to the server in response to determining that the information should be sent to the server. - View Dependent Claims (13, 14)
-
Specification