System and method for performing dual mode speech recognition
First Claim
1. A method for performing dual mode speech recognition, comprising:
- receiving at a device a query from a user;
sending the query to a first recognition system;
sending the query to a second recognition system;
receiving at least a first recognition result from either the first recognition system or the second recognition system;
producing a final result considering the first recognition result; and
setting a latency timer to a timeout value,wherein the first recognition system maintains a first vocabulary and the second recognition system maintains a second vocabulary, and whereby the final result is produced at or before the time that the latency timer reaches the timeout value.
9 Assignments
0 Petitions
Accused Products
Abstract
A system and method is presented for performing dual mode speech recognition, employing a local recognition module on a mobile device and a remote recognition engine on a server device. The system accepts a spoken query from a user, and both the local recognition module and the remote recognition engine perform speech recognition operations on the query, returning a transcription and confidence score, subject to a latency cutoff time. If both sources successfully transcribe the query, then the system accepts the result having the higher confidence score. If only one source succeeds, then that result is accepted. In either case, if the remote recognition engine does succeed in transcribing the query, then a client vocabulary is updated if the remote system result includes information not present in the client vocabulary.
24 Citations
15 Claims
-
1. A method for performing dual mode speech recognition, comprising:
-
receiving at a device a query from a user; sending the query to a first recognition system; sending the query to a second recognition system; receiving at least a first recognition result from either the first recognition system or the second recognition system; producing a final result considering the first recognition result; and setting a latency timer to a timeout value, wherein the first recognition system maintains a first vocabulary and the second recognition system maintains a second vocabulary, and whereby the final result is produced at or before the time that the latency timer reaches the timeout value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A client for dual mode speech recognition, the client comprising:
-
an interface enabled to receive a query from a user; a communication module enabled to send the query to a server and receive a remote recognition result from a server; a local recognition module enabled to create a local recognition result from the query; a latency timer; a control module enabled to receive a notification from the latency timer and to select between the local recognition result and the remote recognition result; and a client vocabulary enabled to describe words or phrases available to the local recognition module. - View Dependent Claims (12, 13, 14)
-
-
15. A server for dual mode speech recognition, the server comprising:
-
a recognition engine enabled to create a recognition result from audio content; a communication module enabled to receive a query from a client and send the recognition result to the client; and a vocabulary download module enabled to respond to requests from the client to send updates to a client vocabulary.
-
Specification