System and Method for Performing Dual Mode Speech Recognition
First Claim
1. A method comprising:
- transmitting a spoken query to a remote recognition system;
obtaining a server transcription from the remote recognition system; and
responsive to determining that the server transcription contains a word that is missing from a local vocabulary, issuing a request to the server to send a description of the missing word.
9 Assignments
0 Petitions
Accused Products
Abstract
A system and method is presented for performing dual mode speech recognition, employing a local recognition module on a mobile device and a remote recognition engine on a server device. The system accepts a spoken query from a user, and both the local recognition module and the remote recognition engine perform speech recognition operations on the query, returning a transcription and confidence score, subject to a latency cutoff time. If both sources successfully transcribe the query, then the system accepts the result having the higher confidence score. If only one source succeeds, then that result is accepted. In either case, if the remote recognition engine does succeed in transcribing the query, then a client vocabulary is updated if the remote system result includes information not present in the client vocabulary.
25 Citations
17 Claims
-
1. A method comprising:
-
transmitting a spoken query to a remote recognition system; obtaining a server transcription from the remote recognition system; and responsive to determining that the server transcription contains a word that is missing from a local vocabulary, issuing a request to the server to send a description of the missing word. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method comprising:
-
receiving a request from a client device to send a description of a missing word; determining a topic to which the missing word relates; and sending to the client a set of words that relate to the topic and descriptions of the words.
-
-
10. A non-transitory computer readable medium comprising code that, when executed by one or more processors, causes the one or more processors to:
-
transmit a spoken query to a remote recognition system; obtain a server transcription from the remote recognition system; and responsive to determining that the server transcription contains a word that is missing from a local vocabulary, issue a request to the server to send a description of the missing word. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
Specification