System and method for speech recognition services
First Claim
Patent Images
1. A method of processing speech data received from a mobile device, the method comprising:
- receiving at a speech server a speech request from a mobile device to transmit an audio segment;
notifying a session object communicating with the mobile device regarding the arrival of the audio segment;
generating from the session object a handler to process the audio segment, the handler acquiring a decoder proxy for the audio segment from a decoder proxy cache;
obtaining an automatic speech recognition (ASR) decoder result associated with the audio segment, the ASR decoder result being passed to the decoder proxy;
communicating a recognized phrase associated with the ASR decoder result or a failure code from the decoder proxy to the handler; and
issuing from the handler a query to a web server using the ASR decoder result.
5 Assignments
0 Petitions
Accused Products
Abstract
A digital speech enabled middleware module is disclosed that facilitates interaction between a large number of client devices and network-based automatic speech recognition (ASR) resources. The module buffers feature vectors associated with speech received from the client devices when the number of client devices is greater than the available ASR resources. When an ASR decoder becomes available, the module transmits the feature vectors to the ASR decoder and a recognition result is returned.
-
Citations
28 Claims
-
1. A method of processing speech data received from a mobile device, the method comprising:
-
receiving at a speech server a speech request from a mobile device to transmit an audio segment; notifying a session object communicating with the mobile device regarding the arrival of the audio segment; generating from the session object a handler to process the audio segment, the handler acquiring a decoder proxy for the audio segment from a decoder proxy cache; obtaining an automatic speech recognition (ASR) decoder result associated with the audio segment, the ASR decoder result being passed to the decoder proxy; communicating a recognized phrase associated with the ASR decoder result or a failure code from the decoder proxy to the handler; and issuing from the handler a query to a web server using the ASR decoder result. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method of processing speech data received from a mobile device, the method comprising:
-
receiving at a server dispatcher a speech request from a mobile device to transmit an audio segment; notifying, by the server dispatcher, a session object communicating with the mobile device regarding the arrival of the audio segment; and generating by the session object a handler to process the audio segment, the handler attempting to acquiring a decoder proxy for processing the audio segment from a decoder proxy cache, wherein if the handler is successful in its attempt to acquire the decoder proxy, then the handler streams computed cepstrum vectors to a decoding process; and if the handler is not successful in its attempt acquire the decoder proxy, then the handler buffers the computer cepstrum vectors and transmits them as soon as a decoder proxy becomes available. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A system for managing speech segments received from a plurality of mobile devices, the system comprising:
-
a server dispatcher that detects and routes all system I/O events; a decoder proxy cache containing a plurality of decoder proxies that are local representations of decoder processes; a session object that processes communications from each of the plurality of mobile devices, the session object being notified by the dispatcher when one of the plurality of mobile devices has initiated a speech request; a handler initiated by the session object, the handler acquiring one of the plurality of decoder proxies to process a speech segment associated with the speech request; and a decoder processing module that receives the speech segment from the acquired decoder proxy and returns an automatic speech recognition (ASR) result to the acquired decoder proxy, wherein the decoder proxy transmits the ASR result to the handler for use in querying an application server, whereupon the handler passes control of further speech requests from the plurality of mobile devices back to the server dispatcher. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. A method of recognizing speech from at least one mobile device, the method comprising:
-
receiving at a speech server a speech request from a mobile device to transmit an audio segment; initiating a handler to process the audio segment, the handler applying acoustic algorithms in feature space to the audio segment; acquiring a decoder proxy for the audio segment from a decoder proxy cache; obtaining an automatic speed recognition (ASR) decoder result associated with the audio segment, the ASR decoder result being passed to the handler via the decoder proxy; and issuing from the handler a query to a web server using the ASR decoder result. - View Dependent Claims (28)
-
Specification