System and method for speech recognition system
Abstract
A digital speech enabled middleware module is disclosed that facilitates interaction between a large number of client devices and network-based automatic speech recognition (ASR) resources. The module buffers feature vectors associated with speech received from the client devices when the number of client devices is greater than the available ASR resources. When an ASR decoder becomes available, the module transmits the feature vectors to the ASR decoder and a recognition result is returned.
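The buffering behavior described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent: the class name `BufferingMiddleware`, the methods `submit` and `complete`, and the `toy_decoder` function are hypothetical, and decoding is modeled as a plain function call rather than a network request to an ASR resource.

```python
from collections import deque
from typing import Callable, Deque, Dict, List, Tuple

FeatureVectors = List[List[float]]           # one inner list per speech frame
Decoder = Callable[[FeatureVectors], str]    # hypothetical stand-in for an ASR decoder


class BufferingMiddleware:
    """Hypothetical sketch: hold feature vectors until an ASR decoder is free."""

    def __init__(self, decoders: List[Decoder]) -> None:
        self.idle: Deque[Decoder] = deque(decoders)
        self.active: Dict[str, Tuple[Decoder, FeatureVectors]] = {}
        self.buffered: Deque[Tuple[str, FeatureVectors]] = deque()

    def submit(self, client_id: str, vectors: FeatureVectors) -> bool:
        """Return True if a decoder was assigned, False if the vectors were buffered."""
        if self.idle:
            self.active[client_id] = (self.idle.popleft(), vectors)
            return True
        self.buffered.append((client_id, vectors))
        return False

    def complete(self, client_id: str) -> str:
        """Return the recognition result and pass the freed decoder to the oldest buffered client."""
        decoder, vectors = self.active.pop(client_id)
        result = decoder(vectors)
        if self.buffered:
            next_id, next_vectors = self.buffered.popleft()
            self.active[next_id] = (decoder, next_vectors)
        else:
            self.idle.append(decoder)
        return result


def toy_decoder(vectors: FeatureVectors) -> str:
    # Assumption for illustration only: a real decoder would return recognized text.
    return f"{len(vectors)} frames decoded"
```

With one decoder and two clients, the second client's vectors wait in `buffered` until `complete` is called for the first client, at which point the freed decoder is assigned to them.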
20 Claims
1. A method of processing speech data, the method comprising, at a speech server:
   receiving an audio segment;
   acquiring a decoder proxy for the audio segment from a decoder proxy cache;
   passing to the decoder proxy a recognized phrase generated by performing speech recognition on the audio segment; and
   issuing a query based on the recognized phrase.
   (Dependent claims: 2, 3, 4, 5, 6, 7)
8. A method of processing speech data, the method comprising, at a speech server:
   receiving an audio segment;
   processing the audio segment via a decoder proxy acquired from a decoder proxy cache;
   if the decoder proxy is successfully acquired, streaming computed cepstrum vectors to a decoding process; and
   if the decoder proxy is not successfully acquired, buffering the computed cepstrum vectors and transmitting the computed cepstrum vectors when the decoder proxy is acquired.
   (Dependent claims: 9, 10, 11, 12, 13, 14, 15, 16)
17. A system for managing speech segments received from a plurality of user devices, the system comprising:
   a processor;
   a server dispatcher module for controlling the processor to detect and route system I/O events;
   a decoder proxy cache containing a plurality of decoder proxies, each decoder proxy representing a decoder process;
   a handler module controlling the processor to acquire a decoder proxy for a speech segment from the decoder proxy cache; and
   a decoder processing module for controlling the processor to receive the speech segment from the decoder proxy and return the automatic speech recognition result to the decoder proxy,
   wherein the handler module further controls the processor to query an application server based on the automatic speech recognition result.
   (Dependent claims: 18, 19, 20)
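The flow recited in the independent claims (acquire a decoder proxy from a cache, obtain a recognized phrase through it, then issue a query based on that phrase) can be sketched in Python as follows. This is an illustrative reading only, not the claimed implementation: `DecoderProxy`, `DecoderProxyCache`, `handle_audio_segment`, and the `query_app` callback are hypothetical names, and the decoder process is modeled as a local function.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class DecoderProxy:
    """Hypothetical stand-in for one decoder process."""
    recognize: Callable[[bytes], str]   # assumed interface to the decoder process
    phrase: Optional[str] = None        # recognized phrase passed back to the proxy


class DecoderProxyCache:
    """Hypothetical cache holding the proxies that are currently free."""

    def __init__(self, proxies: List[DecoderProxy]) -> None:
        self._free = list(proxies)

    def acquire(self) -> Optional[DecoderProxy]:
        # Returns None when every proxy is in use.
        return self._free.pop() if self._free else None

    def release(self, proxy: DecoderProxy) -> None:
        proxy.phrase = None
        self._free.append(proxy)


def handle_audio_segment(
    audio: bytes,
    cache: DecoderProxyCache,
    query_app: Callable[[str], str],
) -> Optional[str]:
    """Acquire a proxy, obtain the recognized phrase, and query the application."""
    proxy = cache.acquire()
    if proxy is None:
        return None  # no proxy available; a caller could buffer the segment and retry
    try:
        proxy.phrase = proxy.recognize(audio)   # recognized phrase is handed to the proxy
        return query_app(proxy.phrase)          # query based on the recognized phrase
    finally:
        cache.release(proxy)                    # proxy goes back to the cache either way
```

Returning `None` when no proxy is free leaves the buffering policy to the caller, which keeps the sketch independent of how pending segments are stored.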
Specification