System and method for distributed speech recognition with a cache feature
First Claim
1. A system for decoding speech to access services via a communications device, comprising:
- an input device for receiving speech input;
a feature extraction engine, the feature extraction engine extracting at least one feature from the speech input;
a local model store;
a first interface to a network, the network comprising a network model store, the network model store being configured to generate at least one service depending on the at least one feature extracted from the speech input; and
a processor, communicating with the input device, the feature extraction engine, the local model store and the first interface, the processor testing the at least one feature extracted from the speech input against the local model store to act upon a service request, the processor being configured to initiate a transmission of the at least one feature extracted from the speech input to the network via the first interface when no match is found between the local model store and the at least one feature extracted from the speech input.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention equips a cellular telephone or other communications device with improved voice recognition and command capability. A cellular handset may be equipped with a digital signal processing or other hardware to enhance speech detection and command decoding, but still be relatively constrained in terms of the amount of electronic memory or other storage available on the device, or the processing power or battery life offered by the device. In embodiments, the cellular handset or other device may perform a first-stage decoding of a voice or other command, for instance to perform a voice browsing function over the Internet or a directory. The handset may perform a look-up of the detected command or service against a local memory cache of stored commands, services and models and if a match is found, proceed directly to performing the desired service. If a match is not found in the device memory, the voice signal may be communicated to a server or other resource in the cellular or other network, for remote or distributed decoding of the command or action. When that service is returned to the handset, the service along with the associated model may be stored into electronic memory or other storage for future access, in caching fashion. A user'"'"'s most frequently used, or latest used, commands and services may be locally stored on the device, for instance, enabling prompt response times within those commands or services.
27 Citations
59 Claims
-
1. A system for decoding speech to access services via a communications device, comprising:
-
an input device for receiving speech input;
a feature extraction engine, the feature extraction engine extracting at least one feature from the speech input;
a local model store;
a first interface to a network, the network comprising a network model store, the network model store being configured to generate at least one service depending on the at least one feature extracted from the speech input; and
a processor, communicating with the input device, the feature extraction engine, the local model store and the first interface, the processor testing the at least one feature extracted from the speech input against the local model store to act upon a service request, the processor being configured to initiate a transmission of the at least one feature extracted from the speech input to the network via the first interface when no match is found between the local model store and the at least one feature extracted from the speech input. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A method for decoding speech to access services via a communications device, comprising:
-
receiving speech input;
extracting at least one feature from the speech input;
testing the at least one feature extracted from the speech input against a local model store in a communication device to act upon a service request; and
when no match if found between the local model store and the at least one feature extracted from the speech input transmitting the at least one feature extracted from the speech input via a first interface to a network, and generating a link to at least one service depending on the at least one feature extracted from the speech input. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A communications system for decoding speech to access services via a communications device, comprising:
-
an input device for receiving speech input;
a feature extraction engine, the feature extraction engine extracting at least one feature from the speech input;
a local model store;
a first interface to a network;
a network, the network comprising a network model store, the network model store being configured to generate at least one service depending on the at least one feature extracted from the speech input; and
a processor, communicating with the input device, the feature extraction engine, the local model store and the first interface, the processor testing the at least one feature extracted from the speech input against the local model store to act upon a service request, the processor being configured to initiate a transmission of the at least one feature extracted from the speech input to the network via the first interface when no match is found between the local model store and the at least one feature extracted from the speech input. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
-
-
39. A network system for decoding speech to access services inputted via a communications device, comprising:
-
a network model store, the network model store being configured to generate at least one service depending on at least one feature extracted from speech input to a communications device; and
a first interface to the communications device, the communications device comprising an input device for receiving the speech input, a feature extraction engine, the feature extraction engine extracting the at least one feature from the speech input, a local model store, and a processor, communicating with the input device, the feature extraction engine, the local model store and the first interface; and
a network processor, the network processor being configured to test the at least one feature extracted from the speech input against the network model store to act upon a service request, the network processor being configured to initiate a transmission of the at least one service to the communications device. - View Dependent Claims (40, 41, 42, 43, 44, 45, 46, 47, 48)
-
-
49. A system for decoding speech to access services via a communications device, comprising:
-
input means for receiving speech input;
feature extraction means, the feature extraction means extracting at least one feature from the speech input;
local model store means;
first interface means to a wireless network, the network comprising network model store means, the network model store means being configured to generate at least one service depending on the at least one feature extracted from the speech input; and
processor means, communicating with the input means, the feature extraction means, the local model store means and the first interface means, the processor means testing the at least one feature extracted from the speech input against the local model store means to act upon a service request, the processor means being configured to initiate a transmission of the at least one feature extracted from the speech input to the network via the first interface means when no match is found between the local model store means and the at least one feature extracted from the speech input. - View Dependent Claims (50, 51, 52, 53, 54, 55, 56, 57, 58, 59)
-
Specification