Configurable speech recognition system using multiple recognizers
First Claim
1. A client device for use in a client/server speech recognition system comprising the client device and a server device, wherein the server device includes a remote speech recognizer remote from the client device, the client device comprising:
- a user interface, the user interface comprising an input interface configured to receive input audio comprising speech, wherein the user interface is further configured to support interacting with a user using any of a plurality of different interaction techniques; and
at least one processor programmed to;
determine an availability status, at a point in time, of the remote speech recognizer for performing speech recognition on input audio received via the input interface; and
provide via the user interface, an indication of the availability status, wherein providing the indication of the availability status comprises;
selecting one of the plurality of different interaction techniques based, at least in part, on the availability status; and
providing the indication of the availability status using the selected one of the plurality of different interaction techniques.
3 Assignments
0 Petitions
Accused Products
Abstract
Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture. An indication of the availability of the remote speech recognition to perform speech recognition at a point in time may be provided to a user of the client device via a user interface of the client device.
43 Citations
17 Claims
-
1. A client device for use in a client/server speech recognition system comprising the client device and a server device, wherein the server device includes a remote speech recognizer remote from the client device, the client device comprising:
-
a user interface, the user interface comprising an input interface configured to receive input audio comprising speech, wherein the user interface is further configured to support interacting with a user using any of a plurality of different interaction techniques; and at least one processor programmed to; determine an availability status, at a point in time, of the remote speech recognizer for performing speech recognition on input audio received via the input interface; and provide via the user interface, an indication of the availability status, wherein providing the indication of the availability status comprises; selecting one of the plurality of different interaction techniques based, at least in part, on the availability status; and providing the indication of the availability status using the selected one of the plurality of different interaction techniques. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method of providing an indication of availability status of a server device in a client/server speech recognition system comprising a client device and the server device, wherein the server device includes a remote speech recognizer remote from the client device, the method comprising:
-
providing a user interface, the user interface comprising an input interface configured to receive input audio comprising speech, wherein the user interface is further configured to support interacting with a user using any of a plurality of different interaction techniques; determining an availability status of the remote speech recognizer for performing speech recognition on input audio received via the input interface; and providing via the user interface, an indication of the availability status, wherein providing the indication of the availability status comprises; selecting one of the plurality of different interaction techniques based, at least in part, on the availability status; and providing the indication of the availability status using the selected one of the plurality of different interaction techniques. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A non-transitory computer-readable storage medium encoded with a plurality of instructions that, when executed by at least one processor on a client device in a client/server speech recognition system comprising the client device and a server device, wherein the server device has a remote speech recognizer remote from the client device, perform a method comprising:
-
providing a user interface, the user interface comprising an input interface configured to receive input audio comprising speech, wherein the user interface is further configured to support interacting with a user using any of a plurality of different interaction techniques; determining an availability status of the remote speech recognizer for performing speech recognition on input audio received via the input interface; and providing via the user interface, an indication of the availability status, wherein providing the indication of the availability status comprises; selecting one of the plurality of different interaction techniques based, at least in part, on the availability status; and providing the indication of the availability status using the selected one of the plurality of different interaction techniques. - View Dependent Claims (14, 15, 16, 17)
-
Specification