Configurable speech recognition system using multiple recognizers
First Claim
1. A method of performing speech recognition in a distributed speech recognition system comprising an electronic device having an embedded speech recognizer and a network device having a remote speech recognizer remote from the electronic device, the method comprising:
- receiving, by the electronic device, input audio uninterrupted by one or more prompts output from the electronic device, wherein the input audio comprises input speech;
identifying multiple types of information in the input speech;
determining whether speech recognition by the remote speech recognizer is desired, wherein the determining is based, at least in part, on the identified types of information in the input speech; and
in response to determining that speech recognition by the remote speech recognizer is desired, processing a first portion of the input speech by the embedded speech recognizer and sending a second portion of the input speech to the network device for recognition by the remote speech recognizer.
3 Assignments
0 Petitions
Accused Products
Abstract
Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.
-
Citations
20 Claims
-
1. A method of performing speech recognition in a distributed speech recognition system comprising an electronic device having an embedded speech recognizer and a network device having a remote speech recognizer remote from the electronic device, the method comprising:
-
receiving, by the electronic device, input audio uninterrupted by one or more prompts output from the electronic device, wherein the input audio comprises input speech; identifying multiple types of information in the input speech; determining whether speech recognition by the remote speech recognizer is desired, wherein the determining is based, at least in part, on the identified types of information in the input speech; and in response to determining that speech recognition by the remote speech recognizer is desired, processing a first portion of the input speech by the embedded speech recognizer and sending a second portion of the input speech to the network device for recognition by the remote speech recognizer. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A non-transitory computer-readable storage medium encoded with a plurality of instructions that, when executed by at least one processor on an electronic device in a distributed speech recognition system comprising the electronic device having an embedded speech recognizer and a network device having a remote speech recognizer remote from the electronic device, perform a method comprising:
-
receiving, by the electronic device, input audio uninterrupted by one or more prompts output from the electronic device, wherein the input audio comprises input speech; identifying multiple types of information in the input speech; determining whether speech recognition by the remote speech recognizer is desired, wherein the determining is based, at least in part, on the identified types of information in the input speech; and in response to determining that speech recognition by the remote speech recognizer is desired, processing a first portion of the input speech by the embedded speech recognizer and sending a second portion of the input speech to the network device for recognition by the remote speech recognizer. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. An electronic device for use in a distributed speech recognition system comprising the electronic device and a network device remote from the electronic device, the electronic device, comprising:
-
an embedded speech recognizer configured to receive input audio uninterrupted by one or more prompts output from the electronic device, wherein the input audio comprises input speech; and at least one processor programmed to; identify multiple types of information in the input speech; determine whether speech recognition by the remote speech recognizer is desired, wherein the determining is based, at least in part, on the identified types of information in the input speech; and in response to determining that speech recognition by the remote speech recognizer is desired, process a first portion of the input speech by the embedded speech recognizer and send a second portion of the input speech to the network device for recognition by the remote speech recognizer. - View Dependent Claims (17, 18, 19, 20)
-
Specification