Distributed voice user interface
First Claim
Patent Images
1. A local device comprising:
- a primary functionality component;
an input component configured to receive speech input;
a processing component coupled to the input component, the processing component configured to;
identify keywords in the speech input,determine whether the local device is capable of processing the speech input based on whether one or more keywords are identified in the speech input,if the local device is capable of processing the speech input, process the speech input, generate corresponding local control signals, and transmit the local control signals to the primary functionality component to direct an action in the primary functionality component, andif the local device is not capable of processing the speech input, extract feature parameters from the speech input for processing at a remote system, receive remote control signals from the remote system responsive to the remote system performing speech recognition on the feature parameters by storing an acoustic model of the feature parameters and recognizing a command based on a previously stored acoustic model associated with the local device to address specific characteristics of the feature parameters, and send the remote control signals to the primary functionality component; and
a transceiver coupled to the processing component and configured to establish communications between the local device and the remote system, wherein the communications comprise;
high bandwidth communications configured to return data supporting audio or video output at the local device, andlow bandwidth communications configured to return data supporting the remote control signals.
5 Assignments
0 Petitions
Accused Products
Abstract
A distributed voice user interface system includes a local device which receives speech input issued from a user. Such speech input may specify a command or a request by the user. The local device performs preliminary processing of the speech input and determines whether it is able to respond to the command or request by itself. If not, the local device initiates communication with a remote system for further processing of the speech input.
119 Citations
41 Claims
-
1. A local device comprising:
-
a primary functionality component; an input component configured to receive speech input; a processing component coupled to the input component, the processing component configured to; identify keywords in the speech input, determine whether the local device is capable of processing the speech input based on whether one or more keywords are identified in the speech input, if the local device is capable of processing the speech input, process the speech input, generate corresponding local control signals, and transmit the local control signals to the primary functionality component to direct an action in the primary functionality component, and if the local device is not capable of processing the speech input, extract feature parameters from the speech input for processing at a remote system, receive remote control signals from the remote system responsive to the remote system performing speech recognition on the feature parameters by storing an acoustic model of the feature parameters and recognizing a command based on a previously stored acoustic model associated with the local device to address specific characteristics of the feature parameters, and send the remote control signals to the primary functionality component; and a transceiver coupled to the processing component and configured to establish communications between the local device and the remote system, wherein the communications comprise; high bandwidth communications configured to return data supporting audio or video output at the local device, and low bandwidth communications configured to return data supporting the remote control signals. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method of providing a voice interface for a primary functionality component in a local device, comprising:
-
receiving, at the local device, a speech input; identifying keywords in the speech input; establishing communications between the local device and a remote system, wherein the communications comprise; high bandwidth communications configured to return data supporting audio or video output at the local device, and low bandwidth communications configured to return data supporting the remote control signals; determining, at the local device, whether the local device is capable of processing the speech input based on whether one or more keywords are identified in the speech input; if the local device is capable of processing the speech input, then processing the speech input at the local device, generating corresponding local control signals, and transmitting the local control signals to the primary functionality component to direct an action in the primary functionality component; and if the local device is not capable of processing the speech input, then extracting feature parameters from the speech input for processing at the remote system, sending the feature parameters to the remote system for processing by storing an acoustic model of the feature parameters and recognizing a command based on a previously stored acoustic model associated with the local device to address specific characteristics of the feature parameters, receiving remote control signals from the remote system responsive to the feature parameters via the low bandwidth communications, and sending the remote control signals to the primary functionality component. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A tangible computer readable medium having stored thereon computer-executable instructions that, if executed by a computing device, cause the computing device to perform a method comprising:
-
receiving, at the local device, a speech input; identifying keywords in the speech input; establishing communications between the local device and a remote system, wherein the communications comprise; high bandwidth communications configured to return data supporting audio or video output at the local device, and low bandwidth communications configured to return data supporting the remote control signals; determining, at the local device, whether the local device is capable of processing the speech input based on whether one or more keywords are identified in the speech input; if the local device is capable of processing the speech input, then processing the speech input at the local device, generating corresponding local control signals, and transmitting the local control signals to the primary functionality component to direct an action in the primary functionality component; and if the local device is not capable of processing the speech input, then extracting feature parameters from the speech input for processing at the remote system, sending the feature parameters to the remote system for processing by storing an acoustic model of the feature parameters and recognizing a command based on a previously stored acoustic model associated with the local device to address specific characteristics of the feature parameter, receiving remote control signals from the remote system responsive to the feature parameters via the low bandwidth communications, and sending the remote control signals to the primary functionality component. - View Dependent Claims (26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36)
-
-
37. A local device comprising:
-
receiving means to receive a speech input; identifying means to identify keywords in the speech input; establishing means to establish communications between the local device and a remote system, wherein the communications comprise; high bandwidth communications configured to return data supporting audio or video output at the local device, and low bandwidth communications configured to return data supporting the remote control signals; determining means to determine whether a local device is capable of processing the speech input based on whether one or more keywords are identified in the speech input; first responding means to respond to the speech input if the determining means determines that the local device is capable of processing the speech input, comprising; processing means to process the speech input at the local device, generating means to generate corresponding local control signals, and transmitting means to transmit the local control signals to a primary functionality component to direct an action in the primary functionality component; and second responding means to respond to the speech input if the determining means determines that the local device is not capable of processing the speech input, comprising; extracting means to extract feature parameters from the speech input for processing at the remote system, first sending means to send the feature parameters to the remote system for processing by storing an acoustic model of the feature parameters and recognizing a command based on a previously stored acoustic model associated with the local device to address specific characteristics of the feature parameter, receiving means to receive remote control signals from the remote system responsive to the feature parameters via the low bandwidth communications, and second sending means to send the remote control signals to the primary functionality component.
-
-
38. A local device comprising:
-
a primary functionality component; an input component configured to receive speech input; a processing component coupled to the input component, the processing component configured to; identify keywords in the speech input, determine whether the local device is capable of processing the speech input based on whether one or more keywords are identified in the speech input, if the local device is capable of processing the speech input, process the speech input, generate corresponding local control signals, and transmit the local control signals to the primary functionality component to direct an action in the primary functionality component, and if the local device is not capable of processing the speech input, extract feature parameters from the speech input for processing at a remote system, receive remote control signals from the remote system responsive to the remote system performing speech recognition on the feature parameters by storing an acoustic model of the feature parameters and recognizing a command based on a previously stored acoustic model associated with the local device to address specific characteristics of the feature parameters, and send the remote control signals to the primary functionality component; and a recording component configured to record the speech input and to play back the recorded speech input for transmission to the remote system.
-
-
39. A method of providing a voice interface for a primary functionality component in a local device, comprising:
-
receiving, at the local device, a speech input; identifying keywords in the speech input; determining, at the local device, whether the local device is capable of processing the speech input based on whether one or more keywords are identified in the speech input; if the local device is capable of processing the speech input, then processing the speech input at the local device, generating corresponding local control signals, and transmitting the local control signals to the primary functionality component to direct an action in the primary functionality component; if the local device is not capable of processing the speech input, then extracting feature parameters from the speech input for processing at a remote system, sending the feature parameters to the remote system for processing by storing an acoustic model of the feature parameters and recognizing a command based on a previously stored acoustic model associated with the local device to address specific characteristics of the feature parameters, receiving remote control signals from the remote system responsive to the feature parameters, and sending the remote control signals to the primary functionality component; recording the speech input; and playing back the recorded speech input for transmission to the remote system.
-
-
40. A tangible computer readable medium having stored thereon computer-executable instructions that, if executed by a computing device, cause the computing device to perform a method comprising:
-
receiving, at the local device, a speech input; identifying keywords in the speech input; determining, at the local device, whether the local device is capable of processing the speech input based on whether one or more keywords are identified in the speech input; if the local device is capable of processing the speech input, then processing the speech input at the local device, generating corresponding local control signals, and transmitting the local control signals to the primary functionality component to direct an action in the primary functionality component; if the local device is not capable of processing the speech input, then extracting feature parameters from the speech input for processing at a remote system, sending the feature parameters to the remote system for processing by storing an acoustic model of the feature parameters and recognizing a command based on a previously stored acoustic model associated with the local device to address specific characteristics of the feature parameter, receiving remote control signals from the remote system responsive to the feature parameters, and sending the remote control signals to the primary functionality component; recording the speech input; and playing back the recorded speech input for transmission to the remote system.
-
-
41. A local device comprising:
-
receiving means to receive a speech input; identifying means to identify keywords in the speech input; determining means to determine whether a local device is capable of processing the speech input based on whether one or more keywords are identified in the speech input; first responding means to respond to the speech input if the determining means determines that the local device is capable of processing the speech input, comprising; processing means to process the speech input at the local device, generating means to generate corresponding local control signals, and transmitting means to transmit the local control signals to a primary functionality component to direct an action in the primary functionality component; second responding means to respond to the speech input if the determining means determines that the local device is not capable of processing the speech input, comprising; extracting means to extract feature parameters from the speech input for processing at a remote system, first sending means to send the feature parameters to the remote system for processing by storing an acoustic model of the feature parameters and recognizing a command based on a previously stored acoustic model associated with the local device to address specific characteristics of the feature parameter, receiving means to receive remote control signals from the remote system responsive to the feature parameters, and second sending means to send the remote control signals to the primary functionality component; recording means to record the speech input; and playback means to play back the recorded speech input for transmission to the remote system.
-
Specification