Distributed voice user interface
First Claim
Patent Images
1. A system for providing a distributed voice interface to a device, comprising:
- a transceiver configured to receive input from the device via a communication network, wherein the input is the result of preliminary signal processing comprising keyword detection by the device prior to receipt of the input at the transceiver;
a memory configured to store an acoustic model of the input; and
a processing module coupled to the transceiver and configured to perform speech recognition on the received input based at least in part on a previously stored acoustic model in order to recognize a command,wherein the transceiver is further configured to transmit data to the device, responsive to the command, via the communication network using communication channels comprising;
a high bandwidth communication channel configured to transmit data supporting audio or video output at the device, anda low bandwidth communication channel configured to transmit data supporting control signals for operation of a primary functionality component of the device, andwherein the data comprises audio data generated to be consistent with audio data generated by the device based on a type of the device.
4 Assignments
0 Petitions
Accused Products
Abstract
A distributed voice user interface system includes a local device which receives speech input issued from a user. Such speech input may specify a command or a request by the user. The local device performs preliminary processing of the speech input and determines whether it is able to respond to the command or request by itself. If not, the local device initiates communication with a remote system for further processing of the speech input.
-
Citations
25 Claims
-
1. A system for providing a distributed voice interface to a device, comprising:
-
a transceiver configured to receive input from the device via a communication network, wherein the input is the result of preliminary signal processing comprising keyword detection by the device prior to receipt of the input at the transceiver; a memory configured to store an acoustic model of the input; and a processing module coupled to the transceiver and configured to perform speech recognition on the received input based at least in part on a previously stored acoustic model in order to recognize a command, wherein the transceiver is further configured to transmit data to the device, responsive to the command, via the communication network using communication channels comprising; a high bandwidth communication channel configured to transmit data supporting audio or video output at the device, and a low bandwidth communication channel configured to transmit data supporting control signals for operation of a primary functionality component of the device, and wherein the data comprises audio data generated to be consistent with audio data generated by the device based on a type of the device. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for providing a distributed voice interface comprising:
-
receiving an audio input comprising results from preliminary signal processing, the preliminary signal processing comprising keyword detection on a speech input; storing an acoustic model of the audio input; performing speech recognition on the received audio input, based at least in part on a previously stored acoustic model in order to recognize a command; and transmitting data to a device over a network, responsive to the command, using communication channels comprising; a high bandwidth communication channel configured to transmit data supporting audio or video output at the device, and a low bandwidth communication channel configured to transmit data supporting control signals for operation of a primary functionality component of the device, wherein the data comprises audio data generated to be consistent with audio data generated by the device based on a type of the device. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer-readable medium having computer program logic recorded thereon, execution of which, by a computing device, causes the computing device to perform operations comprising:
-
receiving an audio input from a device via a communication network, the audio input based at least in part on speech input, wherein the audio input is the result of preliminary signal processing comprising keyword detection by the device prior to receipt of the audio input; performing speech recognition on the received audio input based at least in part on a previously stored acoustic model in order to recognize a command; and transmitting data to the device, responsive to the command, via the communication network using communication channels comprising; a high bandwidth communication channel configured to transmit data supporting audio or video output at the device, and a low bandwidth communication channel configured to transmit data supporting control signals for operation of a primary functionality component of the device, wherein the data comprises audio data generated to be consistent with audio data generated by the device based on a type of the device. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
-
22. A system for providing a distributed voice interface to a device, comprising:
-
transceiver means for receiving input from the device via a communication network, wherein the input is the result of preliminary signal processing comprising keyword detection by the device prior to receipt of the input at the transceiver means; memory means for storing an acoustic model of the input; and processing means for performing speech recognition on the received input based at least in part on a previously stored acoustic model in order to recognize a command, wherein the transceiver means are further for transmitting data to the device, responsive to the command, via the communication network using communication channels comprising; a high bandwidth communication channel configured to transmit data supporting audio or video output at the device, and a low bandwidth communication channel configured to transmit data supporting control signals for operation of a primary functionality component of the device, wherein the data comprises audio data generated to be consistent with audio data generated by the device based on a type of the device. - View Dependent Claims (23)
-
-
24. A system for providing a distributed voice interface to a device, comprising:
-
a communication module configured to receive input from the device via a communication network, wherein the input is the result of preliminary signal processing comprising keyword detection by the device prior to receipt of the input at the communication module; a memory module configured to store an acoustic model of the input; and a processing module coupled to the communication module and configured to perform speech recognition on the received input based at least in part on a previously stored acoustic model in order to recognize a command, wherein the communication module is further configured to transmit data to the device, responsive to the command, via the communication network using communication channels comprising; a high bandwidth communication channel configured to transmit data supporting audio or video output at the device, and a low bandwidth communication channel configured to transmit data supporting control signals for operation of a primary functionality component of the device, wherein the data comprises audio data generated to be consistent with audio data generated by the device based on a type of the device. - View Dependent Claims (25)
-
Specification