Use of local voice input and remote voice processing to control a local visual display
Abstract
A user uses voice commands to modify the contents of a visual display through an audio input device where the audio input device does not necessarily have speech recognition capabilities. The audio input device, such as a telephone, captures audio including spoken voice commands from a user and transmits the audio to a remote system. The remote system is configured to use automated speech recognition to recognize the voice commands. The recognized commands are interpreted by the remote system to respond to the user by transmitting data to be displayed on the visual display. The visual display can be integrated with the audio input device, such as in a web-enabled mobile phone, a video phone or an internet video phone, or the visual display can be separate, such as on a television or a computer display.
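The round trip described in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the patent: the command table, the use of zlib as a stand-in for a speech codec, the JSON payload format, and the `recognize` stub, which treats the "audio" as UTF-8 text rather than performing real speech recognition. The network hop is replaced by a direct function call.

```python
import json
import zlib

# Hypothetical command table; the patent does not enumerate any commands.
COMMANDS = {
    "show weather": {"view": "weather", "title": "Today's forecast"},
    "next page": {"view": "pager", "action": "next"},
}


def encode_audio(samples: bytes) -> bytes:
    """Local device: encode captured audio for transmission.

    zlib is only a placeholder for whatever speech codec a real device uses.
    """
    return zlib.compress(samples)


def recognize(audio: bytes) -> str:
    """Stub for automated speech recognition.

    Assumption: the 'audio' is UTF-8 text standing in for a waveform.
    """
    return audio.decode("utf-8").strip().lower()


def remote_system(encoded: bytes) -> bytes:
    """Remote system: decode, recognize, interpret, and return display data."""
    audio = zlib.decompress(encoded)
    command = recognize(audio)
    payload = COMMANDS.get(
        command, {"view": "error", "title": f"Unrecognized: {command}"}
    )
    return json.dumps(payload).encode("utf-8")


def render(display_data: bytes) -> str:
    """Local display: turn the returned data into visual output (a string here)."""
    payload = json.loads(display_data)
    return f"[{payload['view']}] " + payload.get("title", payload.get("action", ""))


def handle_utterance(samples: bytes) -> str:
    """Full loop: capture -> encode -> transmit -> receive -> display."""
    encoded = encode_audio(samples)
    display_data = remote_system(encoded)  # stands in for the network round trip
    return render(display_data)


if __name__ == "__main__":
    print(handle_utterance(b"Show Weather"))
```

Note that the local side contains no recognition logic at all: it only encodes, sends, and renders, which is the point the abstract makes about audio input devices that lack speech recognition capabilities.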
75 Citations
20 Claims
1. A method of controlling a visual display using voice commands, the method comprising:
receiving an audio signal comprising voice commands from a user;
encoding the audio signal for transmission;
transmitting the encoded audio signal to a remote system;
in response to the transmission, receiving data from the remote system, wherein the data are configured to cause a display to display visual output; and
displaying the visual output on the visual display.
(Dependent claims: 2, 3, 4)
5. A method of controlling a visual display using voice commands, the method comprising:
receiving a transmission of input data from a remote location, wherein the input data is based at least upon voice commands spoken by a user at the remote location;
processing the input data using automated speech recognition to identify the voice commands; and
based at least upon the identified voice commands, transmitting output data to the remote location, wherein the output data is responsive to the voice commands and wherein the output data is configured to effect output by the visual display.
(Dependent claims: 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
16. A system for controlling a visual display, the system comprising:
a sound input device configured to receive, encode and transmit sounds;
a speech processing device located remote from the sound input device, the speech processing device configured to receive and process the encoded and transmitted sounds;
a server device configured to output data based upon output received from the speech processing device; and
a visual output device located proximate the sound input device, the visual output device comprising the visual display, the visual output device configured to control the display based on output received from the server device.
(Dependent claims: 17, 18, 19, 20)
Specification