Global speech user interface
First Claim
Patent Images
1. A system for enabling a user to select media content in an entertainment environment, said system comprising:
- a remote control device comprising;
a set of user activated keys adapted to execute functions of the entertainment environment;
a microphone for receiving user speech;
coupled to the microphone, a speech activation circuit adapted to enable a speech signal; and
coupled to the speech activation circuit, a transmitter adapted to transmit the speech signal from the remote control device;
coupled to the remote control device, a speech engine comprising a speech recognizer configured to receive the speech signal, an application wrapper configured to recognize substantive meaning embodied in the speech signal, and output commands for enabling selection of media content when substantive meaning is recognized; and
coupled to the speech engine, a media content controller configured to receive commands representing the recognized substantive meaning, to select media content, and to use selected media content in the entertainment environment;
wherein;
every function that can be executed by activation of the user activated keys can also be executed by the speech engine in response to the recognized substantive meaning;
the speech engine provides user visual indications responsive to the speech signal; and
the user visual indications comprise at least one overlay on a display screen, said overlay from the group of items consisting of;
a dialog box;
a list of context-sensitive spoken commands;
a list of digital cable services available to the user.
5 Assignments
0 Petitions
Accused Products
Abstract
A global speech user interface (GSUI) comprises an input system to receive a user'"'"'s spoken command, a feedback system along with a set of feedback overlays to give the user information on the progress of his spoken requests, a set of visual cues on the television screen to help the user understand what he can say, a help system, and a model for navigation among applications. The interface is extensible to make it easy to add new applications.
147 Citations
14 Claims
-
1. A system for enabling a user to select media content in an entertainment environment, said system comprising:
-
a remote control device comprising; a set of user activated keys adapted to execute functions of the entertainment environment; a microphone for receiving user speech; coupled to the microphone, a speech activation circuit adapted to enable a speech signal; and coupled to the speech activation circuit, a transmitter adapted to transmit the speech signal from the remote control device; coupled to the remote control device, a speech engine comprising a speech recognizer configured to receive the speech signal, an application wrapper configured to recognize substantive meaning embodied in the speech signal, and output commands for enabling selection of media content when substantive meaning is recognized; and coupled to the speech engine, a media content controller configured to receive commands representing the recognized substantive meaning, to select media content, and to use selected media content in the entertainment environment;
wherein;every function that can be executed by activation of the user activated keys can also be executed by the speech engine in response to the recognized substantive meaning; the speech engine provides user visual indications responsive to the speech signal; and the user visual indications comprise at least one overlay on a display screen, said overlay from the group of items consisting of; a dialog box; a list of context-sensitive spoken commands; a list of digital cable services available to the user.
-
-
2. A system for enabling a user to select media content in an entertainment environment said system comprising:
-
a remote control device comprising; a set of user activated keys adapted to execute functions of the entertainment environment; a microphone for receiving user speech; coupled to the microphone, a speech activation circuit adapted to enable a speech signal; and coupled to the speech activation circuit a transmitter adapted to transmit the speech signal from the remote control device; coupled to the remote control device, a speech engine comprising a speech recognizer configured to receive the speech signal, an application wrapper configured to recognize substantive meaning embodied in the speech signal, and output commands for enabling selection of media content when substantive meaning is recognized; and coupled to the speech engine, a media content controller configured to receive commands representing the recognized substantive meaning, to select media content, and to use selected media content in the entertainment environment;
wherein;every function that can be executed by activation of the user activated keys can also be executed by the speech engine in response to the recognized substantive meaning; and the application wrapper provides a binary indication denoting whether or not substantive meaning has been successfully recognized in the speech signal. - View Dependent Claims (3, 4)
-
-
5. A method for selecting media content in an entertainment environment, said method comprising:
-
receiving user speech and generating a speech signal in response to the user speech; determining and displaying a user feedback message corresponding to the generated speech signal; and selecting and using media content in the entertainment environment based upon the generated speech signal;
wherein;the feedback message comprises one of three levels, the three levels corresponding to progressively more detailed feedback imparted to the user, said three levels presented to the user based upon sequentially occurring first, second, and third unsuccessful speech recognitions, respectively. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system for selecting media content in an entertainment environment, said system comprising:
-
means for receiving user speech and for generating a speech signal in response to the user speech; coupled to the receiving and generating means, means for determining substantive meaning from the generated speech signal, and for displaying on a user display two distinct categories of feedback messages corresponding to the generated speech signal, where the two categories correspond to two distinct types of problems; and coupled to the determining and displaying means, means for selecting and using media content in the entertainment environment based upon the generated speech signal, wherein; the first category comprises a recognition feedback message informing the user of a problem with recognizing substantive meaning; and the second category comprises an application feedback message informing the user regarding issues with selection and use of media content.
-
Specification