Robust voice browser system and voice activated device controller
First Claim
1. A method of operating an extended-function computer system by selectively retrieving information in response to spoken commands received by the extended-function computer system, the method comprising:
- (a) identifying, as one of a plurality of data characterizing speech commands of a speech-recognition lexicon, audio data indicative of words naturally spoken into a microphone of an electronic-communication device of a user;
(b) using identified data characterizing the speech commands to access a corresponding descriptor file from a plurality of descriptor files, wherein each of the descriptor files identify (i) a web-accessible information source, and (ii) select data of the web-accessible information source;
(c) fetching, from the web-accessible information source identified by an accessed descriptor file, responsive data specified by select data identified by the accessed descriptor file;
(d) generating audio response data containing indicia of a message for the user, which message is responsive to the identified data characterizing the speech commands, and which message is based on the responsive data;
(e) directing the audio response data to the electronic-communication device of the user; and
(f) improving functionality of a voice-responsive system to allow selective retrieval of different kinds of information in response to commands spoken via the electronic-communication device of a user in communication with the voice-responsive system, further comprising;
storing, in a storage device accessible by the voice-responsive system, a first speech recognition grammar that is associated with a first function, and a second speech recognition grammar, different from the first speech recognition grammar, that is associated with a second function, different from the first function; and
storing, in the storage device, for each of the first function and the second function, respective function definitions, different from one another, each configured to be executed by a web browsing server of the voice-responsive system upon recognizing that a command, spoken by the user of an electronic-communication device, corresponds to the respective speech recognition grammar;
wherein each function definition identifies;
(i) a URL of an information source;
(ii) select responsive information to be retrieved from the information source; and
(iii) a responsive message, in a format required by the voice-responsive system so that the voice-responsive system can synthesize an audio response message to be played on a speaker of the electronic-communication device of the user.
4 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to an extended-function device for selectively retrieving information in response to naturally spoken commands provided via an electronic-communication device of a user that is used to query a corresponding descriptor file that identifies a web-accessible information source and fetches responsive data specified by select data identified by the accessed descriptor file. An audio response data containing indicia of a message for the user, which message is responsive to the identified naturally spoken command, and based on the responsive data is directed to the electronic-communication device of the user.
-
Citations
26 Claims
-
1. A method of operating an extended-function computer system by selectively retrieving information in response to spoken commands received by the extended-function computer system, the method comprising:
-
(a) identifying, as one of a plurality of data characterizing speech commands of a speech-recognition lexicon, audio data indicative of words naturally spoken into a microphone of an electronic-communication device of a user; (b) using identified data characterizing the speech commands to access a corresponding descriptor file from a plurality of descriptor files, wherein each of the descriptor files identify (i) a web-accessible information source, and (ii) select data of the web-accessible information source; (c) fetching, from the web-accessible information source identified by an accessed descriptor file, responsive data specified by select data identified by the accessed descriptor file; (d) generating audio response data containing indicia of a message for the user, which message is responsive to the identified data characterizing the speech commands, and which message is based on the responsive data; (e) directing the audio response data to the electronic-communication device of the user; and (f) improving functionality of a voice-responsive system to allow selective retrieval of different kinds of information in response to commands spoken via the electronic-communication device of a user in communication with the voice-responsive system, further comprising; storing, in a storage device accessible by the voice-responsive system, a first speech recognition grammar that is associated with a first function, and a second speech recognition grammar, different from the first speech recognition grammar, that is associated with a second function, different from the first function; and storing, in the storage device, for each of the first function and the second function, respective function definitions, different from one another, each configured to be executed by a web browsing server of the voice-responsive system upon recognizing that a command, spoken by the user of an electronic-communication device, corresponds to the respective speech recognition grammar; wherein each function definition identifies; (i) a URL of an information source; (ii) select responsive information to be retrieved from the information source; and (iii) a responsive message, in a format required by the voice-responsive system so that the voice-responsive system can synthesize an audio response message to be played on a speaker of the electronic-communication device of the user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. An apparatus having an extended capability of selectively retrieving information in response to naturally spoken commands, the apparatus comprising:
-
(a) a transceiver coupled to a network and capable of sending to and receiving information via the network from an electronic-communication device of a user, which device has a microphone; (b) a database containing a plurality of descriptor files, each of the descriptor files identifying (i) a web-accessible information source, and (ii) select data of the web-accessible information source; (c) a speech-recognition engine, coupled to the transceiver and having access to the database, programmed to automatically identify, as one of a plurality of speech commands of a speech-recognition lexicon, audio data indicative of words spoken into the microphone of the electronic-communication device of a user; (d) a media server, coupled to the speech-recognition engine and having access to the database, programmed to access a descriptor file from the plurality of descriptor files in the database based on the identified speech command; (e) a content fetcher, coupled to the media server, programmed to retrieve, from the web-accessible information source identified by the accessed descriptor file, responsive data specified by the select data identified by the accessed descriptor file, further comprising; means for improving functionality of a voice-responsive system to allow selective retrieval of different kinds of information in response to commands spoken via the electronic-communication device in communication with the voice-responsive system, further comprising; a first speech recognition grammar stored in a storage device accessible by the voice-responsive system, that is associated with a first function, and a second speech recognition grammar, different from the first speech recognition grammar, that is associated with a second function, different from the first function; and stored respective function definitions in the storage device, for each of the first function and the second function, different from one another, each configured to be executed by a web browsing server of the voice-responsive system upon recognizing that a command, spoken by the user of an electronic-communication device, corresponds to the respective speech recognition grammar, wherein each function definition identifies a URL of an information source and a select responsive information to be retrieved from the information source and a responsive message, in a format required by the voice-responsive system; and (f) a synthesizer coupled to the content fetcher and programmed to automatically generate audio response data containing indicia of a message for the user, which message is responsive to the identified speech command, and which message is based on the responsive data; and (g) wherein the apparatus is programmed to automatically direct the audio response data to the electronic-communication device of the user. - View Dependent Claims (21, 22)
-
-
23. An electronic-communication device having a capability of selectively retrieving information in response to naturally spoken commands, comprising:
-
(i) a microphone; (ii) wherein the electronic-communication device is in communication with a remote computer system via a network; (iii) wherein the remote computer system comprises; (a) a transceiver coupled to the network and capable of sending to and receiving information via the network from the electronic-communication device; (b) a database containing a plurality of descriptor files, each of the descriptor files identifying (i) a web-accessible information source, and (ii) select data of the web-accessible information source; (c) a speech-recognition engine, coupled to the transceiver and having access to the database, programmed to automatically identify, as one of a plurality of speech commands of a speech-recognition lexicon, audio data indicative of words spoken into the microphone; (d) a media server, coupled to the speech-recognition engine and having access to the database, programmed to access a descriptor file from the plurality of descriptor files in the database based on the identified speech command; (e) a content fetcher, coupled to the media server, programmed to retrieve, from the web-accessible information source identified by the accessed descriptor file, responsive data specified by the select data identified by the accessed descriptor file, further comprising; means for improving functionality of a voice-responsive system to allow selective retrieval of different kinds of information in response to commands spoken via the electronic-communication device in communication with the voice-responsive system, further comprising; a first speech recognition grammar stored in a storage device accessible by the voice-responsive system, that is associated with a first function, and a second speech recognition grammar, different from the first speech recognition grammar, that is associated with a second function, different from the first function; and stored respective function definitions in the storage device, for each of the first function and the second function, different from one another, each configured to be executed by a web browsing server of the voice-responsive system upon recognizing that a command, spoken by the user of an electronic-communication device, corresponds to the respective speech recognition grammar, wherein each function definition identifies a URL of an information source and a select responsive information to be retrieved from the information source and a responsive message, in a format required by the voice-responsive system; (f) a synthesizer coupled to the content extraction agent and programmed to automatically generate audio response data containing indicia of a message for the user, which message is responsive to the identified speech command, and which message is based on the responsive data; and (g) wherein the remote computer system is programmed to automatically direct the audio response data to the electronic-communication device; and (iv) a speaker adapted to convert the audio response data to an audible sound. - View Dependent Claims (24, 25, 26)
-
Specification