Personal Voice-Based Information Retrieval System
Abstract
The present invention relates to a system for retrieving information from a network such as the Internet. A user creates a user-defined record in a database that identifies an information source, such as a web site, containing information of interest to the user. This record identifies the location of the information source and also contains a recognition grammar based upon a speech command assigned by the user. Upon receiving the speech command from the user that is described within the recognition grammar, a network interface system accesses the information source and retrieves the information requested by the user.
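The user-defined record described in the abstract can be illustrated with a minimal sketch. The `UserRecord` class, the `find_record` helper, the exact-phrase matching rule, and the example URLs are all assumptions made for illustration; the source does not specify how records are stored or how a grammar is matched.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class UserRecord:
    """Hypothetical user-defined record: a source location plus its recognition grammar."""
    source_url: str              # location of the information source
    grammar: Tuple[str, ...]     # phrases the user assigned to this source


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so matching ignores trivial differences."""
    return " ".join(text.lower().split())


def find_record(records: List[UserRecord], recognized_text: str) -> Optional[UserRecord]:
    """Return the first record whose grammar contains the recognized command, else None."""
    spoken = normalize(recognized_text)
    for record in records:
        if spoken in (normalize(phrase) for phrase in record.grammar):
            return record
    return None


# Example records (placeholder URLs, not taken from the source).
records = [
    UserRecord("https://weather.example.com/chicago",
               ("chicago weather", "weather in chicago")),
    UserRecord("https://stocks.example.com/quote", ("my stocks",)),
]

match = find_record(records, "  Weather in   Chicago ")
print(match.source_url if match else "no match")
```

A real recognition grammar would likely be far richer than a tuple of literal phrases; the tuple is used here only to show the record linking a spoken command to a source location.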
35 Claims
1. A method, comprising:
(a) receiving a speech command from a voice-enabled device, over a network, by a speech-recognition engine coupled to a media server by an interactive voice response application including a user-defined search, the speech-recognition engine adapted to convert the speech command into a data message, the media server adapted to identify and access at least one or more websites containing information of interest to a particular user, the speech-recognition engine adapted to select particular speech-recognition grammar describing the speech command received and assigned to fetching content relating to the data message converted from the speech command and assigned to the user-defined search including a web request, along with a uniform resource locator of an identified web site from the one or more websites containing information of interest to the particular user and responsive to the web request;
(b) selecting, by the media server, at least one information-source-retrieval instruction stored for the particular speech-recognition grammar in a database coupled to the media server and adapted to retrieve information from the at least one or more websites;
(c) accessing, by a web-browsing server, a portion of the information source to retrieve information relating to the speech command, by using a processor of the web-browsing server, which processor (i) performs an instruction that requests information from an identified web page, (ii) utilizes a command to execute a content extractor within the web-browsing server to separate a portion of the information that is relevant from other information on the web page using a name of a named object including the information, the information derived from only a portion of the web page containing information pertinent to the speech command, the content extractor adapted to use a content-descriptor file containing a description of the portion of information and the content-descriptor file adapted to indicate a location of the portion of the information within the information source;
(d) selecting by the web-browsing server, the information relating to the speech command from the information source and retrieving only the portion of the information requested by the speech command according to the at least one information-source-retrieval instruction;
(e) converting the information retrieved from the information source into an audio message by a speech-synthesis engine, the speech-synthesis engine coupled to the media server; and
(f) transmitting the audio message by the voice-enabled device to the particular user.
Dependent claims: 2, 3, 4
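Step (c) of claim 1 describes a content extractor that uses a content-descriptor file to separate the relevant portion of a web page from the rest. The following is a minimal sketch of that idea, assuming the descriptor is a dictionary and that the "named object" is an HTML element identified by its `id` attribute. The class name, the descriptor keys, and the sample page are hypothetical; the claim does not prescribe a file format or a parsing method.

```python
from html.parser import HTMLParser

# Tags that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class NamedObjectExtractor(HTMLParser):
    """Collects the text inside the first element whose id equals `name`."""

    def __init__(self, name: str) -> None:
        super().__init__()
        self.name = name
        self.depth = 0       # greater than 0 while inside the target element
        self.done = False    # set once the target element has been closed
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if self.done or tag in VOID_TAGS:
            return
        if self.depth:
            self.depth += 1                      # nested element inside the target
        elif dict(attrs).get("id") == self.name:
            self.depth = 1                       # entering the target element

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or not self.depth:
            return
        self.depth -= 1
        if self.depth == 0:
            self.done = True

    def handle_data(self, data):
        if self.depth:
            self.parts.append(data)


def extract(page_html: str, descriptor: dict) -> str:
    """Return only the text of the object named in the descriptor."""
    parser = NamedObjectExtractor(descriptor["object_name"])
    parser.feed(page_html)
    return " ".join("".join(parser.parts).split())


# Hypothetical content descriptor: where the page is and which object to keep.
descriptor = {"url": "https://weather.example.com/chicago", "object_name": "forecast"}
page = ('<html><body><div id="ads">Buy now</div>'
        '<div id="forecast">Sunny, <b>72</b> degrees</div></body></html>')
print(extract(page, descriptor))
```

The sketch assumes reasonably well-formed markup; a production extractor would need to tolerate unclosed tags and pages whose layout changes.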

5. A voice-browsing system for retrieving information from an information source that is periodically updated with current information, by speech commands received from a particular user provided via a voice-enabled device after establishing a connection between the voice-enabled device and a media server of the voice-browsing system, said voice-browsing system comprising:
(a) a speech-recognition engine including a processor and coupled to the media server, the media server initiating a voice-response application once the connection between the voice-enabled device and the voice-browsing system is established, the speech-recognition engine adapted to receive a speech command from a particular user via the voice-enabled device, the media server configured to identify and access the information source via a network, the speech-recognition engine adapted to convert the speech command into a data message by selecting speech-recognition grammar established to correspond to the speech command received from the particular user and assigned to perform searches;
(b) the media server further configured to select at least one information-source-retrieval instruction corresponding to the speech-recognition grammar established for the speech command, the at least one information-source-retrieval instruction stored in a database associated with the media server and adapted to retrieve information;
(d) a web-browsing server coupled to the media server and adapted to access at least a portion of the information source to retrieve information indicated by the speech command, by using a processor of the web-browsing server, which processor (i) performs an instruction that requests information from an identified web page, and (ii) utilizes a command to execute a content extractor within the web-browsing server to separate a portion of the information from other information, the information derived from only a portion of a web page containing information relevant to the speech command, wherein the content extractor uses a content-descriptor file containing a description of the portion of information and wherein the content-descriptor file indicates a location of the portion of the information within the information source, and selecting, by the web-browsing server, the information relevant from the information source and retrieving only the portion of the information that is relevant according to the at least one information-source-retrieval instruction; and
(e) a speech-synthesis engine including a processor and coupled to the media server, the speech-synthesis engine adapted to convert the information retrieved from the information source into audio and convey the audio by the voice-enabled device.
Dependent claims: 6, 7, 8, 9

10. A method of selectively retrieving information in response to spoken commands received by a voice-browsing system, the method comprising:
(a) identifying, as one of a plurality of speech commands of a speech-recognition lexicon, audio data indicative of words spoken into a microphone of an electronic-communication device of a user;
(b) using the identified speech command to access a corresponding descriptor file from a plurality of descriptor files stored in a database associated with the voice-browsing system, and using the corresponding descriptor file to identify (i) a web-accessible information source, and (ii) request information;
(c) using the request information to fetch, from the information source identified by an accessed descriptor file, response data including a named object including content;
(d) using the named object to extract the content from the response data;
(e) generating audio response data containing indicia of a message for the user, which message is responsive to the identified speech command, and which message is based on the extracted content; and
(f) directing a command to play the audio response data using the electronic-communication device of the user.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
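Steps (b) through (e) of claim 10 can be read as a small pipeline: look up a descriptor for the identified command, fetch response data, pull out the named object, and build a message. The sketch below is one possible reading under stated assumptions: response data is assumed to be JSON, the descriptor is a dictionary with hypothetical keys `source`, `request`, and `object_name`, and the network fetch is passed in as a function so the example runs offline. Speech recognition (step a) and audio playback (step f) are out of scope here.

```python
import json
from typing import Callable, Dict

# Hypothetical descriptor store keyed by the identified speech command.
DESCRIPTORS: Dict[str, dict] = {
    "chicago weather": {
        "source": "https://weather.example.com/api",   # placeholder URL
        "request": {"city": "chicago"},
        "object_name": "forecast",
    },
}


def handle_command(command: str,
                   descriptors: Dict[str, dict],
                   fetch: Callable[[str, dict], str]) -> str:
    """Return the message text that a synthesizer would turn into audio."""
    descriptor = descriptors[command]                               # step (b)
    response = fetch(descriptor["source"], descriptor["request"])   # step (c)
    content = json.loads(response)[descriptor["object_name"]]       # step (d)
    return f"Here is your result for {command}: {content}"          # step (e)


def fake_fetch(source: str, request: dict) -> str:
    """Stand-in for a network call; returns response data with extra fields."""
    return json.dumps({"forecast": "sunny, 72 degrees", "ads": "buy now"})


print(handle_command("chicago weather", DESCRIPTORS, fake_fetch))
```

Passing `fetch` as a parameter keeps the lookup and extraction logic testable without a live information source.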

19. An apparatus with a capability of selectively retrieving information in response to spoken commands, the apparatus comprising:
(a) a transceiver coupled to a network and capable of sending to and receiving information via the network from an electronic-communication device of a user, which device has a microphone;
(b) a database containing a plurality of descriptor files, each of the descriptor files identifying (i) a web-accessible information source, and (ii) request information;
(c) a speech-recognition engine, coupled to the transceiver and having access to the database, programmed to automatically identify, as one of a plurality of speech commands of a speech-recognition lexicon, audio data indicative of words spoken into the microphone of the electronic-communication device of a user;
(d) a media server, coupled to the speech-recognition engine and having access to the database, programmed to access a descriptor file from the plurality of descriptor files in the database based on the identified speech command;
(e) a web browsing server, coupled to the media server and programmed;
  (i) to retrieve, from the information source identified by the accessed descriptor file, responsive data specified by the request information identified by the accessed descriptor file, wherein the response data includes a named object including content; and
  (ii) to use the name of the named object to extract the content from the response data; and
(f) a synthesizer coupled to the web browsing server and programmed to generate audio response data containing indicia of a message for the user, which message is responsive to the identified speech command, and which message is based on the extracted content;
(g) the apparatus is programmed to direct a command to play an audio response data using the electronic-communication device of the user.
Dependent claims: 20, 21, 22, 23

24. A method of executing improved functionality of a voice-responsive system to allow selective retrieval of different kinds of information in response to commands spoken via an electronic communication device of a user in communication with the voice-responsive system, the method comprising:
(a) storing, in a storage device accessible by the voice-responsive system, a speech recognition grammar that is associated with an executable function; and
(b) storing, in the storage device, for the executable function, an executable function definition configured to be executed by a web browsing server of the voice-responsive system upon recognizing that a command, spoken by a user of an electronic-communication device, corresponds to the speech recognition grammar;
(c) wherein the executable function definition identifies;
  (i) information used to generate requests to an information source that includes a URL to identify the information source and to extract content from a named object within response data obtainable from the information source accessible by the URL; and
  (ii) information used to format an audible message from extracted content, so that a command to synthesize an audio response message will generate a coherent sentence that responds to the command spoken by a user, wherein the audio response message is adapted to be played on a speaker of the electronic-communication device of the user.
Dependent claims: 25, 26, 27, 28, 29, 30
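The executable function definition of claim 24 bundles two kinds of information: what is needed to build a request and extract a named object, and what is needed to format a sentence from the extracted content. A minimal sketch of such a definition follows. The `FunctionDefinition` class, its field names, the `{value}` placeholder convention, and the example URL are assumptions for illustration only; the claim does not define a concrete data layout.

```python
from dataclasses import dataclass
from urllib.parse import urlencode


@dataclass(frozen=True)
class FunctionDefinition:
    """Hypothetical stored definition tied to one speech recognition grammar."""
    base_url: str            # URL identifying the information source
    object_name: str         # named object to extract from the response data
    message_template: str    # sentence template containing a {value} placeholder

    def request_url(self, **params: str) -> str:
        """Build the request URL, appending any query parameters."""
        return f"{self.base_url}?{urlencode(params)}" if params else self.base_url

    def format_message(self, response: dict) -> str:
        """Extract the named object and place it into a complete sentence."""
        return self.message_template.format(value=response[self.object_name])


# Storage step: associate a grammar phrase with its function definition.
definition = FunctionDefinition(
    base_url="https://stocks.example.com/quote",   # placeholder URL
    object_name="last_price",
    message_template="The last price was {value} dollars.",
)
GRAMMAR_TO_FUNCTION = {"acme quote": definition}

chosen = GRAMMAR_TO_FUNCTION["acme quote"]
print(chosen.request_url(symbol="ACME"))
print(chosen.format_message({"last_price": "12.50", "volume": "900"}))
```

Keeping the sentence template next to the extraction rule means the text handed to a synthesizer is a full sentence rather than a bare value.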

31. An apparatus having a capability of selectively retrieving information in response to spoken commands, comprising:
(a) a microphone; and
(b) a speaker coupled to the microphone; and
(c) wherein the electronic-communication device is in communication with a remote computer system via a network to initiate user-defined searches; and
(d) wherein the remote computer system comprises;
  (i) a speech-recognition engine, coupled to a transceiver and having access to a database, programmed to identify, as one of a plurality of speech commands of a speech-recognition lexicon, audio data indicative of words spoken into the microphone of the electronic-communication device of a user;
  (ii) a media server, coupled to the speech-recognition engine and having access to a database containing a plurality of descriptor files, programmed to use the identified speech command to access the corresponding descriptor file from the plurality of descriptor files, wherein the corresponding descriptor file is used to identify (i) a web-accessible information source, and (ii) request information;
  (iii) a web browsing server programmed;
    (A) to use the request information to fetch, from the information source identified by the accessed descriptor file, response data including a named object including particular content; and
    (B) to use a name associated with the named object to extract the content from the response data;
  (iv) a speech-synthesizer coupled to the web browsing server and programmed to generate audio response data containing indicia of a message for the user, which message is responsive to the identified speech command, and which message is based on the extracted content; and
  (v) wherein the remote computer system is programmed to direct a command to play the audio response data on the speaker.
Dependent claims: 32, 33, 34, 35
Specification