Personal voice-based information retrieval system
Abstract
The present invention relates to a system for retrieving information from a network such as the Internet. A user creates a user-defined record in a database that identifies an information source, such as a web site, containing information of interest to the user. This record identifies the location of the information source and also contains a recognition grammar based upon a speech command assigned by the user. Upon receiving the speech command from the user that is described within the recognition grammar, a network interface system accesses the information source and retrieves the information requested by the user.
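As an illustration of the user-defined record described in the abstract, the following is a minimal sketch, not the patented implementation. The `UserRecord` class, its field names, the `find_record` helper, and the example URLs are all hypothetical; the recognition grammar is reduced to a list of exact phrases, which is a simplifying assumption.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class UserRecord:
    """Hypothetical user-defined record: one information source plus its grammar."""
    source_url: str            # location of the information source
    grammar: Tuple[str, ...]   # phrases the user assigned to this source


def normalize(utterance: str) -> str:
    """Lower-case and collapse whitespace so matching ignores case and spacing."""
    return " ".join(utterance.lower().split())


def find_record(records: List[UserRecord], utterance: str) -> Optional[UserRecord]:
    """Return the first record whose grammar describes the recognized utterance."""
    spoken = normalize(utterance)
    for record in records:
        if any(normalize(phrase) == spoken for phrase in record.grammar):
            return record
    return None


# Example records; the URLs are placeholders, not real services.
records = [
    UserRecord("https://weather.example.com/chicago",
               ("chicago weather", "weather in chicago")),
    UserRecord("https://stocks.example.com/quote", ("my stocks",)),
]

match = find_record(records, "  Weather in   CHICAGO ")
print(match.source_url if match else "no matching record")
```

A command that no grammar describes returns `None`, which a caller could use to re-prompt the user.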
28 Claims
1. A method, comprising:
(a) receiving a speech command from a voice-enabled device of a particular user, over a network, by a speech-recognition engine coupled to a media server by an interactive voice response application including a user-defined search, the speech-recognition engine adapted to convert the speech command into a data message, the media server adapted to identify and access at least one or more websites containing information of interest to the particular user, the speech-recognition engine adapted to select particular speech-recognition grammar describing the speech command received and assigned to fetching content relating to the data message converted from the speech command and assigned to the user-defined search including a web request, along with a uniform resource locator of an identified web site from the one or more websites containing information of interest to the particular user and responsive to the web request;
(b) selecting, by the media server, at least one information-source-retrieval instruction stored for the particular speech-recognition grammar in a database coupled to the media server and adapted to retrieve information from the at least one or more websites;
(c) accessing, by a web-browsing server, a portion of an information source to retrieve information relating to the speech command, by using a processor of the web-browsing server, which processor (i) performs an instruction that requests information from an identified web site, (ii) utilizes a command to execute a content extractor within the web-browsing server to separate a portion of information that is relevant from other information on the web page using a name of a named object including the information, the information derived from only a portion of the web page containing information pertinent to the speech command, the content extractor adapted to use a content-descriptor file containing a description of the portion of information and the content-descriptor file adapted to indicate a location of the portion of the information within the information source;
(d) selecting, by the web-browsing server, the information relating to the speech command from the information source and retrieving only the portion of the information requested by the speech command according to the at least one information-source-retrieval instruction;
(e) converting the information retrieved from the information source into an audio message by a speech-synthesis engine, the speech-synthesis engine coupled to the media server; and
(f) transmitting the audio message by the voice-enabled device to the particular user.
Dependent claims: 2, 3, 4
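The content-extraction step of claim 1(c) can be pictured with a short sketch. This is only an illustration under stated assumptions: the descriptor is modeled as a dictionary with hypothetical `tag` and `name` keys, the "named object" is assumed to be an HTML element identified by its `id` attribute, and the sample page is invented. The claim itself does not prescribe any of these choices.

```python
from html.parser import HTMLParser
from typing import Optional


class NamedObjectExtractor(HTMLParser):
    """Collects the text inside the first element matching the descriptor."""

    def __init__(self, descriptor: dict):
        super().__init__()
        self.tag = descriptor["tag"]           # element type holding the content
        self.object_name = descriptor["name"]  # assumed to match the element's id
        self.depth = 0                         # > 0 while inside the named element
        self.done = False
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if self.done:
            return
        if self.depth:
            if tag == self.tag:                # nested element of the same type
                self.depth += 1
        elif tag == self.tag and dict(attrs).get("id") == self.object_name:
            self.depth = 1

    def handle_endtag(self, tag):
        if self.depth and tag == self.tag:
            self.depth -= 1
            if self.depth == 0:
                self.done = True               # ignore the rest of the page

    def handle_data(self, data):
        if self.depth:
            self.parts.append(data)


def extract(page_html: str, descriptor: dict) -> Optional[str]:
    """Return only the text of the named object, or None if it is absent."""
    parser = NamedObjectExtractor(descriptor)
    parser.feed(page_html)
    parser.close()
    text = " ".join("".join(parser.parts).split())
    return text or None


# Invented page: only the "forecast" element is relevant to the command.
PAGE = """
<html><body>
  <div id="ads">Buy umbrellas!</div>
  <div id="forecast">Chicago: <b>72 F</b>, sunny</div>
  <div id="footer">Contact us</div>
</body></html>
"""
DESCRIPTOR = {"tag": "div", "name": "forecast"}

print(extract(PAGE, DESCRIPTOR))
```

Keeping the location of the content in a separate descriptor, rather than in the extractor code, means a change to the page layout would only require editing the descriptor.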
5. A voice-browsing system for retrieving information from an information source that is periodically updated with current information, by speech commands received from a particular user provided via a voice-enabled device after establishing a connection between the voice-enabled device and a media server of the voice-browsing system, said voice-browsing system comprising:
(a) a speech-recognition engine including a processor and coupled to the media server, the media server initiating a voice-response application once the connection between the voice-enabled device and the voice-browsing system is established, the speech-recognition engine adapted to receive a speech command from a particular user via the voice-enabled device, the media server configured to identify and access the information source via a network, the speech-recognition engine adapted to convert the speech command into a data message by selecting speech-recognition grammar established to correspond to the speech command received from the particular user and assigned to perform searches;
(b) the media server further configured to select at least one information-source-retrieval instruction corresponding to the speech-recognition grammar established for the speech command, the at least one information-source-retrieval instruction stored in a database associated with the media server and adapted to retrieve information;
(c) a web-browsing server coupled to the media server and adapted to access at least a portion of the information source to retrieve information indicated by the speech command, by using a processor of the web-browsing server, which processor (i) performs an instruction that requests information from an identified web page within the information source, and (ii) utilizes a command to execute a content extractor within the web-browsing server to separate a portion of the information from other information, the information derived from only a portion of a web page containing information relevant to the speech command, wherein the content extractor uses a content-descriptor file containing a description of the portion of information and wherein the content-descriptor file indicates a location of a portion of the information within the information source, and selecting, by the web-browsing server, an information type relevant from the information source and retrieving only a portion of the information that is relevant according to the at least one information-source-retrieval instruction; and
(d) a speech-synthesis engine including a processor and coupled to the media server, the speech-synthesis engine adapted to convert the information retrieved from the information source into audio and convey the audio by the voice-enabled device.
Dependent claims: 6, 7, 8, 9
10. A method of selectively retrieving information in response to spoken commands received by a voice-browsing system, the method comprising:
(a) identifying, one of a plurality of speech commands of a speech-recognition lexicon, based on audio data indicative of words spoken into a microphone of an electronic-communication device of a user;
(b) using the identified speech command to access a corresponding descriptor file from a plurality of descriptor files stored in a database associated with the voice-browsing system, and using the corresponding descriptor file to identify (i) a web-accessible information source, and (ii) request information;
(c) using the request information to fetch, from the information source identified by an accessed descriptor file, response data including a named object including content;
(d) using the named object to extract the content from the response data;
(e) generating audio response data containing indicia of a message for the user, which message is responsive to the identified speech command, and which message is based on the extracted content; and
(f) directing a command to play the audio response data using the electronic-communication device of the user.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
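Steps (a) through (f) of claim 10 can be traced in a small sketch. Everything concrete here is an assumption made for illustration: the descriptor "files" are an in-memory dictionary, the response data is assumed to be JSON with the named object as a key, recognition is replaced by matching an already-transcribed string, and the fetch, synthesis, and playback functions are stubs passed in by the caller. The names `DESCRIPTORS`, `identify_command`, and `handle` are hypothetical.

```python
import json
from typing import Callable, Iterable, Optional

# Hypothetical descriptor "files", keyed by the speech command they serve.
DESCRIPTORS = {
    "stock quote": {
        "source": "https://quotes.example.com/api",   # placeholder URL
        "request": {"symbol": "ACME"},
        "named_object": "last_price",
        "template": "ACME last traded at {content} dollars.",
    },
}


def identify_command(transcript: str, lexicon: Iterable[str]) -> Optional[str]:
    """(a) Stand-in for speech recognition: match a transcript to the lexicon."""
    spoken = " ".join(transcript.lower().split())
    return spoken if spoken in set(lexicon) else None


def handle(transcript: str,
           fetch: Callable[[str, dict], str],
           synthesize: Callable[[str], bytes],
           play: Callable[[bytes], None]) -> bool:
    """Run steps (a)-(f); return False when no command is recognized."""
    command = identify_command(transcript, DESCRIPTORS.keys())       # (a)
    if command is None:
        return False
    descriptor = DESCRIPTORS[command]                                # (b)
    response = fetch(descriptor["source"], descriptor["request"])    # (c)
    content = json.loads(response)[descriptor["named_object"]]       # (d)
    audio = synthesize(descriptor["template"].format(content=content))  # (e)
    play(audio)                                                      # (f)
    return True


# Stubs so the sketch runs without a network, recognizer, or synthesizer.
def fake_fetch(url: str, params: dict) -> str:
    return json.dumps({"symbol": params["symbol"], "last_price": 12.5, "volume": 1000})


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")   # a real engine would return audio samples


played = []
handled = handle("Stock   Quote", fake_fetch, fake_synthesize, played.append)
print(handled, played)
```

Passing the fetch, synthesis, and playback functions as parameters keeps the control flow of the claim visible and lets each stage be swapped for a real component.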
19. An apparatus with a capability of selectively retrieving information in response to spoken commands, the apparatus comprising:
(a) a transceiver coupled to a network and capable of sending to and receiving information via the network from an electronic-communication device of a user, which device has a microphone;
(b) a database containing a plurality of descriptor files, each of the descriptor files identifying (i) a web-accessible information source, and (ii) request information for fetching from the web-accessible source;
(c) a speech-recognition engine, coupled to the transceiver and having access to the database, programmed to automatically identify, one of a plurality of speech commands of a speech-recognition lexicon, based on audio data indicative of words spoken into the microphone of the electronic-communication device of a user;
(d) a media server, coupled to the speech-recognition engine and having access to the database, programmed to access a descriptor file from the plurality of descriptor files in the database based on the identified speech command expressing the request information;
(e) a web browsing server, coupled to the media server and programmed; (i) to retrieve, from the web-accessible information source identified by the accessed descriptor file, responsive data specified by the request information identified by the accessed descriptor file, wherein the responsive data includes a named object including content; and (ii) to use a name of the named object to identify content relating to the name and to extract the content from the response data; and
(f) a synthesizer coupled to the web browsing server and programmed to generate and transmit audio response data containing indicia of a message about the request information for the user, which message is responsive to the identified speech command, and which message is based on the extracted content;
(g) the apparatus is programmed to direct a command to play an audio response data using the electronic-communication device of the user.
Dependent claims: 20, 21, 22, 23
24. An apparatus having a capability of selectively retrieving information in response to spoken commands, comprising:
(a) a microphone; and
(b) a speaker coupled to the microphone; and
(c) wherein the electronic-communication device is in communication with a remote computer system via a network to initiate user-defined searches; and
(d) wherein the remote computer system comprises;
(i) a speech-recognition engine, coupled to a transceiver and having access to a database, programmed to identify, one of a plurality of speech commands of a speech-recognition lexicon, based on audio data indicative of words spoken into the microphone of the electronic-communication device of a user;
(ii) a media server, coupled to the speech-recognition engine and having access to a database containing a plurality of descriptor files, programmed to use the identified speech command to access a corresponding descriptor file from the plurality of descriptor files, wherein the corresponding descriptor file is used to identify (i) a web-accessible information source, and (ii) request information to fetch from the web-accessible information source;
(iii) a web browsing server programmed; (A) to use the request information to fetch, from the web-accessible information source identified by the accessed descriptor file, response data including a named object including particular content; and (B) to use a name associated with the named object to identify content relating to the name and to extract the content from the response data;
(iv) a speech-synthesizer coupled to the web browsing server and programmed to generate and transmit audio response data containing indicia of a message for the user about the request information, which message is responsive to the identified speech command, and which message is based on the extracted content; and
(v) wherein the remote computer system is programmed to direct a command to play the audio response data on the speaker.
Dependent claims: 25, 26, 27, 28
Specification