Personal voice-based information retrieval system

US 10,320,981 B2
Filed: 09/18/2017
Issued: 06/11/2019
Est. Priority Date: 02/04/2000
Status: Expired due to Fees

Abstract

The present invention relates to a system for retrieving information from a network such as the Internet. A user creates a user-defined record in a database that identifies an information source, such as a web site, containing information of interest to the user. This record identifies the location of the information source and also contains a recognition grammar based upon a speech command assigned by the user. Upon receiving the speech command from the user that is described within the recognition grammar, a network interface system accesses the information source and retrieves the information requested by the user.
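The record-and-lookup flow in the abstract can be pictured with a minimal sketch. It assumes the speech command has already been recognized as text. The `UserRecord` class, its field names, and the example URL are hypothetical illustrations and are not taken from the patent.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class UserRecord:
    """Hypothetical user-defined record tying a spoken phrase to a source."""
    speech_command: str  # phrase assigned by the user (stands in for the grammar)
    source_url: str      # location of the information source


def normalize(phrase: str) -> str:
    """Lower-case and collapse whitespace so lookups ignore case and spacing."""
    return " ".join(phrase.lower().split())


def build_index(records: List[UserRecord]) -> Dict[str, UserRecord]:
    """Index records by their normalized speech command."""
    return {normalize(r.speech_command): r for r in records}


def find_record(index: Dict[str, UserRecord], recognized_text: str) -> Optional[UserRecord]:
    """Return the record whose command matches the recognized text, if any."""
    return index.get(normalize(recognized_text))
```

A real recognition grammar would accept many phrasings of one command; exact matching on normalized text is used here only to keep the sketch short.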

418 Citations

28 Claims

1. A method, comprising:
- (a) receiving a speech command from a voice-enabled device of a particular user, over a network, by a speech-recognition engine coupled to a media server by an interactive voice response application including a user-defined search, the speech-recognition engine adapted to convert the speech command into a data message, the media server adapted to identify and access at least one or more websites containing information of interest to the particular user, the speech-recognition engine adapted to select particular speech-recognition grammar describing the speech command received and assigned to fetching content relating to the data message converted from the speech command and assigned to the user-defined search including a web request, along with a uniform resource locator of an identified web site from the one or more websites containing information of interest to the particular user and responsive to the web request;
  
  (b) selecting, by the media server, at least one information-source-retrieval instruction stored for the particular speech-recognition grammar in a database coupled to the media server and adapted to retrieve information from the at least one or more websites;
  
  (c) accessing, by a web-browsing server, a portion of an information source to retrieve information relating to the speech command, by using a processor of the web-browsing server, which processor (i) performs an instruction that requests information from an identified web site, (ii) utilizes a command to execute a content extractor within the web-browsing server to separate a portion of information that is relevant from other information on the web page using a name of a named object including the information, the information derived from only a portion of the web page containing information pertinent to the speech command, the content extractor adapted to use a content-descriptor file containing a description of the portion of information and the content-descriptor file adapted to indicate a location of the portion of the information within the information source;
  
  (d) selecting, by the web-browsing server, the information relating to the speech command from the information source and retrieving only the portion of the information requested by the speech command according to the at least one information-source-retrieval instruction;
  
  (e) converting the information retrieved from the information source into an audio message by a speech-synthesis engine, the speech-synthesis engine coupled to the media server; and
  
  (f) transmitting the audio message by the voice-enabled device to the particular user.
- - 2. The method of claim 1, wherein the speech command is received by at least one of a landline telephone, a wireless telephone, and an Internet Protocol telephone and the media server is operatively connected to at least one of a local-area network, a wide-area network, and the Internet.
  - 3. The method of claim 2, wherein the media server functions as a user-interface system adapted to provide access to a voice-browsing system.
  - 4. The method of claim 2, further comprising:
    - a clipping engine adapted to initially generate the content-descriptor file that indicates the location of the portion of the information within the identified web site.
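One way to picture a content-descriptor file that "indicates the location of the portion of the information" is as a pair of text markers that bracket the wanted portion. The sketch below makes that assumption. The `ContentDescriptor` class and its `begin_marker` and `end_marker` fields are hypothetical, and the claims do not specify this format.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ContentDescriptor:
    """Hypothetical descriptor: text that brackets the wanted portion."""
    begin_marker: str
    end_marker: str


def clip(page: str, descriptor: ContentDescriptor) -> Optional[str]:
    """Return the text between the two markers, or None if either is missing."""
    start = page.find(descriptor.begin_marker)
    if start == -1:
        return None
    start += len(descriptor.begin_marker)
    # Search for the end marker only after the begin marker.
    end = page.find(descriptor.end_marker, start)
    if end == -1:
        return None
    return page[start:end].strip()
```

Everything outside the markers is discarded, so only the relevant portion of the page would be passed on for speech synthesis.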

5. A voice-browsing system for retrieving information from an information source that is periodically updated with current information, by speech commands received from a particular user provided via a voice-enabled device after establishing a connection between the voice-enabled device and a media server of the voice-browsing system, said voice-browsing system comprising:
- (a) a speech-recognition engine including a processor and coupled to the media server, the media server initiating a voice-response application once the connection between the voice-enabled device and the voice-browsing system is established, the speech-recognition engine adapted to receive a speech command from a particular user via the voice-enabled device, the media server configured to identify and access the information source via a network, the speech-recognition engine adapted to convert the speech command into a data message by selecting speech-recognition grammar established to correspond to the speech command received from the particular user and assigned to perform searches;
  
  (b) the media server further configured to select at least one information-source-retrieval instruction corresponding to the speech-recognition grammar established for the speech command, the at least one information-source-retrieval instruction stored in a database associated with the media server and adapted to retrieve information;
  
  (c) a web-browsing server coupled to the media server and adapted to access at least a portion of the information source to retrieve information indicated by the speech command, by using a processor of the web-browsing server, which processor (i) performs an instruction that requests information from an identified web page within the information source, and (ii) utilizes a command to execute a content extractor within the web-browsing server to separate a portion of the information from other information, the information derived from only a portion of a web page containing information relevant to the speech command, wherein the content extractor uses a content-descriptor file containing a description of the portion of information and wherein the content-descriptor file indicates a location of a portion of the information within the information source, and selecting, by the web-browsing server, an information type relevant from the information source and retrieving only a portion of the information that is relevant according to the at least one information-source-retrieval instruction; and
  
  (d) a speech-synthesis engine including a processor and coupled to the media server, the speech-synthesis engine adapted to convert the information retrieved from the information source into audio and convey the audio by the voice-enabled device.
- - 6. The voice-browsing system of claim 5, further comprising:
    - an interface to an associated website by the network to locate requested information.
  - 7. The voice-browsing system of claim 5, wherein the voice-enabled device accesses the voice-browsing system by at least one of a landline telephone, a wireless telephone, and an Internet Protocol telephonic connection and wherein the media server operatively connects to the network, by at least one of a local-area network, a wide-area network, and the Internet.
  - 8. The voice-browsing system of claim 5, wherein the media server functions as a user-interface system adapted to provide access to a voice-browsing system.
  - 9. The voice-browsing system of claim 5, further comprising:
    - a clipping engine adapted to generate the content-descriptor file, by which, an instruction is used by the web-browsing server to request information from the identified web site and the information is displayed on the voice-enabled device, wherein the information is only the portion of the web page containing information relevant to the speech command.

10. A method of selectively retrieving information in response to spoken commands received by a voice-browsing system, the method comprising:
- (a) identifying, one of a plurality of speech commands of a speech-recognition lexicon, based on audio data indicative of words spoken into a microphone of an electronic-communication device of a user;
  
  (b) using the identified speech command to access a corresponding descriptor file from a plurality of descriptor files stored in a database associated with the voice-browsing system, and using the corresponding descriptor file to identify (i) a web-accessible information source, and (ii) request information;
  
  (c) using the request information to fetch, from the information source identified by an accessed descriptor file, response data including a named object including content;
  
  (d) using the named object to extract the content from the response data;
  
  (e) generating audio response data containing indicia of a message for the user, which message is responsive to the identified speech command, and which message is based on the extracted content; and
  
  (f) directing a command to play the audio response data using the electronic-communication device of the user.
- - 11. The method of claim 10, wherein the content is located in the response data using the named object regardless of the location of the named object within the response data.
  - 12. The method of claim 11, wherein the fetching occurs on a web browsing server, and wherein the web browsing server receives the identified speech command from a different server.
  - 13. The method of claim 12, further comprising:
    - using Internet Protocol to communicate with the electronic-communication device of the user.
  - 14. The method of claim 12, further comprising:
    - using a telecommunication network to communicate with the electronic-communication device of the user.
  - 15. The method of claim 12, wherein the electronic-communication device of the user is a voice-enabled wireless unit that is not a telephone.
  - 16. The method of claim 12, wherein the corresponding descriptor file identifies the web-accessible information source and information used to generate proper requests to the information source with a specific URL format including search parameters.
  - 17. The method of claim 12, wherein using the request information to fetch comprises fetching the response data from a database stored on a Local Area Network (LAN) or a Wide Area Network (WAN).
  - 18. The method of claim 12, further comprising:
    - using the named object to determine a beginning and an end of the content within the response data.
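Claims 11 and 18 describe locating content by a named object regardless of its position and using that object to find where the content begins and ends. A minimal sketch follows, under the assumption that the response data is HTML and that the "named object" is an element carrying a matching `id` attribute. That mapping, the `NamedObjectExtractor` class, and the `extract_named_content` function are illustrative and are not taken from the patent.

```python
from html.parser import HTMLParser
from typing import List, Optional

# Void elements never get a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class NamedObjectExtractor(HTMLParser):
    """Collects the text inside the first element whose id equals `name`."""

    def __init__(self, name: str) -> None:
        super().__init__()
        self.name = name
        self.depth = 0      # 0 means we are outside the named object
        self.done = False   # True once the named object has been closed
        self.parts: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS or self.done:
            return
        if self.depth > 0:
            self.depth += 1                    # nested element inside the object
        elif dict(attrs).get("id") == self.name:
            self.depth = 1                     # beginning of the content

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or self.depth == 0:
            return
        self.depth -= 1
        if self.depth == 0:
            self.done = True                   # end of the content

    def handle_data(self, data):
        if self.depth > 0:
            self.parts.append(data)


def extract_named_content(response_data: str, name: str) -> Optional[str]:
    """Return the whitespace-normalized text of the named object, or None."""
    parser = NamedObjectExtractor(name)
    parser.feed(response_data)
    parser.close()
    if not parser.done and parser.depth == 0:
        return None                            # the named object never appeared
    return " ".join("".join(parser.parts).split())
```

Because the lookup keys on the name rather than on a character offset, the same call should keep working if the source page moves the element elsewhere in its layout.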

19. An apparatus with a capability of selectively retrieving information in response to spoken commands, the apparatus comprising:
- (a) a transceiver coupled to a network and capable of sending to and receiving information via the network from an electronic-communication device of a user, which device has a microphone;
  
  (b) a database containing a plurality of descriptor files, each of the descriptor files identifying (i) a web-accessible information source, and (ii) request information for fetching from the web-accessible source;
  
  (c) a speech-recognition engine, coupled to the transceiver and having access to the database, programmed to automatically identify, one of a plurality of speech commands of a speech-recognition lexicon, based on audio data indicative of words spoken into the microphone of the electronic-communication device of a user;
  
  (d) a media server, coupled to the speech-recognition engine and having access to the database, programmed to access a descriptor file from the plurality of descriptor files in the database based on the identified speech command expressing the request information;
  
  (e) a web browsing server, coupled to the media server and programmed;
  
  (i) to retrieve, from the web-accessible information source identified by the accessed descriptor file, responsive data specified by the request information identified by the accessed descriptor file, wherein the responsive data includes a named object including content; and
  
  (ii) to use a name of the named object to identify content relating to the name and to extract the content from the response data; and
  
  (f) a synthesizer coupled to the web browsing server and programmed to generate and transmit audio response data containing indicia of a message about the request information for the user, which message is responsive to the identified speech command, and which message is based on the extracted content;
  
  (g) the apparatus is programmed to direct a command to play an audio response data using the electronic-communication device of the user.
- - 20. The apparatus of claim 19, wherein the web browsing server is further programmed to use the accessed descriptor file to format a request for a content fetcher.
  - 21. The apparatus of claim 20, wherein the content fetcher is executed in response to a command included in the accessed descriptor file that is executed on the web browsing server.
  - 22. The apparatus of claim 19, wherein the speech-recognition engine is within the media server.
  - 23. The apparatus of claim 19, wherein the web browsing server is further programmed to use the named object to determine a beginning and an end of the content within the responsive data.
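Claim 20 has the web browsing server use the descriptor file to format a request for a content fetcher, and claim 16 mentions a URL format that includes search parameters. The sketch below shows one way such formatting might look. The `Descriptor` class, its fields, and the example URL are hypothetical, and the patent does not prescribe this layout.

```python
from dataclasses import dataclass, field
from typing import Dict, Mapping
from urllib.parse import urlencode, urlsplit, urlunsplit


@dataclass
class Descriptor:
    """Hypothetical descriptor entry for one web-accessible source."""
    source_url: str
    # Query parameters sent with every request.
    fixed_params: Dict[str, str] = field(default_factory=dict)
    # Maps a slot filled from the speech command to a query parameter name.
    slot_params: Dict[str, str] = field(default_factory=dict)


def format_request(descriptor: Descriptor, spoken_values: Mapping[str, str]) -> str:
    """Build the fetch URL from the descriptor and values heard in the command."""
    params = dict(descriptor.fixed_params)
    for slot, param_name in descriptor.slot_params.items():
        params[param_name] = spoken_values[slot]
    scheme, netloc, path, _query, _fragment = urlsplit(descriptor.source_url)
    # Any query string already present in source_url is replaced, not merged.
    return urlunsplit((scheme, netloc, path, urlencode(params), ""))
```

Using `urlencode` means values with spaces or punctuation are escaped before the request reaches the fetcher.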

24. An apparatus having a capability of selectively retrieving information in response to spoken commands, comprising:
- (a) a microphone; and
  
  (b) a speaker coupled to the microphone; and
  
  (c) wherein the electronic-communication device is in communication with a remote computer system via a network to initiate user-defined searches; and
  
  (d) wherein the remote computer system comprises;
  
  (i) a speech-recognition engine, coupled to a transceiver and having access to a database, programmed to identify, one of a plurality of speech commands of a speech-recognition lexicon, based on audio data indicative of words spoken into the microphone of the electronic-communication device of a user;
  
  (ii) a media server, coupled to the speech-recognition engine and having access to a database containing a plurality of descriptor files, programmed to use the identified speech command to access a corresponding descriptor file from the plurality of descriptor files, wherein the corresponding descriptor file is used to identify (i) a web-accessible information source, and (ii) request information to fetch from the web-accessible information source;
  
  (iii) a web browsing server programmed;
  
  (A) to use the request information to fetch, from the web-accessible information source identified by the accessed descriptor file, response data including a named object including particular content; and
  
  (B) to use a name associated with the named object to identify content relating to the name and to extract the content from the response data;
  
  (iv) a speech-synthesizer coupled to the web browsing server and programmed to generate and transmit audio response data containing indicia of a message for the user about the request information, which message is responsive to the identified speech command, and which message is based on the extracted content; and
  
  (v) wherein the remote computer system is programmed to direct a command to play the audio response data on the speaker.
- - 25. The apparatus of claim 24, wherein the network is the Internet.
  - 26. The apparatus of claim 24, wherein the network is a telecommunication network.
  - 27. The apparatus of claim 24, wherein the electronic-communication device is a voice-enabled wireless unit that is not a telephone.
  - 28. The apparatus of claim 24, wherein the web browsing server is further programmed to use the named object to determine a beginning and an end of the content within the responsive data.

Current Assignee: Parus Holdings, Inc.
Original Assignee: Parus Holdings, Inc.
Inventors: Kurganov, Alexander
Primary Examiner(s): Hashem, Lisa
Application Number: US15/707,951
Publication Number: US 20180007201A1
Time in Patent Office: 631 Days
CPC Class Codes

G06F 16/638   Presentation of query results

G06F 16/95   Retrieval from the web

G06F 3/167   Audio in a user interface, ...

G10L 13/08   Text analysis or generation...

G10L 15/02   Feature extraction for spee...

G10L 15/06   Creation of reference templ...

G10L 15/08   Speech classification or se...

G10L 15/22   Procedures used during a sp...

G10L 15/26   Speech to text systems G10L...

G10L 17/24   the user being prompted to ...

G10L 2015/223   Execution procedure of a sp...

G10L 25/54   for retrieval

H04L 67/02   based on web technology, e....

H04M 2201/39   using speech synthesis spee...

H04M 2201/40   using speech recognition sp...

H04M 2201/405   involving speaker-dependent...

H04M 2207/40   terminals with audio html b...

H04M 3/4938   comprising a voice browser ...
