System and method of automating a spoken dialogue service
First Claim
Patent Images
1. A method comprising:
- extracting, via a processor and independent of user input, first task data from a web-site, wherein the first task data is based on a structure of a web page in the web-site;
formatting the first task data into first formatted task data comprising a first topic section, a first hyperlink section, and a first content/answer section;
extracting, independent of the user input, second task data from the web-site, wherein the second task data is based on the structure of the web page in the web-site;
formatting the second task data into second formatted task data comprising a second topic section, a second hyperlink section, and a second content/answer section;
analyzing the first formatted task data and the second formatted task data, to yield an analysis;
when, based on the analysis, the first topic section is determined to be thematically coherent with the second topic section, merging the first formatted task data and the second formatted task data into third formatted task data;
receiving a spoken natural language user query;
organizing task data into a ranked hierarchical structure based on the spoken natural language user query, wherein the task data comprises one of (1) the first formatted task data and the second formatted task data, and (2) the third formatted task data;
generating a ranked list of relevant responses to the spoken natural language user query using the ranked hierarchical structure to perform vector space modeling; and
initiating a two-way, natural language spoken dialog to provide a response to the spoken natural language user query according to the ranked list.
5 Assignments
0 Petitions
Accused Products
Abstract
A system and method of generating and operating a spoken dialog service for a web-site are disclosed. The system parses web-site data and organizes the web-site data in a task knowledge data bank. The system receives text associated with a user query; processes the received text in a spoken language understanding (SLU) module, the SLU module using the web-site data from the task knowledge data bank; generates a ranked list of relevant responses to the user query; generates a hierarchical tree using the web-site data and the ranked list of relevant responses to the user query, generates a response to the user query using the hierarchical tree; and presents the response to the user.
75 Citations
33 Claims
-
1. A method comprising:
-
extracting, via a processor and independent of user input, first task data from a web-site, wherein the first task data is based on a structure of a web page in the web-site; formatting the first task data into first formatted task data comprising a first topic section, a first hyperlink section, and a first content/answer section; extracting, independent of the user input, second task data from the web-site, wherein the second task data is based on the structure of the web page in the web-site; formatting the second task data into second formatted task data comprising a second topic section, a second hyperlink section, and a second content/answer section; analyzing the first formatted task data and the second formatted task data, to yield an analysis; when, based on the analysis, the first topic section is determined to be thematically coherent with the second topic section, merging the first formatted task data and the second formatted task data into third formatted task data; receiving a spoken natural language user query; organizing task data into a ranked hierarchical structure based on the spoken natural language user query, wherein the task data comprises one of (1) the first formatted task data and the second formatted task data, and (2) the third formatted task data; generating a ranked list of relevant responses to the spoken natural language user query using the ranked hierarchical structure to perform vector space modeling; and initiating a two-way, natural language spoken dialog to provide a response to the spoken natural language user query according to the ranked list. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method comprising:
-
detecting, via a processor, when new web-site data is added to a web-site, the new web-site data being based on a structure of a web page in the web-site; extracting the new web-site data; formatting the new web-site data into new formatted web-site data comprising a new topic section, a new hyperlink section, and a new content/answer section; analyzing, via a processor, the new formatted web-site data and old formatted web-site data, to yield an analysis, the old formatted web-site data comprising an old topic section, an old hyperlink section, and an old content/answer section; when, based on the analysis, the new topic section is determined to be thematically coherent with the old topic section, merging the new formatted web-site data and the old formatted web-site data into third formatted web-site data; receiving a spoken natural language user query; organizing web-site data into a ranked hierarchical structure based on the new web site data for the spoken natural language user query, wherein the web-site data comprises one of (1) the new formatted web-site data and the old formatted web-site data, and (2) the third formatted task data; generating a ranked list of relevant responses to the spoken natural language user query, using the ranked hierarchical structure to perform vector space modeling; and initiating a two-way, natural language spoken dialog to provide a response to the spoken natural language user query based on the ranked list.
-
-
14. A method comprising:
-
automatically detecting, via a processor, when additional web-site structure data is added to a web-site, wherein the additional web-site structure data is based on a structure of the web-site; extracting the additional web-site structure data from the web-site; formatting the additional web-site structure data into formatted additional data comprising an additional topic section, an additional hyperlink section, and an additional content/answer section; analyzing, via a processor, the formatted additional data and formatted original data, to yield an analysis, wherein the formatted original data comprises an original topic section, an original hyperlink section, and an original content/answer section; when, based on the analysis, the original topic section is determined to be thematically coherent with the additional topic section, merging the formatted original data and the formatted additional data into third formatted data; receiving the spoken natural language user query; organizing data into a ranked hierarchical structure based on the spoken natural language user query, wherein the data comprises one of (1) the formatted original data and the formatted original data, and (2) the third formatted data; generating a ranked list of relevant responses to the spoken natural language user query using the data; and synchronizing a two-way, natural language spoken dialog to provide a response to the spoken natural language user query according to the ranked list. - View Dependent Claims (15)
-
-
16. A system comprising:
-
a processor; and a computer-readable storage medium having instructions stored which, when executed on the processor, perform operations comprising; extracting, via a processor and independent of user input, first task data from a web-site, wherein the first task data is based on a structure of a web page in the web-site; formatting the first task data into first formatted task data comprising a first topic section, a first hyperlink section, and a first content/answer section; extracting, independent of the user input, second task data from the web-site, wherein the second task data is based on the structure of the web page in the web-site; formatting the second task data into second formatted task data comprising a second topic section, a second hyperlink section, and a second content/answer section; analyzing the first formatted task data and the second formatted task data, to yield an analysis; when, based on the analysis, the first topic section is determined to be thematically coherent with the second topic section, merging the first formatted task data and the second formatted task data into third formatted task data; receiving a spoken natural language user query; organizing task data into a ranked hierarchical structure based on the spoken natural language user query, wherein the task data comprises one of (1) the first formatted task data and the second formatted task data, and (2) the third formatted task data; generating a ranked list of relevant responses to the spoken natural language user query-using the ranked hierarchical structure to perform vector space modeling; and initiating a two-way, natural language spoken dialog using the ranked list. - View Dependent Claims (17, 18, 19, 20, 21)
-
-
22. A computer-readable storage device having instructions which, when executed on a computing device, cause the computing device to perform operations comprising:
-
parsing web-site data independent of user input, the web-site data being based on a structure of a web-page in a web-site, yielding parsed web-site data; formatting the parsed web-site data into formatted parsed web-site data comprising a parsed topic section, a parsed hyperlink section, and a parsed content/answer section such that a text segment from the web-page is extracted and represented using the task data; analyzing the formatted parsed web-site data and formatted old web-site data, to yield an analysis, the formatted old web-site data comprising an old topic section, an old hyperlink section, and an old parsed content/answer section; when, based on the analysis, the parsed topic section is determined to be thematically coherent with the old topic section, merging the formatted parsed web-site data and the formatted old web-site data into a third formatted web-site data; receiving a spoken natural language user query; organizing web-site data into a ranked hierarchical structure based on the spoken natural language user query, wherein the web-site data comprises one of (1) the formatted parsed web-site data and the formatted old web-site data, and (2) the third formatted web-site data; generating a ranked list of relevant responses to the spoken natural language user query using the ranked hierarchical structure to perform vector space modeling; and using a spoken dialog model to generate a spoken, audible response to the spoken natural language user query based on the ranked list; and initiating a two-way, natural language spoken dialog which delivers the spoken, audible response. - View Dependent Claims (23)
-
-
24. A system comprising:
-
a processor; and a computer-readable storage medium having instructions stored which, when executed on the processor, cause the processor to perform operations comprising; parsing web-site data, independent of user input, the web-site data being based on a structure of a web-site to yield parsed web-site data; formatting the parsed web-site data, to yield formatted parsed web-site data comprising a parsed topic section, a parsed hyperlink section, and a parsed content/answer section; analyzing the formatted parsed web-site data and formatted old web-site data, to yield an analysis, the formatted old web-site data comprising an old topic section, an old hyperlink section, and an old parsed content/answer section; when, based on the analysis, the parsed topic section is determined to be thematically coherent with the old topic section, merging the formatted parsed web-site data and the formatted old web-site data into a third formatted web-site data; receiving a spoken natural language user query; organizing web-site data into a ranked hierarchical structure based on the spoken natural language user query, wherein the web-site data comprises one of (1) the formatted parsed web-site data and the formatted old web-site data, and (2) the third formatted web-site data; generating a ranked list of relevant responses to the spoken natural language user query using the web-site data to perform vector space modeling; and using the ranked list to generate a spoken, audible response to the spoken natural language user query from web-site content as part of a two-way natural language spoken dialog, wherein the two-way, natural language spoken dialog is configured for operating without translating web-pages of the web-site. - View Dependent Claims (25, 26)
-
-
27. A method comprising:
-
parsing web-site data, via a processor and independent of user input, the web-site data being based on a structure of a web-site, to yield parsed web-site data; formatting the parsed web-site data, to yield formatted parsed web-site data comprising a parsed topic section, a parsed hyperlink section, and a parsed content/answer section; analyzing the formatted parsed web-site data and formatted old web-site data, to yield an analysis, the formatted old web-site data comprising an old topic section, an old hyperlink section, and an old parsed content/answer section; when, based on the analysis, the parsed topic section is determined to be thematically coherent with the old topic section, merging the formatted parsed web-site data and the formatted old web-site data into third formatted web-site data; receiving, via a processor, a spoken natural language user query; organizing web-site data into a ranked hierarchical structure based on the spoken natural language user query, wherein the web-site data comprises one of (1) the formatted parsed web-site data and the formatted old web-site data, and (2) the third formatted web-site data; generating a ranked list of relevant responses to the spoken natural language user query using data from the ranked hierarchical structure to perform vector space modeling; generating a spoken, audible response to the spoken natural language user query using the ranked hierarchical structure; and initiating a two-way natural language spoken dialog to provide a response to the spoken natural language user query based on the ranked list. - View Dependent Claims (28, 29, 30)
-
-
31. A method comprising:
-
receiving, via a processor, a spoken natural language user query; receiving text associated with the spoken natural language user query; receiving first web-site data based on a structure of a web-site; formatting the first web-site data into first formatted web-site data comprising a first topic section, a first hyperlink section, and a first content/answer section; receiving second web-site data based on a structure of the web-site; formatting the second web-site data into second formatted web-site data comprising a second topic section, a second hyperlink section, and a second content/answer section; analyzing the first formatted web-site data and the second formatted web-site data, to yield an analysis; when, based on the analysis, the first topic section is determined to be thematically coherent with the second topic section, merging the first formatted web-site data and the second formatted web-site data into a third formatted web-site data; organizing web-site data into a ranked hierarchical structure based on the spoken natural language user query and the text, wherein the web-site data comprises one of (1) the first formatted web-site data and the second formatted web-site data, and (2) the third formatted web-site data; generating a ranked list of relevant response to the spoken natural language user query using the ranked hierarchical structure to perform vector space modeling; generating a spoken, audible response to the spoken natural language user query using the ranked hierarchical tree via a two-way, natural language spoken dialog; and presenting the spoken, audible response to the user. - View Dependent Claims (32, 33)
-
Specification