System and method of providing a spoken dialog interface to a website
First Claim
1. A method of generating data for use by a plurality of components in a spoken dialog service, the method comprising:
- converting data from a structured database to a structured text data set and a structured task knowledge base for use by at least a portion of a plurality of spoken dialog service components;
extracting linguistic items from the structured database for use by at least another portion of the plurality of spoken dialog service components; and
automatically generating a spoken dialog service by training each of the plurality of spoken dialog service components using the structured text data, the structured task knowledge base or the linguistic items.
5 Assignments
0 Petitions
Accused Products
Abstract
Disclosed is a system and method for generating a spoken dialog service from website data. Spoken dialog components typically include an automatic speech recognition module, a language understanding module, a dialog management module, a language generation module and a test-to-speech module. These components are capable of being automatically trained from processed website data. A website analyzer converts a website into structured text data set and a structured task knowledge base. The website analyzer further extracts linguistic items from the website data. The dialog components are automatically trained from the structured text data set, structured task knowledge base and linguistic items.
-
Citations
42 Claims
-
1. A method of generating data for use by a plurality of components in a spoken dialog service, the method comprising:
-
converting data from a structured database to a structured text data set and a structured task knowledge base for use by at least a portion of a plurality of spoken dialog service components; extracting linguistic items from the structured database for use by at least another portion of the plurality of spoken dialog service components; and automatically generating a spoken dialog service by training each of the plurality of spoken dialog service components using the structured text data, the structured task knowledge base or the linguistic items. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. An analyzer module for generating a database of information used to build a spoken dialog service, the analyzer module comprising:
-
means for generating a structured text data set and a structured task knowledge base from website data for use by at least one spoken dialog component; means for extracting linguistic items from the website data for use by at least one spoken dialog component; and means for automatically training each component of a spoken dialog service using the structured text data set, structured task knowledge base and linguistic items. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. A system for generating a spoken dialog service from website data, the system comprising:
-
A. a plurality of spoken dialog components comprising an automatic speech recognition module, a language understanding module, a dialog management module, a language generation module and a text-to-speech module; and B. a website analyzer having; 1. means for generating a structured text data set and a structured task knowledge base from website data for use by at least one of the spoken dialog components; 2. means for extracting linguistic items from the website data for use by at least one of the spoken dialog components; and 3. means for automatically training each component of a spoken dialog service using the structured text data set, structured task knowledge base and linguistic items. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39)
-
-
40. A method of segmenting webpages from a website for use in automatically generating a spoken dialog interface to the website, the method comprising:
-
representing each webpage in a website by at least one set of vectors; encoding each webpage as a sequence of features vectors; detecting boundaries between information units on each webpage; and classifying information blocks organized according to the detected boundaries between information units into information class symbols, wherein the classified information blocks are utilized for generating a spoken dialog interface to the website. - View Dependent Claims (41, 42)
-
Specification