Electronic shopping agent which is capable of operating with vendor sites which have disparate formats
First Claim
1. Apparatus for retrieving comparative item information from a plurality of merchant sites having disparate information formats in response to a request, including a category and a keyword, from a user, the apparatus comprising:
- a database containing a plurality of categories and, for each category, at least one URL for one of the plurality of merchant sites;
a query generator responsive to the request category for composing a query by concatenating a URL obtained from the database with the request category with the request keyword wherein the query generator composes a plurality of queries for the requested category;
a search engine for retrieving information from the plurality of merchant sites with the query; and
an automatic learning object for processing retrieved information to extract the item information and wherein a plurality of automatic learning objects are created in parallel with automatic learning object being created to process information retreived from each query.
1 Assignment
0 Petitions
Accused Products
Abstract
A shopping bot uses real time agents that automatically contact disparate web pages representing vast variety of different categories and merchants and retrieve and unify the information therein for display when a request for the information is made. Consequently, there is no need to create a “wrapper”, or a “information adapter” for each category or each merchant because the same agent can retrieve and process information in various formats. In particular, the shopping bot generates queries from keywords entered by a user and a database of URL information. Information returned by the queries is filtered, parsed and mapped to a standard format. The formatted information can then be displayed. Since the information is converted to the standard format in real time, the invention allows quick addition of online merchants and additional product categories can be added quickly and easily. Further, information about a product item can be easily enlarged as market needs increase without changing a database of codes one-by-one. In accordance with one embodiment, the database of URL information includes URLs specific to site directories at each merchant site so that queries can be easily generated by appending user-provided keywords.
-
Citations
25 Claims
-
1. Apparatus for retrieving comparative item information from a plurality of merchant sites having disparate information formats in response to a request, including a category and a keyword, from a user, the apparatus comprising:
-
a database containing a plurality of categories and, for each category, at least one URL for one of the plurality of merchant sites;
a query generator responsive to the request category for composing a query by concatenating a URL obtained from the database with the request category with the request keyword wherein the query generator composes a plurality of queries for the requested category;
a search engine for retrieving information from the plurality of merchant sites with the query; and
an automatic learning object for processing retrieved information to extract the item information and wherein a plurality of automatic learning objects are created in parallel with automatic learning object being created to process information retreived from each query. - View Dependent Claims (2, 3, 4, 5, 6, 7)
a filter for removing formatting information in the retrieved information; and
a parser for parsing the filtered information into one or more data trees, each data tree having one or more nodes.
-
-
4. Apparatus according to claim 3 wherein the automatic learning object further comprises:
-
a mechanism which examines each node for relevant information; and
an extraction mechanism which extracts the item information from the relevant information.
-
-
5. Apparatus according to claim 1 wherein the retrieved information is coded in HTML code and wherein the automatic learning object processes the HTML code to remove HTML formatting tags.
-
6. Apparatus according to claim 1 wherein the retrieved information is coded in XML code and wherein the automatic learning object processes the XML code to remove XML formatting tags.
-
7. Apparatus according to claim 1 wherein the database includes at least one URL for a search engine located in one of the plurality of merchant sites.
-
8. A method for retrieving comparative item information from a plurality of merchant sites having disparate information formats in response to a request, including a category and a keyword, from a user, the method comprising:
-
(a) constructing a database containing a plurality of categories and, for each category, at least one URL for one of the plurality of merchant sites;
(b) Composing a query in response to the request category by concatenating a URL obtained from the database with the request category with the request keyword;
(c) using a search engine to retrieve information from the plurality of merchant sites with the query;
(d) creating an automatic learning object for processing retrieved information to extract the item information; and
(e) wherein step (b) comprises composing a plurality of queries for the requested category and wherein step (d) comprises creating a plurality of automatic learning objects in parallel with one automatic learning object being created to process information retrieved from each query. - View Dependent Claims (9, 10, 11, 12, 13, 14)
(d1) removing formatting information in the retrieved information; and
(d2) parsing the filtered information into one or more data trees, each data tree having one or more nodes.
-
-
11. A method according to claim 10 wherein step (d) further comprises:
-
(d3) examining each node for relevant information; and
(d4) extracting the item information from the relevant information.
-
-
12. A method according to claim 8 wherein the retrieved information is coded in HTML code and wherein step (d) comprises processing the HTML code to remove HTML formatting tags.
-
13. A method according to claim 8 wherein the retrieved information is coded in XML code and wherein step (d) comprises processing the XML code to remove XML formatting tags.
-
14. A method according to claim 8 wherein the database is constructed to include at least one URL for a search engine located in one of the plurality of merchant sites.
-
15. A computer program product for retrieving comparative item information from a plurality of merchant sites having disparate information formats in response to a request, including a category and a keyword, from a user, the computer program product comprising a computer usable medium having computer readable program code thereon, including:
-
program code for constructing a database containing a plurality of categories and, for each category, at least one URL for one of the plurality of merchant sites;
program code for composing a query in response to the request category by concatenating a URL obtained from the database with the request category with the request keyword;
program code for creating a search engine to retrieve information from the plurality of merchant sites with the query;
program code for creating an automatic learning object for processing retrieved information to extract the item information; and
wherein the program code for composing a query comprises program code for composing a plurality of queries for the requested category and wherein the program code for creating a search engine comprises program code for creating a plurality of automatic learning objects in parallel with one automatic learning object being created to process information retrieved from each query. - View Dependent Claims (16, 17, 18, 19, 20, 21, 24)
program code for removing formatting information in the retrieved information; and
program code for parsing the filtered information into one or more data trees, each data tree having one or more nodes.
-
-
18. A computer program product according to claim 17 wherein the program code for creating an automatic learning object further comprises:
-
program code for examining each node for relevant information; and
program code for extracting the item information from the relevant information.
-
-
19. A computer program product according to claim 15 wherein the retrieved information is coded in HTML code and wherein the program code for creating an automatic learning object comprises program code for processing the HTML code to remove HTML formatting tags.
-
20. A computer program product according to claim 15 wherein the retrieved information is coded in XML code and wherein the program code for creating an automatic learning object comprises program code for processing the XML code to remove XML formatting tags.
-
21. A computer program product according to claim 15 wherein the database is constructed to include at least one URL for a search engine located in one of the plurality of merchant sites.
-
24. A computer data signal according to claim 20 wherein the program code for creating an automatic learning object comprises:
-
program code for removing formatting information in the retrieved information; and
program code for parsing the filtered information into one or more data trees, each data tree having one or more nodes.
-
-
22. A computer data signal embodied in a carrier wave for retrieving comparative item information from a plurality of merchant sites having disparate information formats in response to a request, including a category and a keyword, from a user, the computer data signal comprising:
-
program code for constructing a database containing a plurality of catageories and, for each category, at least one URL for one of the plurality of merchant sites;
program code for composing a query in response to the request category by concatenating a URL obtained from the database with the request category with the request keyword;
program code for creating a search engine to retrieve information from the plurality of merchant sites with the query;
program code for creating an automatic learning object for processing retrieved information to extract the item information; and
wherein the program code for composing a query comprises program code for composing a plurality of queries for the requested category and wherein the program code for creating a search engine comprises program code for creating a plurality of automatic learning objects in parallel with one automatic learning object being created to process information retrieved from each query. - View Dependent Claims (23, 25)
-
Specification