Method and system for processing business data
First Claim
1. A method of verifying business data comprising:
- (a) looking up a first profile data for a business using at least one URL;
(b) looking up a second profile data for said business using a business identifier; and
(c) comparing said first profile data and said second profile data, thereby verifying that said second profile data is valid.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system that collects data from resources connected to a network for addition to a database that contains data records for businesses. A database of URL records is built according to a data structure that includes data elements that are useful to determine if an entity described by the data elements qualifies as a business. The data elements of the two databases are used to form web mining strategies. A distributing processing system is used to mine huge numbers of web pages in parallel. The bandwidth and transmission times are shortened at the distributed device end by summarizing web page content in an index that is returned to a central processor in the form of a byte. The central processor analyzes the byte and earmarks for a complete content extraction only those web pages that have enough business content.
-
Citations
24 Claims
-
1. A method of verifying business data comprising:
-
(a) looking up a first profile data for a business using at least one URL;
(b) looking up a second profile data for said business using a business identifier; and
(c) comparing said first profile data and said second profile data, thereby verifying that said second profile data is valid. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method of developing new business profile data comprising:
-
(a) looking up a first profile data for a business using at least one URL;
(b) looking in a database for a second profile data for said business using one or more data elements of said first profile data; and
(c) if said second profile data is not found, determining if said first profile data qualifies as a business and, if so, assigning a business identifier thereto to form said new business profile data. - View Dependent Claims (7)
-
-
8. A method for processing profile data, wherein said profile data includes separate profile data records for a plurality of business concerns, wherein each of said profile data records includes a plurality of data elements, and wherein each of said profile data records is identified by a business identifier, said method comprising:
-
(a) comparing a plurality of URL data with said profile data, wherein said URL data includes a plurality of URL data records, and wherein each of said URL data records includes a URL and at least one business data element for a business concern;
(b) developing a plurality of unmatched URL data records, wherein said at least one business data element is unmatched to any data element in said plurality of profile data records;
(c) using the URL of a first one of said unmatched URL records to locate on a network one or more sites that contains additional business data elements for said first URL record;
(d) adding said additional data elements to said first unmatched URL record; and
(d) determining if said updated first unmatched URL record qualifies as a business and, if so, assigning a business identifier thereto and adding to said plurality of data records for a plurality of business concerns. - View Dependent Claims (9, 10, 12, 13, 14)
-
-
11. A method for mining data from a plurality of resources connected to a network, said method comprising:
-
(a) maintaining a plurality of URL records in a first database that includes a plurality of fields for each URL record;
(b) maintaining a plurality of business data records in a second database that includes a plurality of fields for each business data record; and
(c) deriving a mining strategy from data elements stored in one or more of the fields of said first and second databases to mine data elements from said plurality of resources for storage in the fields of said first database.
-
-
15. A method of processing the content of a web page comprising:
-
(a) arranging the content of said web page into a plurality of content categories; and
(b) forming an index that summarizes said content categories. - View Dependent Claims (16, 17)
-
-
18. A data mining system comprising:
-
means for serving a URL; and
at least one supplier device for forming an index of the content of a web page indicated by said URL and returning said index to said serving means.
-
-
19. A method of filtering a plurality of web pages for mining a business content comprising:
-
(a) eliminating any of said plurality of web pages that contain adult content;
(b) eliminating any of said plurality of web pages that do not pass a predictability test of containing business content; and
(c) mining any of said plurality of web pages remaining after steps (a) and (b) for business content.
-
-
20. A computer system that verifies and develops business profile data, said computer system comprising:
-
first look up means for looking up a first profile data for a business using at least one URL;
second look up means for looking for a second profile data for said business using a business identifier;
compare means for comparing said first profile data and said second profile data, if said second profile data is found, thereby verifying that said second profile data is valid; and
establishing means for establishing said second profile data with said first profile data if said second profile data is not found. - View Dependent Claims (21, 22, 23, 24)
-
Specification