System and method for determining if one web site has the same information as another web site
First Claim
1. A method of determining if one web site has the same information as another web site, the method comprising:
- receiving a signal to select a form configured to find data in a file, the file containing information displayed on a web site and accessed via a network;
wherein a form overlays a file to filter out particular information;
applying the selected form to the file and selectively identifying item information available in the file;
copying identified item information to a first data file, the identified item information being related to a specific product or service; and
comparing the first data file and a second data file to determine if the specific product or service of the first data file is related to the specific product or service of the second data file.
9 Assignments
0 Petitions
Accused Products
Abstract
A method of determining if one web site has the same information as another web site includes receiving a signal to select a form configured to find data in a file containing information displayed on a web site and accessed via a network, applying the selected form to the file and selectively identifying item information available in the file, copying identified item information to a first data file, the identified item information being related to a specific product or service, and comparing the first data file and a second data file to determine if the specific product or service of the first data file is related to the specific product or service of the second data file.
-
Citations
25 Claims
-
1. A method of determining if one web site has the same information as another web site, the method comprising:
-
receiving a signal to select a form configured to find data in a file, the file containing information displayed on a web site and accessed via a network;
wherein a form overlays a file to filter out particular information;
applying the selected form to the file and selectively identifying item information available in the file;
copying identified item information to a first data file, the identified item information being related to a specific product or service; and
comparing the first data file and a second data file to determine if the specific product or service of the first data file is related to the specific product or service of the second data file. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
selectively identifying a set of candidate forms configured to find product and service data on a web site; and
determining useful forms from the set of candidate forms and how the useful forms should be filled out.
-
-
3. The method of claim 1, wherein the step of comparing the first data file and a second data file comprises:
-
identifying item attributes; and
comparing available item attributes in the first data file with item attributes in the second file.
-
-
4. The method of claim 3, wherein a first item associated with the first file is determined to be the same as a second item associated with the second file if more than a predetermined percentage of item attributes in the first file do not conflict with item attributes in the second file.
-
5. The method of claim 3, wherein a first item associated with the first file is determined to be the same as a second item associated with the second file if a threshold of item attributes in the first file are common with item attributes in the second file.
-
6. The method of claim 1, further comprising creating a third file containing item information from the first file and the second file.
-
7. The method of claim 6, further comprising adding the third file to a data structure containing associative and inheritance relationships with other data files.
-
8. The method of claim 1, further comprising adding the first file to an existant data structure.
-
9. The method of claim 1, wherein the step of comparing the first data file and a second data file to determine if the specific product or service of the first data file is related to the specific product or service of the second data file comprises determining if the first data file and second data file contain information on the same specific product or service.
-
10. A system comprising:
-
means for receiving a signal to select a form configured to find data in a file containing information displayed on a web site and accessed via a network;
wherein a form overlays a file to filter out particular information;
means for applying the selected form to the file and selectively identifying item information available in the file;
means for copying specific products or services item information identified to a first data file, the item information being related to specific products or services; and
means for comparing the first data file and a second data file to determine if the specific product or service of the first data file is related to the specific product or service of the second data file. - View Dependent Claims (11, 12, 13, 14, 15, 16)
means for selectively identifying a set of candidate forms configured to find product and service data on a web site; and
means for determining useful forms from the set of candidate forms and how the useful forms should be filled out.
-
-
12. The system of claim 10, wherein the means for comparing the first data file and a second data file comprises:
-
means for identifying item attributes; and
means for comparing available item attributes in the first data file with item attributes in the second file.
-
-
13. The system of claim 10, further comprising means for creating a third file containing item information from the first file and the second file.
-
14. The system of claim 13, further comprising means for adding the third file to a data structure containing associative and inheritance relationships with other data files.
-
15. The system of claim 10, further comprising means for adding the first file to an existant data structure contained in a database.
-
16. The system of claim 10, wherein the means for comparing the first data file and a second data file to determine if the specific product or service of the first data file is related to the specific product or service of the second data file comprises means for determining if the first data file and second data file contain information on the same specific product or service.
-
17. A computer program product comprising computer readable program code for automatically determining if one web site has the same information as another web site, the program code in the computer program product comprising:
-
first computer readable program code for receiving a signal to select a form configured to find data in a file containing information displayed on a web site and accessed via a network;
wherein a form overlays a file to filter out particular information;
second computer readable program code for applying the selected form to the file and selectively identifying item information available in the file;
third computer readable program code for copying specific products or services item information identified to a first data file, the item information being related to specific products or services; and
fourth computer readable program code for comparing the first data file and a second data file to determine if the specific product or service of the first data file is related to the specific product or service of the second data file. - View Dependent Claims (18, 19)
program code for identifying item attributes; and
program code for comparing available item attributes in the first data file with item attributes in the second file.
-
-
19. The program code of claim 17, further comprising fifth computer readable program code for creating a third file containing item information from the first file and the second file.
-
20. A system configured to identify Internet-based data and determine if an Internet web page contains information related to an item, the system comprising:
-
means for locating Internet-based data, the Internet-based data including item information;
means for copying the located Internet-based data into a plurality of files;
means for comparing item information in a first file with item information in a second file;
wherein the means for copying the located Internet-based data into a plurality of files comprises means for identifying a form for finding item data on a web site; and
wherein the form overlays a web page to filter out particular information. - View Dependent Claims (21, 22, 23)
-
-
24. A system comprising:
-
a processing unit, the processing unit being configured to receive a signal to select a form configured to find data in a file containing information displayed on a web site and accessed via a network, the processing unit further configured to apply the selected form to the file and selectively identify item information available in the file;
wherein the selected form overlays a file to filter out particular information;
a data management unit coupled to the processing unit, the data management unit being configured to copy specific products or services item information identified to a first data file, the item information being related to specific products or services; and
a comparison unit coupled to the processing unit, the comparison unit configured to compare the first data file and a second data file to determine if the specific product or service of the first data file is related to the specific product or service of the second data file. - View Dependent Claims (25)
-
Specification