System and method for synchronized web scraping
Abstract
A method includes obtaining information associated with a product, service, or event. The method also includes scraping data based on the obtained information substantially concurrently from two or more web pages associated with websites that list a same product, service, or event to produce scraped data for the same product, service, or event from each corresponding web page at substantially a same time.
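The substantially concurrent scraping described in the abstract might be sketched as follows. This is a minimal illustration, not the patented implementation: the URLs, the `FAKE_PAGES` table, and the function names are hypothetical, and the network fetch is simulated with a short sleep so the example runs offline.

```python
import asyncio
import time

# Hypothetical stand-ins for real page fetches; a real scraper would issue
# HTTP requests and parse the returned HTML instead.
FAKE_PAGES = {
    "https://site-a.example/item/42": {"price": 19.99, "delay": 0.05},
    "https://site-b.example/item/42": {"price": 18.49, "delay": 0.02},
}


async def scrape_price(url: str) -> dict:
    """Simulate scraping one page and record when the data was captured."""
    page = FAKE_PAGES[url]
    await asyncio.sleep(page["delay"])  # simulated network latency
    return {"url": url, "price": page["price"], "scraped_at": time.monotonic()}


async def scrape_concurrently(urls: list) -> list:
    """Start all scrapes together so results reflect roughly the same moment."""
    return await asyncio.gather(*(scrape_price(u) for u in urls))


results = asyncio.run(scrape_concurrently(list(FAKE_PAGES)))
for r in results:
    print(r["url"], r["price"])
```

Because `asyncio.gather` launches every coroutine before awaiting any of them, the capture times differ only by each page's own latency rather than by the sum of all latencies, which is the practical meaning of "substantially a same time" in this sketch.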
20 Claims
1. A method comprising:
obtaining, by at least one processing device, information associated with a product, service, or event from each of two or more web pages associated with websites that list the product, service, or event;
determining, by the at least one processing device, that at least some of the information associated with the product, service, or event has changed at at least one of the two or more web pages;
in response to the determining that the at least some information associated with the product, service, or event has changed at the at least one of the two or more web pages, performing synchronized scraping, by the at least one processing device, based on the obtained information, the synchronized scraping performed concurrently from the two or more web pages to obtain scraped data of the same type for the same product, service, or event from each corresponding web page at a same time;
producing, by the at least one processing device, a comparison result based on a comparison of the scraped data for the same product, service, or event from each corresponding web page; and
presenting the comparison result on a graphical user interface.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. An apparatus comprising:
at least one processing device configured to:
obtain information associated with a product, service, or event from each of two or more web pages associated with websites that list the product, service, or event;
determine that at least some of the information associated with the product, service, or event has changed at at least one of the two or more web pages;
in response to the determination that the at least some information associated with the product, service, or event has changed at the at least one of the two or more web pages, perform synchronized scraping of data based on the obtained information, the synchronized scraping performed concurrently from the two or more web pages to obtain scraped data of the same type for the same product, service, or event from each corresponding web page at a same time;
produce a comparison result based on a comparison of the scraped data for the same product, service, or event from each corresponding web page; and
present the comparison result on a graphical user interface.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. A non-transitory computer readable storage medium comprising instructions that, when executed by at least one processing device, cause the at least one processing device to:
obtain information associated with a product, service, or event from each of two or more web pages associated with websites that list the product, service, or event;
determine that at least some of the information associated with the product, service, or event has changed at at least one of the two or more web pages;
in response to the determination that the at least some information associated with the product, service, or event has changed at the at least one of the two or more web pages, perform synchronized scraping of data based on the obtained information, the synchronized scraping performed concurrently from the two or more web pages to obtain scraped data of the same type for the same product, service, or event from each corresponding web page at a same time;
produce a comparison result based on a comparison of the scraped data for the same product, service, or event from each corresponding web page; and
present the comparison result on a graphical user interface.
Dependent claims: 18, 19, 20
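The sequence recited in the independent claims (detect a change, then scrape all pages together, then compare) could be illustrated roughly as below. Everything here is an assumption made for illustration: the `PAGES` snapshots, the hash-based `fingerprint` change check, the thread pool, and the use of price as the compared data type are not taken from the claims, and printing stands in for the graphical user interface.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor

# Hypothetical page snapshots keyed by URL; real code would fetch live HTML.
PAGES = {
    "https://site-a.example/item/42": {"name": "Widget", "price": 19.99},
    "https://site-b.example/item/42": {"name": "Widget", "price": 18.49},
}


def fingerprint(info: dict) -> str:
    """Hash the listing fields so a change can be detected cheaply."""
    canonical = repr(sorted(info.items()))
    return hashlib.sha256(canonical.encode()).hexdigest()


def changed_urls(previous: dict, pages: dict) -> list:
    """Return URLs whose current fingerprint differs from the stored one."""
    return [u for u, info in pages.items() if previous.get(u) != fingerprint(info)]


def scrape_price(url: str) -> tuple:
    """Stand-in scraper returning the same data type (price) for every page."""
    return url, PAGES[url]["price"]


def synchronized_compare(pages: dict) -> dict:
    """Scrape every page in parallel threads, then compare the prices."""
    with ThreadPoolExecutor(max_workers=len(pages)) as pool:
        prices = dict(pool.map(scrape_price, pages))
    cheapest = min(prices, key=prices.get)
    spread = round(max(prices.values()) - min(prices.values()), 2)
    return {"prices": prices, "cheapest": cheapest, "spread": spread}


# Record fingerprints, then simulate a price change on one site.
seen = {u: fingerprint(info) for u, info in PAGES.items()}
PAGES["https://site-b.example/item/42"]["price"] = 17.99

comparison = None
if changed_urls(seen, PAGES):  # a detected change triggers the synchronized scrape
    comparison = synchronized_compare(PAGES)
    print(comparison)
```

Note that a change on a single page triggers a fresh scrape of every page, not only the one that changed, so the comparison is built from data captured together.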
Specification