Method and apparatus for retrieving and processing data
First Claim
Patent Images
1. A method comprising:
- capturing a web page from a web site;
extracting data from the web page using a data harvesting script;
normalizing the extracted data; and
storing the normalized data in a database.
4 Assignments
0 Petitions
Accused Products
Abstract
Data is captured from a web site or other data source. Data is extracted from the web page using a data harvesting script or other data acquisition routine. The extracted data is then normalized and stored in a database. If data cannot be extracted from the web page, a copy of the captured web page is stored without personal information contained in the web page. The data harvesting script is then edited based on an analysis of the captured web page.
-
Citations
16 Claims
-
1. A method comprising:
-
capturing a web page from a web site;
extracting data from the web page using a data harvesting script;
normalizing the extracted data; and
storing the normalized data in a database. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method comprising:
-
capturing a web page from a web site;
attempting to extract data from the web page using a data harvesting script;
removing personal information from the captured web page;
storing the captured web page without the personal information; and
if data cannot be extracted from the web page, analyzing the web page and the data harvesting script to determine why data could not be extracted from the web page. - View Dependent Claims (10, 11, 12, 13)
-
-
14. An apparatus comprising:
-
a data capture module configured to capture a web page from a web site associated with a financial institution;
a data extraction module coupled to the data capture module and configured to extract data from the captured web page using a data harvesting script, the data extraction module further configured to normalize the extracted data; and
a database control module coupled to the data extraction module and configured to store the normalized data in a database. - View Dependent Claims (15, 16)
-
Specification