Method and apparatus for retrieving and processing data
First Claim
Patent Images
1. A method comprising:
- capturing a web page from a web site;
extracting data from the web page using a data harvesting script;
normalizing the extracted data; and
storing the normalized data in a database.
5 Assignments
0 Petitions
Accused Products
Abstract
Data is captured from a web site or other data source. Data is extracted from the web page using a data harvesting script or other data acquisition routine. The extracted data is then normalized and stored in a database. If data cannot be extracted from the web page, a copy of the captured web page is stored without personal information contained in the web page. The data harvesting script is then edited based on an analysis of the captured web page.
-
Citations
27 Claims
-
1. A method comprising:
-
capturing a web page from a web site;
extracting data from the web page using a data harvesting script;
normalizing the extracted data; and
storing the normalized data in a database. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method comprising:
-
retrieving financial data associated with a user'"'"'s financial account from a data source;
identifying data of interest retrieved from the data source;
normalizing the identified data; and
storing the normalized data in a database. - View Dependent Claims (10, 11)
-
-
12. A method comprising:
-
capturing a web page from a web site;
attempting to extract data from the web page using a data harvesting script;
removing personal information from the captured web page;
storing the captured web page without the personal information; and
if data cannot be extracted from the web page, analyzing the web page and the data harvesting script to determine why data could not be extracted from the web page. - View Dependent Claims (13, 14, 15, 16)
-
-
17. A method comprising:
-
capturing a first web page from a first financial institution web site;
capturing a second web page from a second financial institution web site;
extracting data from the first web page using a first data harvesting script;
extracting data from the second web page using a second data harvesting script;
normalizing the data extracted from the first web page and the second web page; and
storing the normalized data in a database. - View Dependent Claims (18, 19, 20, 21)
-
-
22. An apparatus comprising:
-
a data capture module configured to capture a web page from a web site associated with a financial institution;
a data extraction module coupled to the data capture module and configured to extract data from the captured web page using a data harvesting script, the data extraction module further configured to normalize the extracted data; and
a database control module coupled to the data extraction module and configured to store the normalized data in a database. - View Dependent Claims (23, 24)
-
-
25. One or more computer readable media having stored thereon a plurality of instructions that, when executed by a processor, causes the processor to perform acts comprising:
-
capturing a web page from a financial institution web site;
attempting to extract data from the captured web page using a data harvesting script;
removing personal information from the captured web page;
storing the captured web page without the personal information; and
if data cannot be extracted from the web page, analyzing the web page to determine why data could not be extracted from the web page. - View Dependent Claims (26, 27)
-
Specification