System and method for cleansing enterprise data
Abstract
A method, system, and article of manufacture for cleansing an enterprise database extracts data from an enterprise database, extracts reference data from a most-trusted database, determines discords between the data extracted from the enterprise database and the reference data extracted from the most-trusted database in accordance with a set of business rules, modifies the data extracted from the enterprise database based on the discords, and stores the modified data in the enterprise database. The data may be modified automatically with or without manual intervention. Discords are preferably determined on an atomic level.
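As an illustration only, determining discords "on an atomic level" can be read as comparing individual fields of each enterprise record against the corresponding most-trusted record, with one rule per field. The sketch below assumes records keyed by a shared identifier and rules expressed as simple agreement predicates; the names `Discord`, `find_discords`, and the sample circuit fields are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Record = Dict[str, str]
# A rule returns True when the enterprise value agrees with the reference value.
Rule = Callable[[str, str], bool]


@dataclass(frozen=True)
class Discord:
    key: str               # identifier shared by both databases (assumed)
    field: str             # the single field that disagrees
    enterprise_value: str
    reference_value: str


def find_discords(
    enterprise: Dict[str, Record],
    reference: Dict[str, Record],
    rules: Dict[str, Rule],
) -> List[Discord]:
    """Compare each enterprise field with the reference field, one rule per field."""
    discords: List[Discord] = []
    for key, ent_rec in enterprise.items():
        ref_rec = reference.get(key)
        if ref_rec is None:
            continue  # no reference record to verify against
        for field, agrees in rules.items():
            ent_val = ent_rec.get(field, "")
            ref_val = ref_rec.get(field, "")
            if not agrees(ent_val, ref_val):
                discords.append(Discord(key, field, ent_val, ref_val))
    return discords


if __name__ == "__main__":
    enterprise = {"CKT-1": {"speed": "ds3", "site": "NYC"}}
    reference = {"CKT-1": {"speed": "DS3", "site": "BOS"}}
    rules = {
        "speed": lambda a, b: a.lower() == b.lower(),  # case-insensitive rule
        "site": lambda a, b: a == b,                   # exact-match rule
    }
    for d in find_discords(enterprise, reference, rules):
        print(d)  # only the "site" field is reported as a discord
```

Keeping each discord tied to a single field, rather than to a whole record, is what lets a later step correct one value without touching the rest of the record.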
18 Claims
1. A method of cleansing a plurality of enterprise databases comprising:
- extracting data from a plurality of enterprise databases;
- extracting reference data from a most-trusted database, the reference data being associated with resources in a service provider network;
- generating at least one of a cross-reference table and mapping information from metadata associated with the reference data extracted from the enterprise database and metadata associated with data extracted from the most-trusted database;
- accumulating the data extracted from the plurality of enterprise databases and the reference data extracted from the most-trusted database in an aggregate extracted database, from which discords are determined, the aggregate extracted database being independent of the plurality of enterprise databases and the most-trusted database;
- formatting the data extracted from the plurality of enterprise databases in a specified format;
- determining discords between the data extracted from the plurality of enterprise databases and the reference data extracted from the most-trusted database in accordance with a set of rules, the determining using the reference data extracted from the most-trusted database as a reference against which the data extracted from the enterprise database is verified;
- determining discords between the data extracted from the plurality of enterprise databases and industry standard data from an industry standard database, the data extracted from the plurality of enterprise databases being verified against the industry standard data of the industry standard database;
- determining whether at least one of the discords requires manual intervention or whether an automatic system correction process is configured to automatically correct the discords;
- modifying the data extracted from the plurality of enterprise databases automatically based on the discords and in response to a determination that the automatic system correction process is configured to automatically correct the discords;
- reformatting the data extracted from the enterprise database to an original format after modifying the data based on the discords; and
- storing the modified data in the plurality of enterprise databases.
Dependent claims: 2, 3, 4, 5, 6, 7, 18.
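The formatting, triage, automatic correction, and reformatting steps recited in claim 1 might be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the pipe-delimited "original format", the `COLUMNS` order, the `auto_fields` set that stands in for the automatic system correction configuration, and all function names are hypothetical.

```python
from typing import Dict, List, NamedTuple, Set, Tuple

# Hypothetical column order of the enterprise record's original format.
COLUMNS = ["circuit_id", "speed", "site"]


class Discord(NamedTuple):
    field: str
    enterprise_value: str
    reference_value: str


def to_specified(line: str) -> Dict[str, str]:
    """Format step: pipe-delimited line (assumed original format) to a dict."""
    return dict(zip(COLUMNS, line.split("|")))


def to_original(record: Dict[str, str]) -> str:
    """Reformat step: dict back to the assumed pipe-delimited original format."""
    return "|".join(record[c] for c in COLUMNS)


def find_discords(record: Dict[str, str], reference: Dict[str, str]) -> List[Discord]:
    """Field-by-field comparison against the most-trusted reference record."""
    return [
        Discord(f, record[f], reference[f])
        for f in COLUMNS
        if record[f] != reference[f]
    ]


def cleanse_line(
    line: str, reference: Dict[str, str], auto_fields: Set[str]
) -> Tuple[str, List[Discord]]:
    """Correct auto-correctable discords; return the rest for manual intervention."""
    record = to_specified(line)
    manual: List[Discord] = []
    for d in find_discords(record, reference):
        if d.field in auto_fields:
            record[d.field] = d.reference_value  # automatic correction
        else:
            manual.append(d)                     # needs manual intervention
    return to_original(record), manual


if __name__ == "__main__":
    reference = {"circuit_id": "CKT-1", "speed": "DS3", "site": "BOS"}
    cleaned, manual = cleanse_line("CKT-1|DS1|NYC", reference, {"speed"})
    print(cleaned)  # CKT-1|DS3|NYC
    print(manual)   # the "site" discord is left for manual review
```

The cleansed line is what would be stored back in the enterprise database; discords returned in the second element are left untouched until someone resolves them.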
8. A system for cleansing a plurality of enterprise databases comprising:
- a plurality of enterprise databases holding enterprise data;
- a most-trusted database holding reference data being associated with resources in a service provider network; and
- a computing device, the computing device configured to:
  - extract enterprise data from the plurality of enterprise databases;
  - extract the reference data from the most-trusted database;
  - generate at least one of a cross-reference table and mapping information from metadata associated with the data extracted from the plurality of enterprise databases and metadata associated with reference data extracted from the most-trusted database;
  - accumulate the data extracted from the plurality of enterprise databases and the reference data extracted from the most-trusted database in an aggregate extracted database, from which discords are determined, the aggregate extracted database being independent of the plurality of enterprise databases and the most-trusted database;
  - format the data extracted from the plurality of enterprise databases in a specified format;
  - determine discords between the enterprise data extracted from the plurality of enterprise databases and the reference data extracted from the most-trusted database in accordance with a set of rules, the computing device determining discords using the reference data extracted from the most-trusted database as a reference against which the enterprise data extracted from the plurality of enterprise databases is verified;
  - determine discords between the data extracted from the plurality of enterprise databases and industry standard data from an industry standard database, the data extracted from the plurality of enterprise databases being verified against the industry standard data of the industry standard database;
  - determine whether at least one of the discords requires manual intervention or whether an automatic system correction process is configured to automatically correct the discords;
  - modify the enterprise data extracted from the plurality of enterprise databases automatically based on the discords and in response to a determination that the automatic system correction process is configured to automatically correct the discords;
  - reformat the data extracted from the plurality of enterprise databases to an original format after modifying the data based on the discords; and
  - store the modified enterprise data in the plurality of enterprise databases.
Dependent claims: 9, 10, 11, 12, 13, 14, 15, 16.
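One plausible reading of generating a cross-reference table from metadata is to match column names from an enterprise database against column names from the most-trusted database after normalizing them. The sketch below assumes the metadata is simply a list of column names per database and that names differ only in case and punctuation; `canonical` and `build_cross_reference` are hypothetical helpers, and the claims do not specify a matching technique.

```python
import re
from typing import Dict, Iterable


def canonical(name: str) -> str:
    """Normalize a column name: lowercase, letters and digits only."""
    return re.sub(r"[^a-z0-9]", "", name.lower())


def build_cross_reference(
    enterprise_columns: Iterable[str], reference_columns: Iterable[str]
) -> Dict[str, str]:
    """Map each enterprise column to the reference column with the same canonical name."""
    by_canonical = {canonical(c): c for c in reference_columns}
    return {
        c: by_canonical[canonical(c)]
        for c in enterprise_columns
        if canonical(c) in by_canonical
    }


if __name__ == "__main__":
    xref = build_cross_reference(
        ["Circuit_ID", "SPEED", "notes"],
        ["circuit_id", "speed", "site_code"],
    )
    print(xref)  # {'Circuit_ID': 'circuit_id', 'SPEED': 'speed'}
```

Columns without a counterpart, such as `notes` here, are simply left out of the table and would not take part in discord determination.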
17. An article of manufacture for cleansing a plurality of enterprise databases comprising:
- a machine-readable medium comprising at least one program which when executed implements the steps of:
  - extracting data from a plurality of enterprise databases;
  - extracting reference data from a most-trusted database, the reference data being associated with resources in a service provider network;
  - generating at least one of a cross-reference table and mapping information from metadata associated with the reference data extracted from the plurality of enterprise databases and metadata associated with data extracted from the most-trusted database;
  - accumulating the data extracted from the plurality of enterprise databases and the reference data extracted from the most-trusted database in an aggregate extracted database, from which discords are determined, the aggregate extracted database being independent of the plurality of enterprise databases and the most-trusted database;
  - formatting the data extracted from the plurality of enterprise databases to a specified format;
  - determining discords between the data extracted from the plurality of enterprise databases and the reference data extracted from the most-trusted database in accordance with a set of rules, the determining using the reference data extracted from the most-trusted database as a reference against which the data extracted from the plurality of enterprise databases is verified;
  - determining discords between the data extracted from the plurality of enterprise databases and industry standard data from an industry standard database, the data extracted from the plurality of enterprise databases being verified against the industry standard data of the industry standard database;
  - determining whether at least one of the discords requires manual intervention or whether an automatic system correction process is configured to automatically correct the discords;
  - modifying the data extracted from the plurality of enterprise databases automatically based on the discords and in response to a determination that the automatic system correction process is configured to automatically correct the discords;
  - reformatting the data extracted from the plurality of enterprise databases to an original format after modifying the data based on the discords; and
  - storing the modified data in the plurality of enterprise databases.
Specification