Method of conducting data quality analysis
First Claim
Patent Images
1. A method of analyzing data quality comprising the steps of:
- profiling source data;
performing metadata level analysis and creating quality tags to identify problems with metadata;
performing data content level analysis and creating quality tags to identify problems with data;
generating at least one report describing at least a portion of the identified metadata and data problems.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for creating a data quality report for a given set of source data. The source data is profiled and then analysis is preferably performed at the relation level, metadata level, and data content level analysis. Any inconsistencies noted during analysis are noted with quality tags preferably comprising a common status and a type describing the category of the identified inconsistency. Reports are then generated that describe and summarize the information contained in the quality tags created during the analysis.
-
Citations
17 Claims
-
1. A method of analyzing data quality comprising the steps of:
-
profiling source data;
performing metadata level analysis and creating quality tags to identify problems with metadata;
performing data content level analysis and creating quality tags to identify problems with data;
generating at least one report describing at least a portion of the identified metadata and data problems. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A method for analyzing data quality for a given set of source data, the method comprising:
-
a. profiling source data;
b. performing relation analysis comprising;
i. creating a catalog;
ii. importing metadata into the catalog from a file characterized by a file structure and a file encoding;
iii. comparing the source data with the file structure and noting inconsistencies with at least one quality tag; and
iv. comparing the source data with the file encoding and noting inconsistencies with at least one quality tag;
c. performing metadata analysis comprising;
i. opening an attribute list for the source data; and
ii. comparing the attribute list to the metadata and noting inconsistencies with at least one quality tag;
d. performing data content analysis comprising;
i. opening an attribute list for the source data;
ii. reviewing source data patterns and noting data pattern inconsistencies with at least one quality tag; and
iii. reviewing the source data values and noting inconsistencies with at least one quality tag;
e. generating reports comprising;
i. exporting the catalog to a repository; and
ii. executing report generation commands.
-
Specification