Systems and methods for providing data quality management
First Claim
1. A system for providing data quality management, the system comprising:
a memory storing instructions; and
a processor connected to a network and configured to execute the instructions to:
extract a plurality of first data elements from a data source;
generate a data profile based on the first data elements;
automatically create a first set of rules based on the first data elements and the data profile, the first set of rules assessing data quality according to a threshold;
generate a second set of rules based on the first data elements and the first set of rules;
extract a plurality of second data elements;
assess the second data elements based on a comparison of the second data elements to the second set of rules;
receive a request from a user to adjust settings, the adjusted settings influencing identification of a node with a concentration of defects;
detect one or more defects based on the comparison and the user request, at least one of the detected one or more defects including an event;
cluster the assessed data elements into multiple segments according to the detected one or more defects in the assessed data elements, the multiple segments corresponding to data quality scores;
analyze, using a decision tree algorithm, data quality according to the detected one or more defects to determine a pocket of defect concentration and aggregate data quality; and
transmit signals representing the data quality scores and the data quality analysis to a client device for display to a user.
Abstract
A system for providing data quality management may include a processor configured to execute instructions to: extract a plurality of first data elements from a data source; generate a data profile based on the first data elements; automatically create a first set of rules based on the first data elements and the data profile, the first set of rules assessing data quality according to a threshold; generate a second set of rules based on the first data elements and the first set of rules; extract a plurality of second data elements; assess the second data elements based on a comparison of the second data elements to the second set of rules; detect defects based on the comparison; analyze data quality according to the detected defects; and transmit signals representing the data quality analysis to a client device for display to a user.
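The profiling, rule-generation, and assessment steps summarized above can be illustrated with a minimal sketch. The abstract does not say how the profile or either rule set is computed, so everything here is an assumption made for illustration: the function names (`build_profile`, `create_rules`, `refine_rules`, `assess`), the use of mean and standard deviation as the profile, range checks as the rule form, and re-deriving the range from passing values as the "second set of rules".

```python
from statistics import mean, pstdev

def build_profile(records):
    """Hypothetical data profile: completeness, mean, and stdev per numeric field.
    Assumes `records` is a non-empty list of dicts."""
    fields = sorted({k for r in records for k in r})
    profile = {}
    for f in fields:
        present = [r[f] for r in records if r.get(f) is not None]
        profile[f] = {
            "completeness": len(present) / len(records),
            "mean": mean(present) if present else None,
            "stdev": pstdev(present) if present else None,
        }
    return profile

def create_rules(profile, threshold=3.0):
    """First rule set (assumed form): accept values within `threshold`
    standard deviations of the profiled mean."""
    rules = {}
    for f, p in profile.items():
        if p["mean"] is not None:
            spread = threshold * p["stdev"]
            rules[f] = (p["mean"] - spread, p["mean"] + spread)
    return rules

def refine_rules(records, rules):
    """Second rule set (assumed form): tighten each range to the min and max
    of the first data elements that pass the first rule set."""
    refined = {}
    for f, (lo, hi) in rules.items():
        passing = [r[f] for r in records
                   if r.get(f) is not None and lo <= r[f] <= hi]
        refined[f] = (min(passing), max(passing)) if passing else (lo, hi)
    return refined

def assess(records, rules):
    """Compare records to the rules; return (index, field, reason) per defect."""
    defects = []
    for i, r in enumerate(records):
        for f, (lo, hi) in rules.items():
            value = r.get(f)
            if value is None:
                defects.append((i, f, "missing"))
            elif not lo <= value <= hi:
                defects.append((i, f, "out_of_range"))
    return defects

if __name__ == "__main__":
    first = [{"amount": v} for v in (10.0, 12.0, 11.0, 9.0, 500.0)]
    first_rules = create_rules(build_profile(first), threshold=1.0)
    second_rules = refine_rules(first, first_rules)
    second = [{"amount": 11.5}, {"amount": 40.0}, {"amount": None}]
    print(second_rules, assess(second, second_rules))
```

In this sketch the `threshold` parameter plays the role of the threshold against which the first rule set assesses quality; a real system would likely profile many more properties (types, patterns, cardinality) than the two statistics used here.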
20 Claims
1. A system for providing data quality management, the system comprising:
a memory storing instructions; and
a processor connected to a network and configured to execute the instructions to:
extract a plurality of first data elements from a data source;
generate a data profile based on the first data elements;
automatically create a first set of rules based on the first data elements and the data profile, the first set of rules assessing data quality according to a threshold;
generate a second set of rules based on the first data elements and the first set of rules;
extract a plurality of second data elements;
assess the second data elements based on a comparison of the second data elements to the second set of rules;
receive a request from a user to adjust settings, the adjusted settings influencing identification of a node with a concentration of defects;
detect one or more defects based on the comparison and the user request, at least one of the detected one or more defects including an event;
cluster the assessed data elements into multiple segments according to the detected one or more defects in the assessed data elements, the multiple segments corresponding to data quality scores;
analyze, using a decision tree algorithm, data quality according to the detected one or more defects to determine a pocket of defect concentration and aggregate data quality; and
transmit signals representing the data quality scores and the data quality analysis to a client device for display to a user.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A method for providing data quality management, the method comprising the following operations performed by a processor:
extracting a plurality of first data elements from a data source;
generating a data profile based on the first data elements;
automatically creating a first set of rules based on the first data elements and the data profile, the first set of rules assessing data quality according to a threshold;
generating a second set of rules based on the first data elements and the first set of rules;
extracting a plurality of second data elements;
assessing the second data elements based on a comparison of the second data elements to the second set of rules;
receiving a request from a user to adjust settings, the adjusted settings influencing identification of nodes with a concentration of defects;
detecting one or more defects based on the comparison and the user request, at least one of the detected one or more defects including an event;
clustering the assessed data elements into multiple segments according to the detected one or more defects in the assessed data elements, the multiple segments corresponding to data quality scores;
analyzing, using a decision tree algorithm, data quality according to the detected one or more defects to determine a pocket of defect concentration and aggregate data quality; and
transmitting signals representing the data quality scores and the data quality analysis to a client device for display to a user.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. A non-transitory computer-readable medium storing instructions executable by a processor to perform a method for providing data quality management, the method comprising:
extracting a plurality of first data elements from a data source;
generating a data profile based on the first data elements;
automatically creating a first set of rules based on the first data elements and the data profile, the first set of rules assessing data quality according to a threshold;
generating a second set of rules based on the first data elements and the first set of rules;
extracting a plurality of second data elements;
assessing the second data elements based on a comparison of the second data elements to the second set of rules;
receiving a request from a user to adjust settings, the adjusted settings influencing identification of nodes with a concentration of defects;
detecting one or more defects based on the comparison;
clustering the assessed data elements into multiple segments according to the detected one or more defects in the assessed data elements, the multiple segments corresponding to data quality scores;
analyzing, using a decision tree algorithm, data quality according to the detected one or more defects to determine a pocket of defect concentration and aggregate data quality; and
transmitting signals representing the data quality scores and the data quality analysis to a client device for display to a user.
Dependent claims: 18, 19, 20
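The decision-tree step recited in the independent claims, together with the user-adjustable settings that influence which node is identified, can be sketched as follows. This is only an illustration under stated assumptions: the claims do not specify the splitting criterion, so Gini impurity is assumed; `max_depth` and `min_leaf` stand in for the adjustable settings; and the record layout (categorical attributes plus a boolean `defect` flag) and all function names are hypothetical.

```python
def gini(rows):
    """Gini impurity of the defect flag in a non-empty group of rows."""
    p = sum(r["defect"] for r in rows) / len(rows)
    return 2 * p * (1 - p)

def best_split(rows, attrs, min_leaf):
    """Find the attr == value split with the lowest weighted impurity.
    Splits that leave fewer than `min_leaf` rows on either side are skipped."""
    best = None
    for a in attrs:
        for v in sorted({r[a] for r in rows}):
            left = [r for r in rows if r[a] == v]
            right = [r for r in rows if r[a] != v]
            if len(left) < min_leaf or len(right) < min_leaf:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(rows)
            if best is None or score < best[0]:
                best = (score, a, v, left, right)
    return best

def find_leaves(rows, attrs, max_depth=2, min_leaf=2, path=()):
    """Grow a small tree; return each leaf as (path, defect_rate, size)."""
    split = best_split(rows, attrs, min_leaf) if max_depth > 0 else None
    if split is None or split[0] >= gini(rows):
        rate = sum(r["defect"] for r in rows) / len(rows)
        return [(path, rate, len(rows))]
    _, a, v, left, right = split
    return (find_leaves(left, attrs, max_depth - 1, min_leaf, path + ((a, "==", v),))
            + find_leaves(right, attrs, max_depth - 1, min_leaf, path + ((a, "!=", v),)))

def densest_pocket(rows, attrs, **settings):
    """Leaf with the highest defect rate; ties go to the larger leaf."""
    return max(find_leaves(rows, attrs, **settings), key=lambda leaf: (leaf[1], leaf[2]))

def make_rows():
    """Toy assessed data: defects concentrate in source A, region EU."""
    groups = [("A", "EU", True, 3), ("A", "US", False, 2),
              ("B", "EU", False, 3), ("B", "US", False, 2)]
    return [{"source": s, "region": g, "defect": d}
            for s, g, d, n in groups for _ in range(n)]

if __name__ == "__main__":
    rows = make_rows()
    print(densest_pocket(rows, ["source", "region"]))
    print(densest_pocket(rows, ["source", "region"], min_leaf=3))
```

Changing `min_leaf` changes which node is reported as the pocket, which is one plausible reading of how adjusted settings could influence identification of a node with a concentration of defects. The clustering of assessed elements into scored segments is not covered by this sketch.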
Specification