Systems and methods for validating data
Abstract
Systems and methods are provided for validating data in a data set. A data set including data to validate, and a validator to use in validating the data, are selected based on user input generated through a user's interactions with a graphical user interface. The validator is applied to the data to determine whether one or more statistics generated through application of the validator to the data are valid or invalid based on a validation routine associated with the validator. A data quality report indicating whether the data set is valid or invalid, based on a determination of whether the one or more statistics are valid or invalid, is generated and selectively presented to the user through the graphical user interface.
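The flow described in the abstract can be illustrated with a minimal sketch. Everything named here is hypothetical and chosen only for illustration: the `Validator` and `DataQualityReport` classes, the `apply_validator` function, the `max_mean` parameter, and the choice of the mean as the statistic. The source does not specify any particular statistic, parameter, or data structure.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable, Dict, List


@dataclass
class Validator:
    """Hypothetical validator: computes statistics, then checks each one."""
    name: str
    parameters: Dict[str, float]  # validation parameters (assumed numeric)
    compute: Callable[[List[float], Dict[str, float]], Dict[str, float]]
    routine: Callable[[str, float, Dict[str, float]], bool]  # validation routine


@dataclass
class DataQualityReport:
    data_set: str
    results: Dict[str, bool]  # statistic name -> whether it passed

    @property
    def valid(self) -> bool:
        # The data set counts as invalid if any single statistic fails.
        return all(self.results.values())


def apply_validator(name: str, data: List[float], v: Validator) -> DataQualityReport:
    """Generate statistics with the validator, then check each with its routine."""
    stats = v.compute(data, v.parameters)
    results = {k: v.routine(k, val, v.parameters) for k, val in stats.items()}
    return DataQualityReport(name, results)


# Illustrative validator (an assumption): the mean must not exceed max_mean.
mean_check = Validator(
    name="mean_below_max",
    parameters={"max_mean": 10.0},
    compute=lambda data, p: {"mean": mean(data)},
    routine=lambda stat, value, p: value <= p["max_mean"],
)

report = apply_validator("orders", [2.0, 4.0, 6.0], mean_check)
print(report.valid)  # prints True: the mean is 4.0, below the assumed limit
```

In this sketch the report derives validity from the per-statistic results, so a single failing statistic is enough to mark the whole data set invalid, which mirrors the flagging condition in the claims.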
18 Claims
1. A system for validating data comprising:
one or more processors and a memory storing instructions that, when executed by the one or more processors, cause the system to:
select a data set including data to validate based on user input received from a user in interacting with a graphical user interface;
select a validator including at least one validation parameter to use in validating the data in the data set based on the user input;
apply the validator to the data in the data set to determine whether one or more statistics generated through application of the validator to the data in the data set using the at least one validation parameter is valid according to at least one validation routine associated with the validator;
flag the data set as invalid if at least one of the one or more statistics generated through application of the validator are determined to be invalid according to the at least one validation routine;
generate a data quality report for the data set indicating whether the data set is valid or invalid based on a determination of whether the one or more statistics are valid according to the at least one validation routine;
selectively present the data quality report to the user through the graphical user interface;
re-apply the validator to the data in the data set to determine whether the one or more statistics generated through application of the validator to the data in the data set using the at least one validation parameter are valid according to the at least one validation routine;
generate another data quality report for the data indicating whether the data set is valid or invalid based on a determination made through re-application of the validator to the data in the data set of whether the one or more statistics are valid according to the at least one validation routine; and
selectively present the another data quality report to the user through the graphical user interface.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A method being implemented by a computing system including one or more physical processors and storage media storing machine-readable instructions, the method comprising:
selecting a data set including data to validate based on user input received from a user in interacting with a graphical user interface;
selecting a validator including at least one validation parameter to use in validating the data in the data set based on the user input;
applying the validator to the data in the data set to determine whether one or more statistics generated through application of the validator to the data in the data set using the at least one validation parameter is valid according to at least one validation routine associated with the validator;
flagging the data set as invalid if at least one of the one or more statistics generated through application of the validator are determined to be invalid according to the at least one validation routine;
generating a data quality report for the data set indicating whether the data set is valid or invalid based on a determination of whether the one or more statistics are valid according to the at least one validation routine;
selectively presenting the data quality report to the user through the graphical user interface;
re-applying the validator to the data in the data set to determine whether the one or more statistics generated through application of the validator to the data in the data set using the at least one validation parameter are valid according to the at least one validation routine;
generating another data quality report for the data indicating whether the data set is valid or invalid based on a determination made through re-application of the validator to the data in the data set of whether the one or more statistics are valid according to the at least one validation routine; and
selectively presenting the another data quality report to the user through the graphical user interface.
Dependent claims: 11, 12, 13, 14, 15, 16, 17
18. A non-transitory computer readable medium comprising instructions that, when executed, cause one or more processors to perform:
selecting a data set including data to validate based on user input received from a user in interacting with a graphical user interface;
selecting a validator including at least one validation parameter to use in validating the data in the data set based on the user input;
applying the validator to the data in the data set to determine whether one or more statistics generated through application of the validator to the data in the data set using the at least one validation parameter is valid according to at least one validation routine associated with the validator;
flagging the data set as invalid if at least one of the one or more statistics generated through application of the validator are determined to be invalid according to the at least one validation routine;
generating a data quality report for the data set indicating whether the data set is valid or invalid based on a determination of whether the one or more statistics are valid according to the at least one validation routine;
selectively presenting the data quality report to the user through the graphical user interface;
re-applying the validator to the data in the data set to determine whether the one or more statistics generated through application of the validator to the data in the data set using the at least one validation parameter are valid according to the at least one validation routine;
generating another data quality report for the data indicating whether the data set is valid or invalid based on a determination made through re-application of the validator to the data in the data set of whether the one or more statistics are valid according to the at least one validation routine; and
selectively presenting the another data quality report to the user through the graphical user interface.
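The re-application and selective presentation steps recited in the independent claims can be sketched as follows. This is only an illustration under stated assumptions: the function names `run_validation` and `present`, the `row_count` and `max` statistics, the thresholds, and the reading of "selectively present" as "show only invalid reports by default" are all hypothetical. The claims also do not say why the data differs between applications; the sketch simply assumes it was corrected in between.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical types: a statistic over the data and a check on its value.
Stat = Callable[[List[float]], float]
Check = Callable[[float], bool]


def run_validation(data: List[float], stats: Dict[str, Stat],
                   checks: Dict[str, Check]) -> Dict[str, object]:
    """Compute each statistic, check it, and build a report dictionary."""
    failed = [name for name, fn in stats.items() if not checks[name](fn(data))]
    return {"valid": not failed, "failed_statistics": failed}


def present(report: Dict[str, object], only_if_invalid: bool = True) -> Optional[str]:
    """Selective presentation, assumed here to mean hiding valid reports."""
    if only_if_invalid and report["valid"]:
        return None
    status = "valid" if report["valid"] else "invalid"
    return f"Data set is {status}; failed: {report['failed_statistics']}"


# Illustrative statistics and thresholds (assumptions, not from the source).
stats = {"row_count": lambda d: float(len(d)), "max": max}
checks = {"row_count": lambda v: v >= 3, "max": lambda v: v <= 100.0}

data = [5.0, 250.0]
first = run_validation(data, stats, checks)   # initial application

data = [5.0, 42.0, 7.0]                       # assumed: data corrected in between
second = run_validation(data, stats, checks)  # validator re-applied

print(present(first))
print(present(second))
```

Each application produces its own report, so the second report reflects the state of the data at re-application time rather than mutating the first.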
Specification