SYSTEM FOR DATABASE DATA QUALITY PROCESSING
First Claim
Patent Images
1. A system for determining data quality issues in a database, the system comprising:
- a computer hardware processor in a physical computing device, the computer hardware processor being configured to;
receive a data set comprising first data and second data;
apply a first quality assignment rule to the first data to determine;
(i) that a first value corresponding to the first data exceeds a first threshold, and (ii) a first score for the first data;
apply the first quality assignment rule to the second data to determine;
(i) that a second value corresponding to the second data exceeds a second threshold, and (ii) a second score for the second data;
apply a second quality assignment rule to the first data to determine;
(i) that a third value corresponding to the first data exceeds a third threshold, and (ii) an updated first score, from the first score, for the first data;
apply the second quality assignment rule to the second data to determine that a fourth value corresponding to the second data does not exceed the third threshold;
determine a subset of the data set based at least on the updated first score and the second score, wherein the subset of the data set does not include the first data; and
cause presentation, in a user interface, of the subset of the data set.
3 Assignments
0 Petitions
Accused Products
Abstract
In an embodiment, a system can determine potential data quality issues in a database. The system applies quality assignment rules to a data set. The quality assignment rules access data from the data set or calculate one or more values from data entries of the data set. Data entries or determined values that satisfy the quality assignment rules receive one or more scores. The system then presents a subset of the data set based on the determined one or more scores. Accordingly, a user of the system can determine the source of the data quality issues such as a broken or miscalibrated data gathering device.
44 Citations
20 Claims
-
1. A system for determining data quality issues in a database, the system comprising:
a computer hardware processor in a physical computing device, the computer hardware processor being configured to; receive a data set comprising first data and second data; apply a first quality assignment rule to the first data to determine;
(i) that a first value corresponding to the first data exceeds a first threshold, and (ii) a first score for the first data;apply the first quality assignment rule to the second data to determine;
(i) that a second value corresponding to the second data exceeds a second threshold, and (ii) a second score for the second data;apply a second quality assignment rule to the first data to determine;
(i) that a third value corresponding to the first data exceeds a third threshold, and (ii) an updated first score, from the first score, for the first data;apply the second quality assignment rule to the second data to determine that a fourth value corresponding to the second data does not exceed the third threshold; determine a subset of the data set based at least on the updated first score and the second score, wherein the subset of the data set does not include the first data; and cause presentation, in a user interface, of the subset of the data set. - View Dependent Claims (2, 3, 4, 5, 6)
-
7. A method for determining data quality issues with fleet vehicle operation information, the method comprising:
-
receiving vehicle telematics data comprising first data and second data, the first data corresponding to a first vehicle and the second data corresponding to a second vehicle; applying a first quality assignment rule to the first data to determine;
(i) that a first value corresponding to the first data exceeds a first threshold, and (ii) a first score for the first data;applying the first quality assignment rule to the second data to determine;
(i) that a second value corresponding to the second data exceeds a second threshold, and (ii) a second score for the second data;applying a second quality assignment rule to the first data to determine;
(i) that a third value corresponding to the first data exceeds a third threshold, and (ii) an updated first score, from the first score, for the first data;applying the second quality assignment rule to the second data to determine that a fourth value corresponding to the second data does not exceed the third threshold; determining a subset of the vehicle telematics data based at least on the updated first score and the second score, wherein the subset of the vehicle telematics data does not include the first data; and causing presentation, in a user interface, of the subset of the vehicle telematics data. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A system for determining data quality issues with fleet vehicle operation information, the system comprising:
a computer hardware processor in a physical computing device, the computer hardware processor being configured to; receive vehicle telematics data, the vehicle telematics data comprising first data and second data; apply a first quality assignment rule to the first data to determine;
(i) that a first value corresponding to the first data exceeds a first threshold, and (ii) a first score for the first data;apply the first quality assignment rule to the second data to determine;
(i) that a second value corresponding to the second data exceeds a second threshold, and (ii) a second score for the second data;apply a second quality assignment rule to the first data to determine;
(i) that a third value corresponding to the first data exceeds a third threshold, and (ii) an updated first score, from the first score, for the first data;apply the second quality assignment rule to the second data to determine that a fourth value corresponding to the second data does not exceed the third threshold; determine a subset of the vehicle telematics data based at least on the updated first score and the second score, wherein the subset of the vehicle telematics data does not include the first data; and cause presentation, in a user interface, of the subset of the vehicle telematics data. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
Specification