Automated system and method of data scrubbing
First Claim
Patent Images
1. A method for performing data scrubbing at attribute level, comprising:
- receiving, by a processor, data containing at least one significant attribute or at least one non-significant attribute and associated values from distributed data sources, the data sources being assigned weight against each value of the at least one significant attribute or the at least one non-significant attribute; and
applying, by the processor, a ranking matrix process to the received data, the ranking matrix process comprising;
for sources referring to different values for a significant attribute or a non-significant attribute, computing a combined weight therefrom;
in response to the combined weight of the significant attribute exceeding a predetermined promotion threshold value, determining if the combined weight exceeds a predetermined confirmation threshold value;
in response to the combined weight of the non-significant attribute and the significant attribute exceeding the predetermined promotion threshold value and the predetermined confirmation threshold value respectively, promoting values associated with the non-significant attribute and the significant attribute to a final templated copy;
in response to the combined weight of the significant attribute or the non-significant attribute being less than the predetermined promotion threshold value, computing a total weight of all values for the significant attribute or the non-significant attribute from all sources; and
in response to the total weight exceeding a predetermined task threshold, raising a work item for a user to create a manual source and reapplying the ranking matrix process.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method enabling automated data cleansing and scrubbing at the attribute level is disclosed. A consolidated view may be provided of the scrubbed data or narratives that gets promoted to a final copy and the data or narratives received from multiple sources on a single user interface.
-
Citations
20 Claims
-
1. A method for performing data scrubbing at attribute level, comprising:
-
receiving, by a processor, data containing at least one significant attribute or at least one non-significant attribute and associated values from distributed data sources, the data sources being assigned weight against each value of the at least one significant attribute or the at least one non-significant attribute; and applying, by the processor, a ranking matrix process to the received data, the ranking matrix process comprising; for sources referring to different values for a significant attribute or a non-significant attribute, computing a combined weight therefrom; in response to the combined weight of the significant attribute exceeding a predetermined promotion threshold value, determining if the combined weight exceeds a predetermined confirmation threshold value; in response to the combined weight of the non-significant attribute and the significant attribute exceeding the predetermined promotion threshold value and the predetermined confirmation threshold value respectively, promoting values associated with the non-significant attribute and the significant attribute to a final templated copy; in response to the combined weight of the significant attribute or the non-significant attribute being less than the predetermined promotion threshold value, computing a total weight of all values for the significant attribute or the non-significant attribute from all sources; and in response to the total weight exceeding a predetermined task threshold, raising a work item for a user to create a manual source and reapplying the ranking matrix process. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system for performing data scrubbing, comprising:
-
a hardware processor; and a memory storing instructions, wherein the hardware processor is configured by the instructions to; provide an input interface configured to receive, for an event, data containing at least one significant attribute or at least one non-significant attribute and associated values from distributed data sources; apply a ranking matrix process for determining values associated with the at least one significant attribute or the at least one non-significant attribute to be promoted to a final templated copy based upon a combination of a predefined ranking attribute rule and a source weighting rule, wherein the ranking matrix process comprises; for sources referring to different values for a significant attribute or a non-significant attribute, computing a combined weight therefrom; in response to the combined weight of the significant attribute exceeding a predetermined promotion threshold value, determining if the combined weight exceeds a predetermined confirmation threshold value; in response to the combined weight of the non-significant attribute and the significant attribute exceeding the predetermined promotion threshold value and the predetermined confirmation threshold value respectively, promoting values associated with the non-significant attribute and the significant attribute to a final templated copy; in response to the combined weight of the significant attribute or the non-significant attribute being less than the predetermined promotion threshold value, computing a total weight of all values for the significant attribute or the non-significant attribute from all sources; and in response to the total weight exceeding a predetermined task threshold, raising a work item for a user to create a manual source and reapplying the ranking matrix process; and displaying the final templated copy along with the data received from the distributed sources on a graphical user interface, the graphical interface including a summary section to display key attributes of the event and a main section to display source headings, attribute headings, the final templated copy and incoming data, and a toolbar section adapted to perform a plurality of icon-based operations responsive to the summary section and the main section. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19)
-
-
20. A non-transitory computer readable medium embodying a program executable in a computer for performing data scrubbing at attribute level, the program comprising computer executable instructions for:
-
receiving data containing at least one significant attribute or at least one non-significant attribute and associated values from distributed data sources, the data sources being assigned weight against each value of the at least one significant attribute or the at least one non-significant attribute; and applying a ranking matrix process to the received data, the ranking matrix process comprising; for sources referring to different values for a significant attribute or a non-significant attribute, computing a combined weight therefrom; in response to the combined weight of the significant attribute exceeding a predetermined promotion threshold value, determining if the combined weight exceeds a predetermined confirmation threshold value; in response to the combined weight of the non-significant attribute and the significant attribute exceeding the predetermined promotion threshold value and the predetermined confirmation threshold value respectively, promoting values associated with the non-significant attribute and the significant attribute to a final templated copy; in response to the combined weight of the significant attribute or the non-significant attribute being less than the predetermined promotion threshold value, computing a total weight of all values for the significant attribute or the non-significant attribute from all sources; and in response to the total weight exceeding a predetermined task threshold, raising a work item for a user to create a manual source and reapplying the ranking matrix process.
-
Specification