Detecting and displaying exceptions in tabular data
First Claim
Patent Images
1. A system that facilitates automatic detection of exceptions in tabular data, the system comprising a processor and memory storing computer executable components, the computer executable components comprising:
- an application that contains tabular data expressed as a plurality of rows that include a plurality of columns; and
a predictive analysis component that receives the tabular data and predicts exceptions in the tabular data utilizing a probabilistic clustering algorithm with predictive capabilities, the predictive analysis component comprising;
a clustering module that groups the plurality of rows into one or more groups using the clustering algorithm, wherein the plurality of rows are grouped based on similarities between values in the columns of each of the rows in each group, and wherein the clustering module further identifies at least one row from the plurality of rows that is not grouped with at least one other row from the plurality of rows; and
a predictive module that determines a group to which the identified at least one row is most similar and compares the value of each column in the at least one row to a predicted value for the corresponding column of rows that are grouped into the most similar group, and based on the comparison determines at least one column of the identified at least one row that contains a value that is most different from the corresponding predicted value such that the at least one column is identified as having caused the identified at least one row to not be grouped with the most similar group.
2 Assignments
0 Petitions
Accused Products
Abstract
Data expressed as tabular data having columns and rows can be analyzed and data determined to be an exception can be flagged. In addition, reasons for flagging such data as exceptions can be presented to a user to facilitate further analysis and action on the data. A predictive analysis component can utilize a clustering algorithm with predictive capabilities to autonomously analyze the data. Periodic re-analysis of the data can be performed to determine if exceptions have changed based on new or modified data.
31 Citations
15 Claims
-
1. A system that facilitates automatic detection of exceptions in tabular data, the system comprising a processor and memory storing computer executable components, the computer executable components comprising:
-
an application that contains tabular data expressed as a plurality of rows that include a plurality of columns; and a predictive analysis component that receives the tabular data and predicts exceptions in the tabular data utilizing a probabilistic clustering algorithm with predictive capabilities, the predictive analysis component comprising; a clustering module that groups the plurality of rows into one or more groups using the clustering algorithm, wherein the plurality of rows are grouped based on similarities between values in the columns of each of the rows in each group, and wherein the clustering module further identifies at least one row from the plurality of rows that is not grouped with at least one other row from the plurality of rows; and a predictive module that determines a group to which the identified at least one row is most similar and compares the value of each column in the at least one row to a predicted value for the corresponding column of rows that are grouped into the most similar group, and based on the comparison determines at least one column of the identified at least one row that contains a value that is most different from the corresponding predicted value such that the at least one column is identified as having caused the identified at least one row to not be grouped with the most similar group. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for exception detection, comprising:
-
building a predictive clustering model on tabular data using a probabilistic clustering algorithm, wherein the clustering model comprises a plurality of groups of rows, wherein the rows are grouped based on similarities between values in the columns of each of the rows in each group, the clustering model being stored in computer storage media of a computer device; comparing, by a processor of the computer device, each row contained in the tabular data to the clustering model; ascertaining, by the processor of the computer device, one or more rows that are an exception by determining that the differences between the one or more rows and the group that is most similar to the row are greater than a threshold; and presenting, by the processor of the computer device, the one or more rows that are an exception to a user including identifying at least one column of each of the one or more rows that caused the one or more rows to be ascertained as exceptions by comparing the value of each column in the one or more rows to a predicted value for the corresponding column of rows that are grouped into the most similar group and determining one or more particular columns that contain values that are most different from the corresponding predicted value such that the one or more particular columns are identified as having caused the row to be an exception. - View Dependent Claims (11, 12, 13, 14, 15)
-
Specification