Discovery-driven exploration of OLAP data cubes
First Claim
1. A method for locating data anomalies in a k-dimensional data cube, the method comprising the steps of:
- associating a surprise value with each cell of a data cube, the surprise value associated with a cell representing a degree of anomaly of a content of the cell with respect to other cells of the data cube; and
indicating a data anomaly when the surprise value associated with a cell exceeds a predetermined exception threshold.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for locating data anomalies in a k dimensional data cube that includes the steps of associating a surprise value with each cell of a data cube, and indicating a data anomaly when the surprise value associated with a cell exceeds a predetermined exception threshold. According to one aspect of the invention, the surprise value associated with each cell is a composite value that is based on at least one of a Self-Exp value for the cell, an In-Exp value for the cell and a Path-Exp value for the cell. Preferably, the step of associating the surprise value with each cell includes the steps of determining a Self-Exp value for the cell, determining an In-Exp value for the cell, determining a Path-Exp value for the cell, and then generating the surprise value for the cell based on the Self-Exp value, the In-Exp value and the Path-value.
213 Citations
44 Claims
-
1. A method for locating data anomalies in a k-dimensional data cube, the method comprising the steps of:
-
associating a surprise value with each cell of a data cube, the surprise value associated with a cell representing a degree of anomaly of a content of the cell with respect to other cells of the data cube; and indicating a data anomaly when the surprise value associated with a cell exceeds a predetermined exception threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. A program storage device comprising:
-
a storage area; and information stored in the storage area, the information being readable by a machine, and tangibly embodying a program of instructions executable by the machine for performing method steps comprising; associating a surprise value with each cell of a data cube, the surprise value associated with a cell representing a degree of anomaly of a content of the cell with respect to other cells of the data cube; and indicating a data anomaly when the surprise value associated with a cell exceeds a predetermined exception threshold. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44)
-
Specification