Method and system for partial-order analysis of multi-dimensional data
First Claim
1. A method for supporting data analysis through use of a partial-order structuring of multi-dimensional data and aggregated measurements stored in computer memory, and use of a pattern-comparison means, said pattern-comparison means having the ability to compare two patterns across a multiplicity of dimensions so as to determine whether the first pattern is more-general-than the second pattern, comprising the steps of:
(a) loading of metadata defining the type hierarchy;
(b) transforming multi-dimensional fact data into a plurality of nodes, said nodes containing at least a pattern part and a measures part, and said pattern part having the ability to store a multiplicity of dimension values simultaneously;
(c) loading of said data nodes into a partial-order structure providing a functional representation of a rooted directed-acyclic graph whose arcs embody an ordering by the relationship established by the pattern-comparison means;
(d) loading of a plurality of additional nodes into said partial-order structure, wherein said additional nodes contain a pattern part that is more general than any pattern contained in said data nodes;
(e) aggregating the measures part from said data nodes into the measures part of said additional nodes;
(f) performing on-demand aggregation, facilitated by said partial-order structure, in response to a query for which insufficient pre-aggregated data is currently stored.
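For illustration only, steps (a) through (e) of the claim can be sketched in Python. The claim does not prescribe an implementation, so everything here is an assumption made for the example: the hierarchy contents, the names `PARENT`, `more_general_than`, `Node`, and `add_summary`, and the use of a simple sum as the aggregate. The sketch links each summary node directly to the data nodes it covers; a fuller partial-order structure would also order the summary nodes among themselves under a single root.

```python
from dataclasses import dataclass, field

# (a) Hypothetical type-hierarchy metadata: child value -> parent value.
PARENT = {
    "Boston": "USA", "Denver": "USA", "USA": "ANY",
    "Apple": "Fruit", "Pear": "Fruit", "Fruit": "ANY",
}

def ancestors(value):
    """Return the value followed by all of its ancestors in the hierarchy."""
    chain = [value]
    while value in PARENT:
        value = PARENT[value]
        chain.append(value)
    return chain

def more_general_than(first, second):
    """True if `first` differs from `second` and, in every dimension,
    its value equals or is an ancestor of the value in `second`."""
    return first != second and all(
        a in ancestors(b) for a, b in zip(first, second)
    )

@dataclass
class Node:
    pattern: tuple                      # one value per dimension
    measures: dict                      # e.g. {"sales": 10.0}
    children: list = field(default_factory=list)

# (b) Transform fact rows into data nodes (pattern part + measures part).
facts = [
    (("Boston", "Apple"), 10.0),
    (("Denver", "Apple"), 5.0),
    (("Boston", "Pear"), 2.0),
]
data_nodes = [Node(pattern, {"sales": sales}) for pattern, sales in facts]

# (c)-(e) Add a more general node, link it to the data nodes it covers,
# and aggregate their measures into it.
def add_summary(pattern, nodes):
    summary = Node(pattern, {"sales": 0.0})
    for node in nodes:
        if more_general_than(pattern, node.pattern):
            summary.children.append(node)
            summary.measures["sales"] += node.measures["sales"]
    return summary

root = add_summary(("ANY", "ANY"), data_nodes)
usa_apple = add_summary(("USA", "Apple"), data_nodes)
```

In this sketch the `("USA", "Apple")` node covers the Boston and Denver apple facts but not the pear fact, because "Apple" is not an ancestor of "Pear" in the assumed hierarchy.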
Abstract
A method and system for performing data analysis where multiple categorical dimensions and statistical measures are used. The method includes preparing multi-dimensional data into nodes and arranging them logically in the form of a partial-order database. The database may then be traversed by a data-mining tool, or queried in an OLAP manner. Queries are answered through a process of associative partial-order search, which includes optional support for “children-of” requests and optional support for fuzzy pattern-matching. By aggregating measures on-the-fly, this method is able to perform well without requiring summary nodes to make up a large percentage of the total node count.
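The on-the-fly aggregation described in the abstract can be sketched roughly as follows. This is a minimal Python illustration under stated assumptions, not the patented search procedure: the hierarchy, the stored values, and the names `covers`, `matches`, and `query` are hypothetical, and a linear scan over the data stands in for the associative partial-order search.

```python
# Hypothetical type hierarchy for the first dimension: child -> parent.
PARENT = {
    "Boston": "USA", "Denver": "USA", "Paris": "France",
    "USA": "ANY", "France": "ANY",
}

def covers(general, specific):
    """True if `general` equals `specific` or is one of its ancestors."""
    while specific != general and specific in PARENT:
        specific = PARENT[specific]
    return specific == general

def matches(query_pattern, data_pattern):
    """True if the query pattern covers the data pattern in every dimension."""
    return all(covers(q, d) for q, d in zip(query_pattern, data_pattern))

# Data nodes (pattern -> measure) and one stored summary node.
data = {
    ("Boston", "2001"): 10.0,
    ("Denver", "2001"): 5.0,
    ("Paris", "2001"): 7.0,
}
summaries = {("ANY", "2001"): 22.0}

def query(pattern):
    """Answer from stored nodes when possible, otherwise aggregate on demand."""
    if pattern in summaries:
        return summaries[pattern]
    if pattern in data:
        return data[pattern]
    # No pre-aggregated node exists for this pattern: sum the covered data.
    return sum(value for p, value in data.items() if matches(pattern, p))
```

Here `query(("ANY", "2001"))` is answered from the stored summary, while `query(("USA", "2001"))` has no stored node and is computed from the Boston and Denver data at query time. That is how only a small share of summary nodes needs to be kept.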
10 Claims
6. A system of logic circuitry and computational instructions for supporting data analysis through use of a partial-order structuring of multi-dimensional data and aggregated measurements stored in computer memory, and use of a pattern-comparison means, said pattern-comparison means having the ability to compare two patterns across a multiplicity of dimensions so as to determine whether the first pattern is more-general-than the second pattern, comprising:
(a) means for loading of metadata defining the type hierarchy;
(b) means for transforming multi-dimensional fact data into a plurality of nodes, said nodes containing at least a pattern part and a measures part, and said pattern part having the ability to store a multiplicity of dimension values simultaneously;
(c) means for loading of said data nodes into a partial-order structure providing a functional representation of a rooted directed-acyclic graph whose arcs embody an ordering by the relationship established by the pattern-comparison means;
(d) means for loading of a plurality of additional nodes into said partial-order structure, wherein said additional nodes contain a pattern part that is more general than any pattern contained in said data nodes;
(e) means for aggregating the measures part from said data nodes into the measures part of said additional nodes;
(f) means for performing on-demand aggregation, facilitated by said partial-order structure, in response to a query for which insufficient pre-aggregated data is currently stored.
Specification