×

Generating partitioned hierarchical groups based on data sets for business intelligence data models

  • US 10,698,924 B2
  • Filed: 05/22/2014
  • Issued: 06/30/2020
  • Est. Priority Date: 05/22/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method for generating a hierarchical group based on a set of data, the method comprising:

  • classifying two or more data items from a set of data with respect to a library of ontological concepts based at least in part on properties of the two or more data items, including detecting correlations between the properties of the two or more data items and one or more ontological concepts from the library of ontological concepts, wherein the properties of the two or more data items include data types defined for the two or more data items and ranges of data values in data fields of the two or more data items;

    classifying the two or more data items with respect to lexical correlations between the two or more data items, including determining correlations between one or more elements of headers of the two or more data items;

    analyzing the two or more data items based on one or more factors to determine whether the one or more factors contribute to defining a hierarchical relationship, wherein the analysis utilizes the one or more factors that comprise a set of heuristic rules and relative cardinality, wherein the set of heuristic rules discounts or disqualifies quantifiers or metrics associated with the two or more data items, and wherein the relative cardinality minimizes quantifiers or metrics through merging;

    generating a hierarchical group in which the two or more data items are partitioned into one or more hierarchical partitions based at least in part on the classifying with respect to the library of ontological concepts, the classifying with respect to the lexical correlations, and the analysis of the two or more data items based on the one or more factors, wherein each of the one or more hierarchical partitions comprises the two or more data items; and

    verifying a sampling of data in the one or more hierarchical partitions, including measuring correlations between data in the two or more data items in a particular hierarchical partition from the one or more hierarchical partitions to determine whether the particular hierarchical partition has a first data item at a leaf level of the particular hierarchical partition in a one-to-many relationship with a second data item at a base level of the particular hierarchical partition.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×