×

Multivariate insight discovery approach

  • US 10,255,345 B2
  • Filed: 10/09/2014
  • Issued: 04/09/2019
  • Est. Priority Date: 10/09/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • identifying, by the at least one processor, data types of a raw dataset and models of data of the raw dataset to determine attribute hierarchies;

    generating, by the at least one processor, a reduced dataset from the raw dataset based on the determined attribute hierarchies by;

    mapping, by the at least one processor, the attribute hierarchies to identify sets of equivalent attributes and measures;

    for each set of equivalent attributes, selecting, by the at least one processor, one of the equivalent attributes and discarding the remaining equivalent attributes; and

    for each set of equivalent attributes, selecting, by the at least one processor, one of the equivalent measures and discarding the remaining equivalent measures;

    aggregating over at least one attribute of the reduced dataset, by the at least one processor, to generate a preprocessed dataset with the same relevant statistical properties of the raw dataset, such that at least one type of statistical analysis produces the same results when applied to the preprocessed dataset as when applied to the raw dataset;

    identifying, by the at least one processor, subsets of the preprocessed dataset that include data that exhibits non-random patterns by performing the at least one type of statistical analysis;

    generating a score for each of the identified subsets of the preprocessed dataset, by the at least one processor, based on the data that exhibits non-random patterns included in each of the identified subsets;

    ranking each of the identified subsets for presence of non-random data structures, by the at least one processor, based on the score generated for each of the identified subsets;

    selecting, by the at least one processor, an identified subset based on the ranking of the identified subset; and

    generating, by the at least one processor, a visualization that highlights a non-random structure of the selected identified subset.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×