
Dimension grouping and reduction for model generation, testing, and documentation

  • US 10,417,523 B2
  • Filed: 11/07/2017
  • Issued: 09/17/2019
  • Est. Priority Date: 11/07/2016
  • Status: Active Grant
First Claim

1. A non-transitory computer readable medium including executable instructions, the instructions being executable by a processor to perform a method, the method comprising:

    receiving analysis data and output indicator, the output indicator indicating a subset of data of the analysis data, the analysis data including multiple dimensions associated with data points;

    receiving a lens function identifier, a metric function identifier, and a resolution function identifier;

    mapping data points from a transposition of the analysis data, to a reference space utilizing a lens function identified by the lens function identifier, the transposition of the analysis data transforming the analysis data such that the features are data points, the mapping of data points being performed by applying the lens functions across dimensions for each data point of the transposition of the analysis data;

    generating a cover of the reference space using a resolution function identified by the resolution identifier;

    clustering the data points mapped to the reference space using the cover and a metric function identified by the metric function identifier to determine each node of a plurality of nodes of a graph, each node including at least one data point;

    for each node, identifying data points that are members of that node to identify similar features;

    grouping features that are members of the same node as being similar to each other;

    for each feature, determining correlation with at least some of the subset of data of the analysis data and generate a correlation score;

    displaying at least a subset of groups that include features that are similar to each other and display the correlation score for each displayed feature;

    receiving a selection of a subset of features from the at least the subset of groups;

    generating a set of models, each model including at least one of the selection of the subset of features;

    determining fit of each generated model to the subset of data of the analysis data and generate a model score; and

    generating a report recommending the model with the highest score.
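
The claimed steps can be illustrated with a minimal sketch. It is not the patented implementation: the claim leaves the lens, metric, and resolution functions open, so the sketch assumes a mean lens, a cover of equal-width overlapping intervals, and single-linkage clustering under a Euclidean metric with threshold `eps`. It also assumes Pearson correlation for the correlation score and in-sample R² of a least-squares linear model for the model score. All function names, parameters, and the toy data are hypothetical.

```python
import numpy as np
from itertools import combinations

def feature_groups(X, n_intervals=4, overlap=0.25, eps=0.5):
    """Mapper-style grouping of features; returns one list of feature indices per node."""
    F = X.T                          # transposition: each feature becomes a data point
    lens = F.mean(axis=1)            # assumed lens: mean across dimensions of each point
    lo, hi = lens.min(), lens.max()
    width = (hi - lo) / n_intervals
    if width == 0:
        width = 1.0                  # all lens values equal: fall back to a unit width
    groups = []
    for i in range(n_intervals):
        # assumed cover of the reference space: overlapping equal-width intervals
        a = lo + (i - overlap) * width
        b = lo + (i + 1 + overlap) * width
        unvisited = set(np.flatnonzero((lens >= a) & (lens <= b)).tolist())
        # single-linkage clustering inside the cover element (Euclidean metric)
        while unvisited:
            seed = unvisited.pop()
            node, stack = {seed}, [seed]
            while stack:
                p = stack.pop()
                near = {q for q in unvisited
                        if np.linalg.norm(F[p] - F[q]) <= eps}
                unvisited -= near
                node |= near
                stack.extend(near)
            groups.append(sorted(node))   # one graph node = one group of similar features
    return groups

def correlation_scores(X, y):
    """Absolute Pearson correlation of each feature with the outcome y.
    A constant feature would yield NaN; the toy data below has none."""
    return [abs(np.corrcoef(X[:, j], y)[0, 1]) for j in range(X.shape[1])]

def best_model(X, y, selected):
    """Fit a linear model for every non-empty subset of the selected features
    and return the subset with the highest in-sample R^2."""
    scores = {}
    for k in range(1, len(selected) + 1):
        for feats in combinations(selected, k):
            A = np.column_stack([X[:, list(feats)], np.ones(len(y))])
            coef, *_ = np.linalg.lstsq(A, y, rcond=None)
            resid = y - A @ coef
            scores[feats] = 1.0 - resid.var() / y.var()
    best = max(scores, key=scores.get)
    return best, scores[best], scores

# Toy analysis data: features 0 and 1 are near-duplicates; y plays the role of
# the subset of data named by the output indicator.
t = np.linspace(0.0, 1.0, 40)
X = np.column_stack([t, t + 0.001, np.cos(7 * t), 3 * t**2])
y = 2 * t + 0.1 * np.sin(20 * t)

groups = feature_groups(X)
corr = correlation_scores(X, y)
best_features, best_score, all_scores = best_model(X, y, selected=[0, 2, 3])
print(f"Groups: {groups}")
print(f"Recommended model uses features {best_features}")
```

Because the cover elements overlap, a feature can appear in more than one node, as is usual for Mapper-style graphs. In-sample R² never decreases as features are added, so a practical scoring function would more likely use held-out data or a penalized score; the sketch keeps the simpler form for brevity.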
