×

Decision forest based classifier for determining predictive importance in real-time data analysis

  • US 7,644,049 B2
  • Filed: 11/19/2004
  • Issued: 01/05/2010
  • Est. Priority Date: 11/19/2004
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method comprising:

  • generating predictive importance via an instance classification software and predictive importance network generator stored at a storage medium and executed by a processor coupled with the storage medium, the generating of the predictive importance includingfor a first feature of a data set having a plurality of features, training a classifier to predict the first feature in terms of other features in the data set to obtain a trained classifier, wherein the data set is progressively classified into branches via a decision criterion such that the decision criterion is applied at each decision point, the decision criterion including functions of features, the features including the first feature and a second feature, wherein the classifier includes a forest based classifier;

    scrambling the values of the second feature in the data set to obtain a scrambled data set, wherein scrambling including repeating the values of the second feature to be used for determining the predictive important of the second feature;

    executing the trained classifier on the scrambled data set, wherein the trained classifier to facilitate distinguishing of content of the data set by relevancy of the content, wherein the relevancy is based on features contained in the data set;

    determining the predictive importance of the second feature in predicting the first feature based at least in part on the accuracy of the trained classifier in predicting the first feature when executed with the scrambled data set, and based in part of other relevant features while ignoring other irrelevant features;

    creating a graph of the data set in which each of the first and the second features is a node of the graph and a label on an edge between the first node and the second node is based at least in part on the predictive importance of the first feature in terms of the second feature;

    applying the predictive importance to perform real-time diagnosis of factors including one or more of real-time medical analysis of a disease trend, real-time configuration of manufacturing settings to manufacture a product, and real-time safety analysis of a product; and

    displaying, via a display device, the real-time diagnosis of the factors based on the predictive importance.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×