×

Systems and methods for partitioning sets of features for a bayesian classifier

  • US 9,349,101 B2
  • Filed: 08/29/2014
  • Issued: 05/24/2016
  • Est. Priority Date: 08/29/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method of building a partition of features in an input set, in which feature subsets listed in a partition list have probabilistic interdependence among features in the feature subset, the method including:

  • accessing an input set including at least one input tuple comprising feature-values assigned to features;

    identifying input subtuples comprising unique feature subsets in the input tuple;

    accessing a tuple instance count data structure stored in memory that provides counts of tuples in a data set;

    computing class entropy scores for the input subtuples that have at least a threshold support count of instances in the tuple instance count data structure;

    adding feature subsets corresponding to the input subtuples to a partition list, including;

    ordering at least some of the input subtuples by non-decreasing class entropy score;

    traversing the ordered input subtuples, including;

    adding the feature subset of a current ordered input subtuple to the partition list, andpruning from subsequent consideration other input subtuples that include any features in the current ordered input subtuple; and

    reaching completion when all of the features of the input tuple have been added to the partition list; and

    storing the partition list in a memory, whereby it becomes available to use with a classifier.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×