
Modular memoization, tracking and train-data management of feature extraction

  • US 10,572,822 B2
  • Filed: 07/21/2016
  • Issued: 02/25/2020
  • Est. Priority Date: 07/21/2016
  • Status: Active Grant
First Claim

1. A method comprising using at least one hardware processor for, in a feature extraction step of a machine learning analysis:

  • receiving at least one electronic document representing a dependency graph comprising feature extractors at each graph node and directed edges corresponding to computational dependencies of the feature extractors, wherein each of the feature extractors is configured to reduce data to be informative and non-redundant, by deriving vector values or matrix values from the data;

    for at least some of said feature extractors:

    i) determining extractor defining data, the extractor defining data comprising:

    extractor data and computational dependencies of said graph node in said dependency graph, wherein the extractor data are selected from the group consisting of:

    an extractor class, an extractor parameter, a cohort class, a cohort parameter, and a cohort index list;

    ii) computing a node lookup key based on said extractor defining data, wherein, when the feature extractor is context-insensitive, the node lookup key is for an entire cohort, wherein a context-insensitive feature extractor is a feature extractor which computes an identical value for a same sample in the sub-cohort and in the entire cohort;

    iii) when the node lookup key is not associated with node features that are stored on a non-transitory computer-readable storage medium:

    computing new node features, storing said new node features on said non-transitory computer-readable storage medium, and associating said node lookup key with said new node features;

    iv) when the feature extractor is context-sensitive, or when the feature extraction step is directed to the entire cohort:

    retrieving said node features or said new node features from said non-transitory computer-readable storage medium, wherein a context-sensitive feature extractor is a feature extractor which computes a different value for a same sample in the sub-cohort and in the entire cohort;

    v) when the feature extractor is context-insensitive and the feature extraction step is directed to a sub-cohort:

    deriving a feature of the sub-cohort from the node features or the new node features of the entire cohort without recomputing the feature for the sub-cohort; and

    sending at least some of said node features or new node features as an output set of said dependency graph, thereby accelerating the feature extraction step of the machine learning analysis.
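
The memoization flow in steps i) through v) can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `FeatureCache`, `node_lookup_key`, `extract`, `scale`, and `center` are hypothetical, an in-memory dictionary stands in for the non-transitory storage medium, a SHA-256 hash of JSON-serialized defining data is an assumed choice of lookup key, and the dependency graph is reduced to a single node with no upstream dependencies.

```python
import hashlib
import json


class FeatureCache:
    """Hypothetical in-memory stand-in for the claim's storage medium."""

    def __init__(self):
        self.store = {}          # node lookup key -> {sample id: feature value}
        self.computations = 0    # counts how often an extractor actually ran


def node_lookup_key(extractor_class, params, dependency_keys, cohort_ids):
    """Steps i) and ii): hash the extractor-defining data into a lookup key.

    Hashing JSON with SHA-256 is an assumption made for this sketch.
    """
    defining = {
        "class": extractor_class,
        "params": params,
        "deps": sorted(dependency_keys),
        "cohort": sorted(cohort_ids),
    }
    blob = json.dumps(defining, sort_keys=True).encode("utf-8")
    return hashlib.sha256(blob).hexdigest()


def extract(cache, name, fn, params, data, entire_cohort, target_ids,
            context_sensitive):
    """Return features for target_ids, reusing stored node features if present."""
    # Step ii): a context-insensitive extractor is keyed on the entire cohort,
    # so a sub-cohort request maps to the same key as a full-cohort request.
    key_ids = list(target_ids) if context_sensitive else list(entire_cohort)
    key = node_lookup_key(name, params, [], key_ids)

    # Step iii): compute and store only when the key has no stored features.
    if key not in cache.store:
        values = fn([data[i] for i in key_ids], **params)
        cache.store[key] = dict(zip(key_ids, values))
        cache.computations += 1

    # Steps iv) and v): retrieve the stored features; for a sub-cohort of a
    # context-insensitive extractor this selects from the entire-cohort result.
    features = cache.store[key]
    return {i: features[i] for i in target_ids}


def scale(values, factor):
    """Context-insensitive: each output depends only on its own sample."""
    return [v * factor for v in values]


def center(values):
    """Context-sensitive: each output depends on the mean of the cohort."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


data = {"a": 1.0, "b": 2.0, "c": 3.0, "d": 6.0}
cohort = ["a", "b", "c", "d"]
cache = FeatureCache()

full_scaled = extract(cache, "scale", scale, {"factor": 2.0},
                      data, cohort, cohort, False)
sub_scaled = extract(cache, "scale", scale, {"factor": 2.0},
                     data, cohort, ["a", "c"], False)
sub_centered = extract(cache, "center", center, {},
                       data, cohort, ["a", "c"], True)
```

Under these assumptions, the sub-cohort request for `scale` is served from the entry already stored for the entire cohort, so the extractor runs once. Because `center` depends on the cohort mean, its sub-cohort request gets a separate key and is computed on its own.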
