RE-SIZING DATA PARTITIONS FOR ENSEMBLE MODELS IN A MAPREDUCE FRAMEWORK

  • US 20150356149A1
  • Filed: 02/24/2015
  • Published: 12/10/2015
  • Est. Priority Date: 06/05/2014
  • Status: Active Grant
First Claim

1. A method comprising:

    determining an initial number of base model partitions of data from a plurality of data sources;

    determining an initial base model partition size based at least in part on the initial number of base model partitions;

    evaluating the initial base model partition size at least in part with reference to at least one base model partition size reference;

    determining a finalized number of base model partitions based at least in part on the initial base model partition size;

    determining a revised base model partition size; and

    generating revised base models based at least in part on the revised base model partition size, wherein generating the revised base models comprises using a predictive modeling framework to randomly assign input data records from the plurality of data sources into the finalized number of base model partitions.
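
The steps recited in claim 1 can be illustrated with a minimal Python sketch. The claim does not state how the partition size is computed, what the "base model partition size reference" is, or which framework performs the assignment, so everything concrete here is an assumption made for illustration: the size reference is modeled as a hypothetical `min_size`/`max_size` range, the sizes are simple averages (records per partition), and the function names `plan_partitions` and `assign_records` are invented. Random assignment is done in-process with Python's `random` module rather than in a MapReduce job.

```python
import math
import random


def plan_partitions(total_records, initial_partitions, min_size, max_size):
    """Return (finalized_partition_count, revised_partition_size).

    Assumption: the size reference is a [min_size, max_size] range and the
    partition size is the average number of records per partition. The claim
    does not specify either of these rules.
    """
    if total_records <= 0 or initial_partitions <= 0:
        raise ValueError("total_records and initial_partitions must be positive")
    if not 0 < min_size <= max_size:
        raise ValueError("require 0 < min_size <= max_size")

    # Initial partition size derived from the initial partition count.
    initial_size = total_records / initial_partitions

    # Evaluate the initial size against the (assumed) reference range and
    # finalize the partition count accordingly.
    if initial_size > max_size:
        final_partitions = math.ceil(total_records / max_size)
    elif initial_size < min_size:
        final_partitions = max(1, total_records // min_size)
    else:
        final_partitions = initial_partitions

    # Revised partition size under the finalized count.
    revised_size = total_records / final_partitions
    return final_partitions, revised_size


def assign_records(records, num_partitions, seed=None):
    """Randomly assign each record to one of num_partitions partitions."""
    rng = random.Random(seed)
    partitions = [[] for _ in range(num_partitions)]
    for record in records:
        partitions[rng.randrange(num_partitions)].append(record)
    return partitions
```

Under these assumptions, `plan_partitions(10_000, 4, 500, 2_000)` returns `(5, 2000.0)`: the initial size of 2,500 records exceeds the assumed maximum, so the partition count is raised from 4 to 5. Because `assign_records` places each record independently, actual partition sizes only approximate the revised size; one base model would then be trained per partition.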

  • 1 Assignment