×

STRATIFIED SAMPLING USING ADAPTIVE PARALLEL DATA PROCESSING

  • US 20150186493A1
  • Filed: 12/27/2013
  • Published: 07/02/2015
  • Est. Priority Date: 12/27/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method for stratified sampling of a plurality of records, the method comprising:

  • partitioning a plurality of records into a plurality of splits, wherein each split includes at least a portion of the plurality of records;

    providing at least one split of the plurality of splits to a mapper;

    assigning at least a portion the records of the at least one split to a group, wherein each assignment to the group is based on a strata of the assigned record;

    filtering the records of the group, wherein each filtering is based on a comparison of a weight of a record to a local threshold of the mapper;

    shuffling the group to a reducer; and

    providing a stratified sampling of the plurality of records based on the group.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×