×

Data clustering, segmentation, and parallelization

  • US 10,503,755 B2
  • Filed: 11/15/2012
  • Issued: 12/10/2019
  • Est. Priority Date: 11/15/2011
  • Status: Active Grant
First Claim
Patent Images

1. A method, including:

  • processing a first set of original records by a first processing entity to generate a second set of records that includes the original records and one or more copies of each original record, each original record including one or more fields, and the processing of each of at least some of the original records includinggenerating at least one copy of the original record, andassociating a first segment value with the original record and associating a second segment value with the copy, where the first segment value corresponds to a first portion of one or more data values of respective fields of the original record and the second segment value correspond to a second portion of the one or more data values of the respective fields of the original record, and where the second portion is different from the first portion, andpartitioning the second set of records among a plurality of recipient processing entities based on the segment values associated with the records in the second set, and, at each recipient processing entity, performing an operation based on one or more data values of the records received at the recipient processing entity to generate results.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×