×

PROFILING IN A MASSIVE PARALLEL PROCESSING ENVIRONMENT

  • US 20100250563A1
  • Filed: 03/27/2009
  • Published: 09/30/2010
  • Est. Priority Date: 03/27/2009
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method of profiling a data set in a parallel processing environment, comprising:

  • partitioning an initial data set vertically according to multiple attribute subsets;

    profiling one or more of the attribute subsets;

    generating a list of subjects or otherwise horizontal component values corresponding to a specific attribute value identified in the profiling;

    extracting values of multiple attributes for each said identified subject or otherwise horizontal component values;

    assembling sample results of said identified subjects or otherwise horizontal component values;

    merging the sample results to form a profiled subset of the initial data set; and

    transmitting, displaying or storing the profiled subset of the initial data set, a further processed version, or combinations thereof.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×