×

Parallel processing of count distinct values

  • US 20070239663A1
  • Filed: 04/06/2006
  • Published: 10/11/2007
  • Est. Priority Date: 04/06/2006
  • Status: Abandoned Application
First Claim
Patent Images

1. A method for performing a count distinct function on values in at least one column of data comprising:

  • a) splitting the data into chunks based on the values in the at least one column of data upon which the count distinct function is to be performed, where no value appears in more than one chunk;

    b) determining if each chunk is of a size that enables it to fit into available memory, and i) if not, recursively splitting the oversized chunks until each chunk is of a size that enables it to fit into available memory; and

    c) performing an in memory count distinct function on each chunk and summing a number of distinct values from each chunk for display in at least one cell of a results grid.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×