×

Data partitioning via bucketing bloom filters

  • US 7,743,013 B2
  • Filed: 06/11/2007
  • Issued: 06/22/2010
  • Est. Priority Date: 06/11/2007
  • Status: Expired due to Fees
First Claim
Patent Images

1. One or more computer storage media having stored thereon instructions that, when executed by one or more processors of a computing device, cause the one or more processors to:

  • identify a first data set of elements and a second data set of elements, wherein the first data set and the second data set are disjoint;

    assign each element in the first data set to a bucket of a first set of buckets;

    assign each element in the second data set to a bucket of a second set of buckets; and

    generate, for each bucket of the first set of buckets, a Bloom filter that indicates that each element assigned to the bucket of the first set of buckets is part of the first data set, and that indicates that each element assigned to a corresponding bucket of the second set of buckets is not part of the first data set,wherein to assign each element in the first data set to the bucket of the first set of buckets is to, for each element in the first data set;

    generate a hash value for the element by applying a hash function to the element; and

    assign the element to one bucket of the first set of multiple buckets that corresponds to the hash value for the element; and

    to assign each element in the second data set to the bucket of the second set of buckets is to, for each element in the second data set;

    generate a hash value for the element by applying the hash function to the element; and

    assign the element to one bucket of the second set of multiple buckets that corresponds to the hash value for the element.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×