Automated selection of generic blocking criteria

  • US 8,275,770 B2
  • Filed: 04/24/2009
  • Issued: 09/25/2012
  • Est. Priority Date: 04/24/2008
  • Status: Active Grant
First Claim

1. A computer implemented method of identifying a set of fields applicable to partition a plurality of records in an electronic database into one or more blocks based on a desired block size and independent of specific queries against the database, the method comprising:

    receiving a desired block size;

    calculating field probabilities for a plurality of fields in the database, wherein each field probability represents an average cohort size for a field divided by the number of records in the database, each of the field probabilities associated with one of the fields in the database, and wherein the average cohort size for each field corresponds to the average number of records containing a same field value in the respective field;

    determining a set of fields by combining the field probabilities of one or more fields by mathematical calculation, wherein a product of the combined field probabilities and the number of records in the database is less than or equal to the desired block size, and wherein the set of fields is determined independent of specific queries against the database; and

    outputting the set of fields.
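
The steps recited in claim 1 can be illustrated with a minimal Python sketch. This is not the patented implementation, and it rests on several assumptions. Records are modeled as plain dictionaries. The average cohort size is read as the mean record count per distinct field value. The "mathematical calculation" that combines field probabilities is taken to be a simple product, which treats the fields as independent. Fields are tried greedily, most selective first. The names `field_probability` and `select_blocking_fields` are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Sequence

Record = Dict[str, str]


def field_probability(records: Sequence[Record], field: str) -> float:
    """Average cohort size for `field` divided by the number of records."""
    if not records:
        raise ValueError("at least one record is required")
    # Cohort = the group of records sharing one value in this field.
    cohort_sizes = Counter(r[field] for r in records)
    avg_cohort = sum(cohort_sizes.values()) / len(cohort_sizes)
    return avg_cohort / len(records)


def select_blocking_fields(
    records: Sequence[Record],
    fields: Sequence[str],
    desired_block_size: float,
) -> List[str]:
    """Greedily pick fields until the expected block size is small enough.

    Assumption: probabilities are combined by multiplication, so the
    expected block size is (product of probabilities) * (record count).
    """
    n = len(records)
    probs = {f: field_probability(records, f) for f in fields}
    chosen: List[str] = []
    combined = 1.0
    # Try the most selective fields (smallest probability) first.
    for f in sorted(fields, key=lambda name: probs[name]):
        chosen.append(f)
        combined *= probs[f]
        if combined * n <= desired_block_size:
            return chosen
    raise ValueError("no combination of fields reaches the desired block size")


# Toy data: 8 records, 4 distinct cities, 2 distinct genders.
records = [
    {"city": c, "gender": g}
    for c in ["A", "B", "C", "D"]
    for g in ["F", "M"]
]

print(select_blocking_fields(records, ["gender", "city"], 2))  # ['city']
print(select_blocking_fields(records, ["gender", "city"], 1))  # ['city', 'gender']
```

In the toy data, `city` has a field probability of 0.25 (average cohort of 2 out of 8 records), so it alone yields an expected block size of 2. Reaching a block size of 1 requires adding `gender` (probability 0.5), since 0.25 × 0.5 × 8 = 1. The selection never looks at any query, which matches the claim's requirement that the set of fields be determined independent of specific queries against the database.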
