×

System and method for measuring the quality of document sets

  • US 8,560,529 B2
  • Filed: 07/25/2011
  • Issued: 10/15/2013
  • Est. Priority Date: 06/26/2007
  • Status: Active Grant
First Claim
Patent Images

1. A computer implemented method for comparing the distinctiveness of a plurality of sets within a collection of information, the method comprising:

  • sampling, by a computer system, from the collection of information to generate at least one set;

    establishing, automatically, at least one identifying characteristic within the at least one set;

    determining a statistical distribution of the at least one identifying characteristic associated with the at least one set; and

    generating, by the computer system, a relative measurement of distinctiveness based on the statistical distribution of the at least one identifying characteristic associated with the at least one set and at least one other set, wherein the generating the relative measure of distinctiveness comprises accounting for a set size of a measured set based on a measurement of distinctiveness for a comparison set and a size for the comparison set, and normalizing the relative measurement of distinctiveness based on the set size of the measured set and the size for the comparison set.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×