×

Method and apparatus for estimating the number of occurrences of frequent values in a data set

  • US 5,542,089 A
  • Filed: 07/26/1994
  • Issued: 07/30/1996
  • Est. Priority Date: 07/26/1994
  • Status: Expired due to Term
First Claim
Patent Images

1. A method of estimating the number of occurrences of values of query search keys in a data set stored in a digital computer for use by a query optimizer of the computer, the method comprising the steps of:

  • defining at least two independent hashing functions that map values of the data set to buckets of respective hashing tables that are maintained in data storage of the computer;

    obtaining a current value from among the values in the data set;

    mapping the current value to a multiplicity of hashing table buckets of the data storage that are defined by each hashing function and incrementing an associated bucket count in the data storage;

    determining if the incremented bucket count of each hashing table satisfies predetermined criteria for being a popular bucket;

    designating the current value as active if all of the buckets to which the current value is mapped are designated popular buckets and adding the current value to a list of active values in the data storage that are associated with at least one of the hashing tables;

    collecting predetermined, statistical data related to the current value if it has been designated active;

    repeating the steps of obtaining, mapping, determining, and designating until all values in the data set have been obtained; and

    producing estimates of the most frequent values in the data set from the collected statistical data and providing them to the query optimizer.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×