×

Multi-dimensional database record compression utilizing optimized cluster models

  • US 6,633,882 B1
  • Filed: 06/29/2000
  • Issued: 10/14/2003
  • Est. Priority Date: 06/29/2000
  • Status: Active Grant
First Claim
Patent Images

1. A method of querying a database containing data records stored in the database comprising the steps of:

  • a) clustering data from data records having multiple dimensions that are stored on a database to provide an initial cluster model having an initial probability distribution describing data records in the database;

    b) comparing the initial probability distribution with a representative sample of records in the database to determine a sufficiency of said initial probability distribution;

    c) modifying the cluster model to provide an adjusted cluster model that characterizes the data in the database, said modifying step performed by finding a region within an attribute space of the data records of high discrepancy between the initial cluster model and a data sample gathered from the database and increasing the cluster number of the cluster model and reclustering at least a portion of the data in the database to produce the adjusted cluster model to reduce discrepancies between the initial probability distribution and data sample from the database; and

    d) determining a sum or a count of data records from the database falling within specified ranges of the multiple dimensions by integrating a functional representation based on the probablity distribution of the adjusted cluster model over the ranges.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×