Partition boundary determination using random sampling on very large databases

  • US 20030004944A1
  • Filed: 07/02/2001
  • Published: 01/02/2003
  • Est. Priority Date: 07/02/2001
  • Status: Active Grant
First Claim

1. A method for database partition boundary determination, the method comprising the steps of:

  • providing a pre-configured number S defining a default sample size;

    selectively receiving a particular number defining a desired sample size and setting said number S equal to said particular number;

    providing a seed value for initializing a random number algorithm;

    randomly sampling S records of the database using the random sampling algorithm, wherein said S records are different each time said method is utilized with different seed values, and wherein said S records are different for successive utilizations of said method if at least one record has been added to or deleted from said database between successive utilizations of said method;

    storing statistics for each of said S records as stored statistics including a record key for each record; and

    producing an approximation partition analysis based on said stored statistics, wherein said approximation partition analysis is not mathematically exact.
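
The steps of the claim above can be illustrated with a minimal sketch. It is not the patented implementation. The in-memory list of records, the `key` and `size` fields, the default of 1000, and the function names `sample_record_stats` and `approximate_boundaries` are all hypothetical, chosen only for illustration. Choosing boundaries as evenly spaced quantiles of the sampled keys is likewise an assumption about what an approximation partition analysis might compute. The sketch also does not try to reproduce the claim's wording about samples differing after records are added or deleted.

```python
import random

# Hypothetical pre-configured default sample size S (not taken from the patent).
DEFAULT_SAMPLE_SIZE = 1000


def sample_record_stats(records, seed, sample_size=None):
    """Randomly sample S records and keep per-record statistics.

    records: a list of dicts with hypothetical 'key' and 'size' fields.
    seed: initializes the random number generator, so a given seed
          reproduces the same sample on an unchanged list.
    sample_size: optional override; when given, S is set to this value.
    """
    s = DEFAULT_SAMPLE_SIZE if sample_size is None else sample_size
    s = min(s, len(records))  # cannot sample more records than exist
    rng = random.Random(seed)
    sampled = rng.sample(records, s)  # sampling without replacement
    # Store statistics for each sampled record, including its record key.
    return [{"key": r["key"], "size": r["size"]} for r in sampled]


def approximate_boundaries(stats, num_partitions):
    """Estimate partition boundary keys from the sampled statistics.

    The result is an approximation: it depends on which records were
    sampled and is not an exact split of the full database.
    """
    keys = sorted(st["key"] for st in stats)
    if not keys or num_partitions < 2:
        return []
    # Take evenly spaced quantiles of the sampled keys as boundaries.
    return [keys[(i * len(keys)) // num_partitions]
            for i in range(1, num_partitions)]


if __name__ == "__main__":
    database = [{"key": k, "size": 100} for k in range(10_000)]
    stats = sample_record_stats(database, seed=42, sample_size=500)
    print(approximate_boundaries(stats, num_partitions=4))
```

Because only the sampled keys are sorted, the cost of the analysis depends on S rather than on the size of the database, which is the practical appeal of sampling for very large databases.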
