Partition boundary determination using random sampling on very large databases

  • US 7,024,401 B2
  • Filed: 07/02/2001
  • Issued: 04/04/2006
  • Est. Priority Date: 07/02/2001
  • Status: Expired due to Fees
First Claim

1. A method for database partition boundary determination in a database management system (DBMS), the method comprising:

    providing a pre-configured number S defining a default sample size in a database analysis program;

    selectively receiving by the database analysis program a particular number defining a desired sample size and setting said number S equal to said particular number;

    providing a seed value to the database analysis program for initializing a random number algorithm;

    randomly sampling S records of the database by the database analysis program using the random sampling algorithm, wherein said S records are different each time said method is utilized with different seed values, and wherein said S records are different for successive utilizations of said method if at least one record has been added to or deleted from said database between successive utilizations of said method;

    storing statistics for each of said S records as stored statistics including a record key for each record; and

    producing an approximation partition analysis based on said stored statistics, wherein said approximation partition analysis is not mathematically exact.
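
The steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `DEFAULT_SAMPLE_SIZE`, `sample_keys`, and `approximate_boundaries` are hypothetical, the database is assumed to be an in-memory list of dictionaries with a `"key"` field, and the choice of evenly spaced quantiles of the sampled keys as partition boundaries is an assumption, since the claim only requires an approximate analysis based on the stored statistics.

```python
import random

# Hypothetical pre-configured default sample size (the claim's number S).
DEFAULT_SAMPLE_SIZE = 1000


def sample_keys(records, seed, sample_size=None):
    """Randomly sample S records and return their keys in sorted order.

    `records` is assumed to be a list of dicts with a "key" field.
    If `sample_size` is given, it overrides the default S.
    """
    s = DEFAULT_SAMPLE_SIZE if sample_size is None else sample_size
    s = min(s, len(records))       # cannot sample more records than exist
    rng = random.Random(seed)      # seed initializes the random number generator
    sampled = rng.sample(records, s)
    # The "stored statistics" here are reduced to the record key alone.
    return sorted(rec["key"] for rec in sampled)


def approximate_boundaries(sorted_keys, num_partitions):
    """Pick num_partitions - 1 boundary keys at evenly spaced sample quantiles.

    The result approximates an even split of the full database; it is not
    mathematically exact because only the sample is inspected.
    """
    n = len(sorted_keys)
    if n == 0:
        return []
    return [sorted_keys[(i * n) // num_partitions]
            for i in range(1, num_partitions)]


# Usage: 10,000 synthetic records, a 500-record sample, 4 partitions.
records = [{"key": k} for k in range(10_000)]
keys = sample_keys(records, seed=42, sample_size=500)
boundaries = approximate_boundaries(keys, num_partitions=4)
```

Reusing the same seed on an unchanged record list reproduces the same sample, while a different seed selects different records. In this sketch the sample also changes when records are added or deleted, because the generator then draws from a population of a different size.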
