×

Method and apparatus for achieving uniform data distribution in a parallel database system

  • US 5,970,495 A
  • Filed: 06/10/1996
  • Issued: 10/19/1999
  • Est. Priority Date: 09/27/1995
  • Status: Expired due to Term
First Claim
Patent Images

1. A method of distributing data of a table partitioned across a parallel database system having a number of nodes comprising:

  • determining the data volume for nodes of said system associated with said table;

    comparing said data volume stored among said nodes;

    identifying groups of data in nodes having higher data volumes which may be distributed to nodes having lower data volumes to obtain a more uniform data distribution with minimum required data movement activity;

    moving said identified data to said nodes having lower data volumes to balance the data volumes of said nodes across which said data is partitioned;

    obtaining file size information for table partitions of said nodes by reading file attributes for said files and obtaining database statistics on data group volume and volume of data group usage;

    generating a distribution listing file depicting current data distribution;

    selecting one or more data groups for redistribution among selected nodes to which data is to be redistributed;

    generating a partitioning map for redistribution of said groups of data in which a best fit method has been applied to select data groups and a redistribution plan for redistribution according to descending weight of said data groups in which data will be substantially uniformly distributed among said nodes; and

    redistributing said data groups among said selected nodes in accordance with said partition map.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×