Method for balancing of distributed tree file structures in parallel computing systems to enable recovery after a failure

US 5,230,047 A
Filed: 04/16/1990
Issued: 07/20/1993
Est. Priority Date: 04/16/1990
Status: Expired due to Fees

First Claim

Patent Images

1. In a distributed computing network containing a plurality of interconnected nodes, each node comprising a processor and data storage means, a plurality of data files or non-volatile storage distributed among said nodes including a tree structure of key-index data, said tree structure of key-index data including a ROOT of the tree structure for a top level of the tree, a method for balancing, among nodes, said tree structure, comprising processor executed steps of:

a. providing said ROOT in a file in a non-volatile storage, said ROOT including first and second lists, each being a list of keys describing a distribution of key-index data indentifiers are distributed, copies of said ROOT;

c. determining if a first node contains an excess of said key-index data indentifiers in comparison to a second node;

d. upon a determination of said excess, moving said excess key-index data indentifiers to non-volatile storage on said second node to achieve an approximately balanced distribution thereof; and

e. updating said second lists in files containing copies of said ROOT in said first and second nodes by noting each move of a key index data identifier, whereby upon a malfunction of either said first or second node before said approximate balance is achieved, a difference in entries exists between said first and second lists in a ROOT file in a non-failed first or second node that enables system recovery.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A distributed network is described which contains a plurality of interconnected nodes each node including a processor and data storage apparatus. A plurality of key-index data identifiers are distributed among the nodes, with each node including a tree data structure in non-volatile storage defining locations of the key-index data identifiers. The tree data structure includes a ROOT data structure comprising two lists, "NEW ROOT" and "OLD ROOT", each comprised of an ordered array of boundaries assigned nodes for the top level of the tree. A method is described for balancing the tree data structure which comprises the steps of:

a. a providing in each of the nodes across which the key-index data identifiers are distributed, at least copies of the two lists, "NEW ROOT" and "OLD ROOT", of the ROOT data structure;

b. determining when a first node contains an excess of key-index data identifiers;

c. moving the excess of key-index data identifiers to a second node;

d. updating the first node/second node boundary value in "NEW ROOT" of the ROOT data structure and the copies of "NEW ROOT" in the first and second nodes to note the movement of the data file identifiers, whereby in the event of a malfunction of one of the nodes, a record exists in both of the nodes of both an updated and non-updated ROOT data structure to enable data recovery.

166 Citations

11 Claims

1. In a distributed computing network containing a plurality of interconnected nodes, each node comprising a processor and data storage means, a plurality of data files or non-volatile storage distributed among said nodes including a tree structure of key-index data, said tree structure of key-index data including a ROOT of the tree structure for a top level of the tree, a method for balancing, among nodes, said tree structure, comprising processor executed steps of:
- a. providing said ROOT in a file in a non-volatile storage, said ROOT including first and second lists, each being a list of keys describing a distribution of key-index data indentifiers are distributed, copies of said ROOT;
  
  c. determining if a first node contains an excess of said key-index data indentifiers in comparison to a second node;
  
  d. upon a determination of said excess, moving said excess key-index data indentifiers to non-volatile storage on said second node to achieve an approximately balanced distribution thereof; and
  
  e. updating said second lists in files containing copies of said ROOT in said first and second nodes by noting each move of a key index data identifier, whereby upon a malfunction of either said first or second node before said approximate balance is achieved, a difference in entries exists between said first and second lists in a ROOT file in a non-failed first or second node that enables system recovery.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
- - 2. The method as defined in claim 1 wherein said ROOT contains an indication of key-index data identifier boundary values in each said node, wherein step c further comprises the step of:
    - c1. establishing a new boundary value for said first node in said second list of said ROOT to enable a determination of key-index data identifiers to be moved from said first node.
  - 3. The method as defined in claim 2 wherein step (e) includes further the steps of:
    - e1. in said first node, establishing a pointed to indicate a next key-index data identifier to be moved; and
      
      e2. establishing a pointed in said second node, indicating a key-index data identifier boundary after moving a key-index data identifier into said second node from said first node.
  - 4. The method as defined in claim 3 further including the steps of:
    - e3. deleting a key-index data identifier in said first node when said first node receives from said second node an acknowledgement of receipt of said key-index data identifier; and
      
      e4. moving said pointer in said first node to a next key-index data identifier to be moved, until said pointer reaches said new boundary value.
  - 5. The method as defined in claim 4 comprising the further step of:
    - f. in the event of a malfunction in a node, determining if a rebalancing was in process between nodes by comparing, in each said node, the first and second lists of said ROOT data structure to determine identify or non-identity thereof.
  - 6. The method as defined in claim 5 comprising the further step of:
    - g. upon a determination in any said node of non-identity in said first and second lists, performing steps e1-e4, starting with a non-identical key-index data identifier in said second list found in step f.
  - 7. The method as defined in claim 6, further comprising the steps of:
    - h. terminating said step g in a said node when said pointer indicates a key-index data identifier whose value is less than a said boundary value established in step c1.
  - 8. The method as defined in claim 1 wherein said moving step (d) includes the further step of:
    - d1. retaining in said first node a copy of each key-index data identifier moved to said second node until acknowledgement of receipt and storage of a moved key-index data identifier in said second node is received.
  - 9. The method as defined in claim 8 comprising the further step of:
    - f. upon all said excess key-index data identifiers having been moved to said second node, writing an updated copy of said second list in place of said first list in said ROOT data structure in non-volatile storage for said tree of key-index data identifiers.
  - 10. The method of claim 9, further comprising the step of:
    - g. propagating to all nodes said updated copy of said ROOT data structure.
  - 11. The method as defined in claim 1, further comprising the steps of:
    - f. responding to a key-index data identifier movement command transmitted to said first node from a node not involved in said balancing, by determining if the portion of said tree data structure in which said command is sought has been moved to said second node and, if so, redirecting said key-index data identifier command to said second node.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
International Business Machines Corporation
Original Assignee
International Business Machines Corporation
Inventors
Frey, Alexander H. Jr., Mosteller, Richard C.
Primary Examiner(s)
Harrell, Robert B.

Application Number

US07/510,209
Time in Patent Office

1,191 Days
Field of Search

364/DIG. 1 MS File, 364/DIG. 2 MS File, 371/6, 371/7, 371/8.1, 371/8.2, 371/9.1, 371/10.1, 371/10.2, 371/11.1, 371/11.3, 371/12, 371/13, 371/14, 395/200, 395/325, 395/575, 395/600, 395/800, 395/400
US Class Current

714/4.1
CPC Class Codes

G06F 11/1402 Saving, restoring, recoveri...

G06F 16/13 File access structures, e.g...

Method for balancing of distributed tree file structures in parallel computing systems to enable recovery after a failure

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

166 Citations

11 Claims

Specification

Solutions

Use Cases

Quick Links

Method for balancing of distributed tree file structures in parallel computing systems to enable recovery after a failure

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

166 Citations

11 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links