Identification and verification of common cluster files residing on nodes in a cluster

US 7,904,420 B2
Filed: 08/26/2008
Issued: 03/08/2011
Est. Priority Date: 08/26/2008
Status: Expired due to Fees

Abstract

In accordance with a particular embodiment of the present disclosure, common cluster files residing on nodes in a cluster may be managed by compiling a first list of all files stored on all nodes of the cluster, compiling a second list indicative of unique files and the number of nodes on which each unique file is stored from the first list, determining, from the second list, unique files which are not stored on all nodes, determining, from the second list, which files are required by all nodes, and determining, from the first list and the second list, which files must be added to each node.

122 Citations

20 Claims

1. A method for identification and verification of common cluster files and/or directories residing on nodes in a cluster, the method comprising:
- compiling a first list of all files stored on all nodes of the cluster, wherein a file name concatenated to a complete directory path concatenated to a node number for each file constitutes a primary key for the first list;
- compiling a second list of primary keys from the first list by removing the node number from each primary key;
- sorting the second list by primary key;
- compiling a third list of unique primary keys from the second list together with a primary key count representing the number of items in the first list for each unique primary key;
- sorting the third list by primary key count;
- compiling a fourth list of unique primary keys from the third list of unique primary keys by discarding any entry from the third list in which the primary key count is equal to the number of nodes in the cluster;
- compiling a fifth list of unique primary keys from the fourth list of unique primary keys by discarding any entry from the fourth list in which the primary key count is less than or equal to a predetermined threshold indicative of the population of unique files; and
- storing the fifth list on a computer readable medium.
- Dependent claims (2, 3, 4, 5, 6, 7)
  - 2. The method of claim 1, the method further comprising comparing the fifth list to the first list to determine the common cluster files and/or directories that must be added to and/or replaced on each node.
  - 3. The method of claim 1, the method further comprising selecting a subset of nodes in the cluster for processing.
  - 4. The method of claim 1, the method further comprising selecting a subset of files and/or directories in the cluster for processing.
  - 5. The method of claim 1 wherein the files each comprise attributes including:
    - file permissions;
    - file ownership;
    - group ownership;
    - file size;
    - file link;

    and further comprising selecting and concatenating each attribute to the primary key of the first list.
  - 6. The method of claim 1 wherein the predetermined threshold is selected by a user.
  - 7. The method of claim 1 wherein the computer readable medium comprises common cluster storage.
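
The steps of claim 1 can be sketched in Python as follows. This is an illustrative sketch only, not the patented implementation: the function names `compile_lists` and `store_list`, the `inventory` mapping of node number to file paths, and the JSON output format are assumptions made for the example, and the primary key is modeled as a (path, node) tuple rather than a concatenated string.

```python
import json
from collections import Counter


def compile_lists(inventory, threshold):
    """Return (first_list, fifth_list) for a hypothetical inventory.

    inventory: dict mapping node number -> iterable of full file paths.
    threshold: counts at or below this value are treated as node-unique files.
    """
    num_nodes = len(inventory)

    # First list: one entry per file per node; key = full path plus node number.
    first = [(path, node) for node, paths in inventory.items() for path in paths]

    # Second list: the same keys with the node number removed, sorted by key.
    second = sorted(path for path, _node in first)

    # Third list: unique keys with their occurrence counts, sorted by count.
    third = sorted(Counter(second).items(), key=lambda item: item[1])

    # Fourth list: discard keys whose count equals the number of nodes.
    fourth = [(key, count) for key, count in third if count != num_nodes]

    # Fifth list: discard keys whose count is at or below the threshold.
    fifth = [(key, count) for key, count in fourth if count > threshold]

    return first, fifth


def store_list(entries, out_path):
    """Store a list on a computer readable medium (here, a JSON file)."""
    with open(out_path, "w", encoding="utf-8") as fh:
        json.dump(entries, fh, indent=2)
```

For example, with four nodes where `/etc/hosts` is on all four, `/etc/ntp.conf` is on three, and `/tmp/a` is on one, a threshold of 1 leaves only `/etc/ntp.conf` (count 3) in the fifth list: the file present everywhere is dropped by the fourth-list step and the single-node file is dropped by the threshold.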

8. A method for managing common cluster files residing on nodes in a cluster, the method comprising:
- compiling a first list of all files stored on all nodes of the cluster;
- compiling a second list indicative of unique files and the number of nodes on which each unique file is stored from the first list;
- determining, from the second list, unique files which are not stored on all nodes;
- determining, from the second list, which files are required by all nodes; and
- determining, from the first list and the second list, which files must be added to and/or replaced on each node.
- Dependent claims (9, 10, 11, 12, 13, 14)
  - 9. The method of claim 8, the method further comprising selecting a subset of nodes in the cluster for processing.
  - 10. The method of claim 8, the method further comprising selecting a subset of files and/or directories in the cluster for processing.
  - 11. The method of claim 8 wherein the files each comprise attributes including:
    - file permissions;
    - file ownership;
    - group ownership;
    - file size;
    - file link;

    and further comprising selecting and concatenating each attribute to the primary key of the first list.
  - 12. The method of claim 8 wherein a predetermined threshold may be selected for determining unique files.
  - 13. The method of claim 8 wherein the first list and the second list are stored on a computer readable medium.
  - 14. The method of claim 13 wherein the computer readable medium comprises common cluster storage.
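
The managing method of claim 8 might be sketched as below. The claim does not say how "required by all nodes" is decided, so this sketch assumes the threshold rule of claim 12: a file missing from some nodes is treated as required when the number of nodes holding it exceeds the threshold. The function name `plan_additions` and the `inventory` input shape are hypothetical, and the sketch covers only files to be added, not files to be replaced.

```python
from collections import defaultdict


def plan_additions(inventory, threshold):
    """Return {node: [paths to add]} for a hypothetical inventory.

    inventory: dict mapping node number -> iterable of full file paths.
    threshold: assumed cutoff; files on more than this many nodes
               (but not on all) are treated as required everywhere.
    """
    all_nodes = set(inventory)

    # Second list: each unique file with the set of nodes that store it.
    nodes_by_file = defaultdict(set)
    for node, paths in inventory.items():
        for path in paths:
            nodes_by_file[path].add(node)

    # Unique files that are not stored on all nodes.
    partial = {p: held for p, held in nodes_by_file.items() if held != all_nodes}

    # Assumed rule: partially present files above the threshold are required.
    required = {p for p, held in partial.items() if len(held) > threshold}

    # Compare against each node's holdings to find what must be added.
    return {
        node: sorted(p for p in required if node not in nodes_by_file[p])
        for node in sorted(all_nodes)
    }
```

Using node sets rather than bare counts keeps the information needed for the final step, since the plan must name the specific nodes that lack each required file.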

15. Logic for managing common cluster files residing on nodes in a cluster, the logic embodied in a computer-readable medium and operable to:
- compile a first list of all files stored on all nodes of the cluster;
- compile a second list indicative of unique files and the number of nodes on which each unique file is stored from the first list;
- determine, from the second list, unique files which are not stored on all nodes;
- determine, from the second list, which files are required by all nodes; and
- determine, from the first list and the second list, which files must be added to and/or replaced on each node.
- Dependent claims (16, 17, 18, 19, 20)
  - 16. The logic of claim 15, the logic further operable to select a subset of nodes in the cluster for processing.
  - 17. The logic of claim 15, the logic further operable to select a subset of files and/or directories in the cluster for processing.
  - 18. The logic of claim 15 wherein the files each comprise attributes including:
    - file permissions;
    - file ownership;
    - group ownership;
    - file size;
    - file link;

    and wherein the logic is further operable to select and concatenate each attribute to the primary key of the first list.
  - 19. The logic of claim 15 wherein the logic is further operable to select a predetermined threshold for determining unique files.
  - 20. The logic of claim 15 wherein the first list and the second list are stored on common cluster storage.
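
Claims 5, 11, and 18 extend the primary key with file attributes, so that two files with the same path but different permissions, ownership, or size produce different keys. A minimal sketch of such a key, using Python's standard `os.lstat`, is shown below. The function name `attribute_key`, the `|` separator, and the field order are assumptions, and "file link" is interpreted here as the symbolic link target, which the claims do not specify.

```python
import os
import stat


def attribute_key(path, sep="|"):
    """Build a key from a path plus selected attributes (hypothetical format)."""
    st = os.lstat(path)  # lstat so a symbolic link is described, not followed

    # Assumption: "file link" means the target of a symbolic link, if any.
    link = os.readlink(path) if stat.S_ISLNK(st.st_mode) else ""

    fields = [
        path,
        stat.filemode(st.st_mode),  # file permissions, e.g. "-rw-r--r--"
        str(st.st_uid),             # file ownership (numeric user id)
        str(st.st_gid),             # group ownership (numeric group id)
        str(st.st_size),            # file size in bytes
        link,                       # file link
    ]
    return sep.join(fields)
```

With keys of this form, a file whose attributes differ on one node is counted separately from the copies on the other nodes, so it can surface as a candidate for replacement rather than being treated as identical.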

Current Assignee: Raytheon Company (Rtx Corporation)
Original Assignee: Raytheon Company (Rtx Corporation)
Inventors: Ianni, James C.
Primary Examiner(s): Le; Uyen T.
Application Number: US12/198,365
Publication Number: US 20100057738A1
Time in Patent Office: 924 Days
Field of Search: None
US Class Current: 707/620
CPC Class Codes: G06F 16/16 File or folder operations, ...
