Efficient data management in a cluster file system
First Claim
Patent Images
1. A cluster file system accessible to clients through a network, comprising:
- a plurality of file system nodes in a cluster, including a first node and a second node;
a backbone switch interconnecting the first node and the second node;
a metadata structure identifying the node on which datasets are stored; and
the first node comprising a first cache and a dataset controller configured to, if a specified dataset is stored on the second node;
receive a request from a client to perform a file system operation on the specified dataset;
access the metadata structure to determine the node on which the specified dataset is stored;
retrieve through the backbone switch from the second node that a first portion of the specified dataset to which the file system operation is directed and leave a remainder portion of the specified dataset stored in the second node;
store the retrieved first portion in the first cache; and
upon completion of the file system operation, modify the metadata structure to indicate that at least the first portion of the specified dataset is stored in the first node, whereby the first portion is not returned through the backbone switch to the second node.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and systems manage datasets in a cluster file system. A request is received from a client to perform a file system operation on a specified dataset stored in one of a plurality of nodes in a cluster. The specified dataset is retrieved from a first node through a backbone switch and stored in a cache in a second node. The requested file system operation is performed on the specified dataset and, upon completion of the requested operation, metadata is modified to indicate that the specified dataset is stored in the second node. The specified dataset is not returned through the backbone switch to the first node.
47 Citations
21 Claims
-
1. A cluster file system accessible to clients through a network, comprising:
-
a plurality of file system nodes in a cluster, including a first node and a second node;
a backbone switch interconnecting the first node and the second node;
a metadata structure identifying the node on which datasets are stored; and
the first node comprising a first cache and a dataset controller configured to, if a specified dataset is stored on the second node;
receive a request from a client to perform a file system operation on the specified dataset;
access the metadata structure to determine the node on which the specified dataset is stored;
retrieve through the backbone switch from the second node that a first portion of the specified dataset to which the file system operation is directed and leave a remainder portion of the specified dataset stored in the second node;
store the retrieved first portion in the first cache; and
upon completion of the file system operation, modify the metadata structure to indicate that at least the first portion of the specified dataset is stored in the first node, whereby the first portion is not returned through the backbone switch to the second node. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method for managing datasets in a cluster file system, comprising:
-
receiving a request from a client to perform a file system operation on a specified dataset stored in one of a plurality of nodes in a cluster;
retrieving the specified dataset from a first node through a backbone switch;
storing the retrieved specified dataset in a cache in a second node;
performing the requested file system operation on the specified dataset; and
upon completion of the requested operation, modifying metadata to indicate that the specified dataset is stored in the second node, whereby the specified dataset is not returned through the backbone switch to the first node. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer program product of a computer readable medium usable with a programmable computer, the computer program product having computer-readable code embodied therein for managing datasets in a cluster file system, the computer-readable code comprising instructions for:
-
receiving a request from a client to perform a file system operation on a specified dataset stored in one of a plurality of nodes in a cluster;
retrieving the specified dataset from a first node through a backbone switch;
storing the retrieved specified dataset in a cache in a second node;
performing the requested file system operation on the specified dataset; and
upon completion of the requested operation, modifying metadata to indicate that the specified dataset is stored in the second node, whereby the specified dataset is not returned through the backbone switch to the first node. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A file system node in a multi-node cluster file system, comprising:
-
means for interconnecting the node to at least a second node through a backbone switch;
a cache;
a metadata structure identifying the node on which datasets are stored;
means for receiving a request from a client to perform a file system operation on a specified dataset;
means for accessing the metadata structure to determine the node on which the specified dataset is stored;
if the specified dataset is stored on the second node, means for retrieving through the backbone switch that first portion of the specified dataset to which the file system operation is directed and leaving a remainder portion of the specified dataset stored in the second node;
means for storing the retrieved first portion in the first cache; and
means for modifying the metadata structure upon completion of the file system operation to indicate that at least the first portion of the specified dataset is stored in the first node, whereby the first portion is not returned through the backbone switch to the second node. - View Dependent Claims (19, 20, 21)
-
Specification