Recovering transactions of failed nodes in a clustered file system
Abstract
Systems, methods, and computer program products are provided for recovering transactions of failed nodes using a recovery procedure in a clustered file system (CFS). A determination is made, via a distributed shared memory (DSM) agent, that a data segment should be copied to a final storage location by validating that ownership of the data segment is not associated with any other operational node. The ownership of the data segment is then set to a local DSM agent.
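The ownership check described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `DsmAgent` class, its fields, and `claim_segment_for_recovery` are hypothetical names, and the ownership table is modeled as a plain local dictionary rather than real distributed shared memory.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Set


@dataclass
class DsmAgent:
    """Hypothetical stand-in for a node's DSM agent (illustration only)."""
    node_id: str
    operational_nodes: Set[str]  # nodes currently believed to be alive
    # Assumption: segment id -> owning node id, kept in a local dict here.
    owners: Dict[str, Optional[str]] = field(default_factory=dict)

    def should_copy_to_final_location(self, segment_id: str) -> bool:
        """True when no *other* operational node owns the segment."""
        owner = self.owners.get(segment_id)
        return (
            owner is None
            or owner == self.node_id
            or owner not in self.operational_nodes
        )

    def take_ownership(self, segment_id: str) -> None:
        """Set the ownership of the segment to this (local) agent."""
        self.owners[segment_id] = self.node_id


def claim_segment_for_recovery(agent: DsmAgent, segment_id: str) -> bool:
    """Validate ownership first, then move it to the local agent."""
    if not agent.should_copy_to_final_location(segment_id):
        return False
    agent.take_ownership(segment_id)
    return True
```

In this sketch a segment last owned by a failed node (one absent from `operational_nodes`) is claimed by the recovering node, while a segment owned by another live node is left alone.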
7 Citations
24 Claims
1. A method for recovering transactions of failed nodes using a recovery procedure in a clustered file system (CFS) using a processor device, the method comprising:
determining a data segment should be copied to a final storage location by validating that an ownership of the data segment is not associated with any other operational node, via a distributed shared memory (DSM) agent;
setting the ownership of the data segment to a local DSM agent;
including a transaction journal in each node for storing committed transactions generated by users on that node;
scanning a list to identify the location of the latest contents of a data segment prior to the transaction using a rollback procedure;
for each modified data segment, identifying the location of the latest contents prior to the transaction using the rollback procedure by:
if the data segment was marked as modified in the cache at the time it was inserted into the list, then the latest contents of this data segment appears only in the journal,
otherwise, if the data segment was not marked as modified in the cache at the time the data segment was inserted into the list, then the latest contents of this data segment appears in its final location in the shared storage; and
recording a type of each data segment in the list during insertion of the data segment into the list, wherein:
all data segments in the list whose latest contents appear in a final location are discarded from the cache,
for all the other data segments in the list, latest contents for the all the other data segments are restored from the journal into the cache by scanning the local transaction journal from ending to beginning, and first occurrences of these data segments in the local transaction journal are considered, and a modification indication of the data segments is set as true, and
exclusive permissions on all the data segments involved in a cancellation transaction are released.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
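The rollback steps recited in claim 1 can be sketched in Python as follows. This is one illustrative reading of the claim language, not the patentee's code: `CacheEntry`, `rollback`, and the shapes of the list, cache, journal, and lock set are all assumptions introduced for the example. The journal is assumed to be an append-only sequence of `(segment_id, contents)` records, so scanning it from end to beginning meets the newest committed copy of each segment first.

```python
from dataclasses import dataclass
from typing import Dict, List, Set, Tuple


@dataclass
class CacheEntry:
    data: bytes
    modified: bool = False  # the "modification indication"


def rollback(
    segment_list: Dict[str, bool],        # segment id -> was modified in cache at insertion
    cache: Dict[str, CacheEntry],
    journal: List[Tuple[str, bytes]],     # oldest record first (assumed layout)
    exclusive_locks: Set[str],
) -> None:
    """Cancel a transaction by restoring pre-transaction cache contents."""
    to_restore: Set[str] = set()
    for seg_id, was_modified in segment_list.items():
        if was_modified:
            # Latest pre-transaction contents exist only in the journal.
            to_restore.add(seg_id)
        else:
            # Latest contents are in the final location: drop the cached copy.
            cache.pop(seg_id, None)

    # Scan the journal from ending to beginning; only the first occurrence
    # of each wanted segment (its newest committed copy) is considered.
    for seg_id, contents in reversed(journal):
        if not to_restore:
            break
        if seg_id in to_restore:
            cache[seg_id] = CacheEntry(data=contents, modified=True)
            to_restore.discard(seg_id)

    # Release exclusive permissions on every segment in the cancelled transaction.
    exclusive_locks.difference_update(segment_list)
```

Recording the type of each segment when it is inserted into the list is what lets the procedure decide, without touching shared storage, whether to discard the cached copy or to restore it from the journal.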
9. A system for recovering transactions of failed nodes using a recovery procedure in a clustered file system (CFS), the system comprising:
a cluster of nodes, the CFS including the cluster of nodes forming a computer cluster,
a distributed shared memory (DSM) agent within a node of the cluster of nodes;
a plurality of storage devices in communication with the CFS,
a cache associated with the node,
transaction journals belonging to each one of the cluster of nodes, and
a processor device having a memory coupled to the processor device for controlling the CFS, wherein the processor device is assigned to the node and the node is in communication with the plurality of storage devices, wherein the processor device:
determines a data segment should be copied to a final storage location by validating that an ownership of the data segment is not associated with any other operational node, via a distributed shared memory (DSM) agent,
sets the ownership of the data segment to a local DSM agent,
includes the transaction journal in each node for storing committed transactions generated by users on that node,
includes a list to identify the location of the latest contents of a data segment prior to the transaction using a rollback procedure,
for each modified data segment, identifies the location of the latest contents prior to the transaction using the rollback procedure by:
if the data segment was marked as modified in the cache at the time it was inserted into the list, then the latest contents of this data segment appears only in the journal,
otherwise, if the data segment was not marked as modified in the cache at the time the data segment was inserted into the list, then the latest contents of this data segment appears in its final location in the shared storage, and
records a type of each data segment in the list during insertion of the data segment into the list, wherein:
all data segments in the list whose latest contents appear in a final location are discarded from the cache,
for all the other data segments in the list, latest contents for the all the other data segments are restored from the journal into the cache by scanning the local transaction journal from ending to beginning, and first occurrences of these data segments in the local transaction journal are considered, and a modification indication of the data segments is set as true, and
exclusive permissions on all the data segments involved in a cancellation transaction are released.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. A computer program product for recovering transactions of failed nodes using a recovery procedure in a clustered file system (CFS) using a processor device, the computer program product comprising a non-transitory computer-readable storage medium having computer-readable program code portions stored therein, the computer-readable program code portions comprising:
a first executable portion that determines a data segment should be copied to a final storage location by validating that an ownership of the data segment is not associated with any other operational node, via a distributed shared memory (DSM) agent;
a second executable portion that sets the ownership of the data segment to a local DSM agent;
a third executable portion that includes a transaction journal in each node for storing committed transactions generated by users on that node;
a fourth executable portion that includes a list to identify the location of the latest contents of a data segment prior to the transaction using a rollback procedure;
for each modified data segment, a fifth executable portion that identifies the location of the latest contents prior to the transaction using the rollback procedure by:
if the data segment was marked as modified in the cache at the time it was inserted into the list, then the latest contents of this data segment appears only in the journal,
otherwise, if the data segment was not marked as modified in the cache at the time the data segment was inserted into the list, then the latest contents of this data segment appears in its final location in the shared storage; and
a sixth executable portion that records a type of each data segment in the list during insertion of the data segment into the list, wherein:
all data segments in the list whose latest contents appear in a final location are discarded from the cache,
for all the other data segments in the list, latest contents for the all the other data segments are restored from the journal into the cache by scanning the local transaction journal from ending to beginning, and first occurrences of these data segments in the local transaction journal are considered, and a modification indication of the data segments is set as true, and
exclusive permissions on all the data segments involved in a cancellation transaction are released.
Dependent claims: 18, 19, 20, 21, 22, 23, 24
Specification