Parallel recovery by non-failed nodes
First Claim
Patent Images
1. A method comprising computer-implemented steps of:
- determining that a particular node has failed, wherein the particular node is in a shared-nothing database system in which at least some database data, on which the particular node performed operations, is stored on a particular storage device; and
in response to determining that the failure has occurred, performing parallel recovery, by two or more non-failed nodes that have physical access to the particular storage device, of at least some of the database data on which the particular node performed operations.
1 Assignment
0 Petitions
Accused Products
Abstract
Various techniques are described for improving the performance of a shared-nothing database system in which at least two of the nodes that are running the shared-nothing database system have shared access to a disk. Specifically, techniques are provided for recovering the data owned by a failed node using multiple recovery nodes operating in parallel. The data owned by a failed node is reassigned to recovery nodes that have access to the shared disk on which the data resides. The recovery logs of the failed node are read by the recovery nodes, or by a coordinator process that distributes the recovery tasks to the recovery nodes.
-
Citations
30 Claims
-
1. A method comprising computer-implemented steps of:
-
determining that a particular node has failed, wherein the particular node is in a shared-nothing database system in which at least some database data, on which the particular node performed operations, is stored on a particular storage device; and in response to determining that the failure has occurred, performing parallel recovery, by two or more non-failed nodes that have physical access to the particular storage device, of at least some of the database data on which the particular node performed operations. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A non-volatile or volatile computer-readable non-transitory medium carrying one or more sequences of instructions which, when executed by one or more processors, causes the one or more processors to perform steps comprising:
-
determining that a particular node has failed, wherein the particular node is in a shared-nothing database system in which at least some database data, on which the particular node performed operations, is stored on a particular storage device; and in response to determining that the failure has occurred, performing parallel recovery, by two or more non-failed nodes that have physical access to the particular storage device, of at least some of the database data on which the particular node performed operations. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
Specification