Distributed database log recovery

US 9,058,371 B2
Filed: 11/07/2011
Issued: 06/16/2015
Est. Priority Date: 11/07/2011
Status: Active Grant

First Claim

Patent Images

1. A non-transitory computer program product storing instructions that, when executed by at least one programmable processor, cause the at least one programmable processor to perform operations comprising:

recording, in a data storage application, log entries for a plurality of transactions among nodes in a node hierarchy, the node hierarchy comprising a master node comprising a transaction coordinator and having a plurality of slave nodes, each slave node having a separate log at which its respective log entries are stored, the transaction coordinator storing respective prepare commit positions of each slave node in a commit record;

replaying, prior to replay of log entries at the slave nodes, at least a portion of the master node log entries until a first replay position is reached, the first replay position comprising positions of respective commit log records at the slave nodes until which the slave nodes must replay their respective logs in order to come to a transactionally-consistent state with the master node;

replaying, for each slave node and in response to at least one trigger from the master node, at least a portion of its respective log entries until the corresponding slave node reaches an end of its respective log prior to the first replay position;

initiating, by the transaction coordinator of the master node in parallel to at least a portion of the replaying by the slave nodes prior to the first replay position, replay of at least a portion of its log entries subsequent to the first replay position; and

discarding all log entries stored beyond the first replay position.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Log entries are recorded in a data storage application (such as an in-memory database, etc.) for a plurality of transactions among nodes in a node hierarchy. The node hierarchy comprises master node having a plurality of slave nodes. Thereafter, at least a portion of the master node log entries are replayed until a first replay position is reached. Next, for each slave node, at least a portion of its respective log entries are replayed until the first replay position is reached (or an error occurs). Subsequently, replay of at least a portion of the log entries of the master node that are subsequent to the first replay position is initiated by the master node in parallel to at least a portion of the replaying by the slave nodes. Related apparatus, systems, techniques and articles are also described.

18 Citations

View as Search Results

39 Claims

1. A non-transitory computer program product storing instructions that, when executed by at least one programmable processor, cause the at least one programmable processor to perform operations comprising:
- recording, in a data storage application, log entries for a plurality of transactions among nodes in a node hierarchy, the node hierarchy comprising a master node comprising a transaction coordinator and having a plurality of slave nodes, each slave node having a separate log at which its respective log entries are stored, the transaction coordinator storing respective prepare commit positions of each slave node in a commit record;
  
  replaying, prior to replay of log entries at the slave nodes, at least a portion of the master node log entries until a first replay position is reached, the first replay position comprising positions of respective commit log records at the slave nodes until which the slave nodes must replay their respective logs in order to come to a transactionally-consistent state with the master node;
  
  replaying, for each slave node and in response to at least one trigger from the master node, at least a portion of its respective log entries until the corresponding slave node reaches an end of its respective log prior to the first replay position;
  
  initiating, by the transaction coordinator of the master node in parallel to at least a portion of the replaying by the slave nodes prior to the first replay position, replay of at least a portion of its log entries subsequent to the first replay position; and
  
  discarding all log entries stored beyond the first replay position.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
- - 2. A computer program product as in claim 1, wherein the node hierarchy comprises at least two master nodes, each having corresponding slave nodes, and wherein the first replay position differs for at least two of the master nodes.
  - 3. A computer program product as in claim 1, wherein the operations further comprise:
    - performing a snapshot of a local persistency for the master node after replay to the first replay position.
  - 4. A computer program product as in claim 3, wherein the snapshot is performed using shadow-paging.
  - 5. A computer program product as in claim 1, wherein the operations further comprise:
    - performing a snapshot of a local persistency for each slave node after replay to the first replay position.
  - 6. A computer program product as in claim 5, wherein the snapshot of the local persistency is performed using shadow-paging.
  - 7. A computer program product as in claim 1, wherein:
    - the slave nodes replay their log entries to reach a transactionally-consistent state with the master node;
      
      the data storage application is implemented on a distributed database system that includes the master node and the plurality of slave nodes; and
      
      the replaying on the master node and the plurality of slave nodes occurs on the distributed database system.
  - 8. A computer program product as in claim 1, wherein the data storage application comprises an in-memory database.
  - 9. A computer program product as in claim 1, wherein the operations further comprise:
    - stopping, by at least one of the slave nodes, replay prior to the first replay position if a problem occurs during replay.
  - 10. A computer program product as in claim 9, wherein the problem comprises a corrupt or missing log.
  - 11. A computer program product as in claim 9, wherein the operations further comprise:
    - reporting, by the stopped at least one of the slave nodes, a maximum recoverable replay position prior to the first replay position to the master node.
  - 12. A computer program product as in claim 11, wherein the operations further comprise:
    - resetting the master node persistency to the maximum recoverable replay position.
  - 13. A computer program product as in claim 12, wherein the operations further comprise:
    - instructing each slave node by the master node to reset its persistency to the maximum recoverable replay position.
  - 14. A computer program product as in claim 13, wherein the operations further comprise:
    - adding a filter by the master node preventing replay by the slave nodes beyond the maximum recoverable replay position.
  - 15. A computer program product as in claim 14, wherein the operations further comprise:
    - discarding log entries stored at the master node and the corresponding slave nodes beyond the maximum recoverable replay position.

16. A method comprising:
- recording, in a data storage application, log entries for a plurality of transactions among nodes in a node hierarchy, the node hierarchy comprising at least one master node comprising a transaction coordinator and having a plurality of slave nodes, the transaction coordinator coordinating replay among the plurality of slave nodes, each slave node having a separate log at which its respective log entries are stored, the transaction coordinator storing respective prepare commit positions of each slave node in a commit record;
  
  replaying, for each master node and prior to replay of log entries at the slave nodes, at least a portion of its log entries until a first replay position is reached, the first replay position comprising positions of respective commit log records at the slave nodes until which the slave nodes must replay their respective logs in order to come to a transactionally-consistent state with the master node;
  
  performing, for each master node, a snapshot of a local persistency for the master node;
  
  replaying, for each slave node of a corresponding master node and in response to at least one trigger from the corresponding master node, at least a portion of its respective log entries until a maximum recoverable position prior to the first replay position is reached on one of the slave nodes, the maximum recoverable position representing a latest transactionally-consistent state and being specified by a last commit log record replayed by each corresponding master node;
  
  performing a snapshot of a local persistency for each slave node after replay to the maximum recoverable position;
  
  replaying the log entries on the at least one master node and the corresponding slave nodes to the maximum recoverable position, wherein the at least one master node replays log entries at a point in time subsequent to a point in time at which the corresponding slave nodes are replaying log entries; and
  
  discarding all log entries stored beyond the maximum recoverable position.
- View Dependent Claims (17, 31, 32, 33, 34, 35, 36, 37, 38, 39)
- - 17. A method as in claim 16, wherein the snapshots are performed using shadow-paging.
  - 31. A method as in claim 16, wherein:
    - the data storage application is implemented on a distributed database system that includes the master node and the plurality of slave nodes; and
      
      the replaying on the master node and the plurality of slave nodes occurs on the distributed database system.
  - 32. A method as in claim 31, wherein the data storage application comprises an in-memory database.
  - 33. A method as in claim 16 further comprising:
    - stopping, by at least one of the slave nodes, replay prior to the first replay position if a problem occurs during replay.
  - 34. A method as in claim 33, wherein the problem comprises a corrupt or missing log.
  - 35. A method as in claim 33 further comprising:
    - reporting, by the stopped at least one of the slave nodes, a maximum recoverable replay position prior to the first replay position to the master node.
  - 36. A method as in claim 35 further comprising:
    - resetting the master node persistency to the maximum recoverable replay position.
  - 37. A method as in claim 36 further comprising:
    - instructing each slave node by the master node to reset its persistency to the maximum recoverable replay position.
  - 38. A method as in claim 37 further comprising:
    - adding a filter by the master node preventing replay by the slave nodes beyond the maximum recoverable replay position.
  - 39. A method as in claim 38 further comprising:
    - discarding log entries stored at the master node and the corresponding slave nodes beyond the maximum recoverable replay position.

18. A system comprising:
- at least one data processor; and
  
  memory coupled to the at least one data processor, the memory storing instructions to cause the at least one data processor to perform operations comprising;
  
  recording, in a data storage application, log entries for a plurality of transactions among nodes in a node hierarchy, the node hierarchy comprising at least one master node comprising a transaction coordinator and having a plurality of slave nodes, the transaction coordinator coordinating replay among the plurality of slave nodes, each slave node having a separate log at which its respective log entries are stored, the transaction coordinator storing respective prepare commit positions of each slave node in a commit record;
  
  replaying, for each master node and prior to replay of log entries at the slave nodes, at least a portion of its log entries until a first replay position is reached, the first replay position comprising positions of respective commit log records at the slave nodes until which the slave nodes must replay their respective logs in order to come to a transactionally-consistent state with the master node;
  
  replaying, for each slave node of a corresponding master node and in response to at least one trigger from the corresponding master node, at least a portion of its respective log entries until a maximum recoverable position prior to the first replay position is reached on one of the slave nodes, the maximum recoverable position representing a latest transactionally-consistent state;
  
  replaying, in parallel, the log entries on the at least one master node and the corresponding slave nodes to the maximum recoverable position, wherein the at least one master node replays log entries at a point in time subsequent to a point in time at which the corresponding slave nodes are replaying log entries; and
  
  discarding all log entries stored beyond the maximum recoverable position.
- View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
- - 19. A system as in claim 18, wherein the operations further comprise:
    - performing a snapshot of a local persistency for the master node after replay to the first replay position; and
      
      performing a snapshot of a local persistency for each slave node after replay to the first replay position.
  - 20. A system as in claim 18, wherein the node hierarchy comprises at least two master nodes, each having corresponding slave nodes, and wherein the first replay position differs for at least two of the master nodes.
  - 21. A system as in claim 19, wherein the snapshots are performed using shadow-paging.
  - 22. A system as in claim 18, wherein:
    - the slave nodes replay their log entries to reach a transactionally-consistent state with the master node;
      
      the data storage application is implemented on a distributed database system that includes the master node and the plurality of slave nodes; and
      
      the replaying on the master node and the plurality of slave nodes occurs on the distributed database system.
  - 23. A system as in claim 18, wherein the data storage application comprises an in-memory database.
  - 24. A system as in claim 18, wherein the operations further comprise:
    - stopping, by at least one of the slave nodes, replay prior to the first replay position if a problem occurs during replay.
  - 25. A system as in claim 24, wherein the problem comprises a corrupt or missing log.
  - 26. A system as in claim 25, wherein the operations further comprise:
    - reporting, by the stopped at least one of the slave nodes, a maximum recoverable replay position prior to the first replay position to the master node.
  - 27. A system as in claim 26, wherein the operations further comprise:
    - resetting the master node persistency to the maximum recoverable replay position.
  - 28. A system as in claim 27, wherein the operations further comprise:
    - instructing each slave node by the master node to reset its persistency to the maximum recoverable replay position.
  - 29. A system as in claim 28, wherein the operations further comprise:
    - adding a filter by the master node preventing replay by the slave nodes beyond the maximum recoverable replay position.
  - 30. A system as in claim 29, wherein the operations further comprise:
    - discarding log entries stored at the master node and the corresponding slave nodes beyond the maximum recoverable replay position.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
SAP SE
Original Assignee
SAP SE
Inventors
Thomsen, Dirk, Schreter, Ivan
Primary Examiner(s)
Alam, Hosain
Assistant Examiner(s)
HARPER, ELIYAH STONE

Application Number

US13/290,742
Publication Number

US 20130117237A1
Time in Patent Office

1,317 Days
Field of Search

707/999.101, 707/999.102, 707/715
US Class Current

1/1
CPC Class Codes

G06F 11/1474   in transactions G06F16/20 t...

G06F 16/2379   Updates performed during on...

G06F 16/27   Replication, distribution o...

G06F 2201/84   Using snapshots, i.e. a log...

Distributed database log recovery

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

18 Citations

39 Claims

Specification

Solutions

Use Cases

Quick Links

Distributed database log recovery

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

18 Citations

39 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links