Smooth restart of storage clusters in a storage system
First Claim
Patent Images
1. A method comprising:
- starting a survivor process associated with a storage cluster application;
detecting a crash of the storage cluster application; and
performing a smooth restart of the storage cluster application after the crash according to a new storage cluster application configured to replace the storage cluster application by managing open handles associated with the storage cluster application,wherein performing a smooth restart of the storage cluster application after the crash comprises;
recovering context data associated with the storage cluster application;
determining consistency of the recovered context data;
recovering open handles associated with the storage cluster application from the associated survivor process; and
replaying any commands that were not complete, based upon the recovered open handles when the crash was detected to the storage cluster application.
8 Assignments
0 Petitions
Accused Products
Abstract
Described embodiments provide methods and systems for smoothly restarting a crashed storage cluster application associated with a storage cluster of a storage system. A survivor process associated with each storage cluster application is started. If a crash of at least one of the storage cluster applications is detected, the storage system performs a smooth restart of the at least one crashed storage cluster application. A smooth restart of the at least one crashed storage cluster application is performed according to a new storage cluster application that is configured to replace the crashed storage cluster application by managing open handles associated with the crashed storage cluster application.
155 Citations
17 Claims
-
1. A method comprising:
-
starting a survivor process associated with a storage cluster application; detecting a crash of the storage cluster application; and performing a smooth restart of the storage cluster application after the crash according to a new storage cluster application configured to replace the storage cluster application by managing open handles associated with the storage cluster application, wherein performing a smooth restart of the storage cluster application after the crash comprises; recovering context data associated with the storage cluster application; determining consistency of the recovered context data; recovering open handles associated with the storage cluster application from the associated survivor process; and replaying any commands that were not complete, based upon the recovered open handles when the crash was detected to the storage cluster application. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
a processor; and memory storing computer program code that when executed on the processor causes the processor to execute commands of a storage system operable to perform the operations of; starting a survivor process associated with a storage cluster application; detecting a crash of the storage cluster application; and performing a smooth restart of the storage cluster application after the crash according to a new storage cluster application configured to replace the storage cluster application by managing open handles associated with the storage cluster application, wherein performing a smooth restart of the storage cluster application after the crash comprises; recovering context data associated with the storage cluster application; determining consistency of the recovered context data; recovering open handles associated with the storage cluster application from the associated survivor process; and replaying any commands that were not complete, based upon the recovered open handles, when the crash was detected to the storage cluster application. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A computer program product including a non-transitory computer readable storage medium having computer program code encoded thereon that when executed on a processor of a computer causes the computer to execute commands of a storage system, the computer program product comprising:
-
computer program code for starting a survivor process associated with a storage cluster application; computer program code for detecting a crash of the storage cluster application; and computer program code for performing a smooth restart of the storage cluster application after the crash according to a new storage cluster application configured to replace the storage cluster application by managing open handles associated with the storage cluster application, wherein performing a smooth restart of the storage cluster application after the crash comprises; recovering context data associated with the storage cluster application; determining consistency of the recovered context data; recovering open handles associated with the storage cluster application from the associated survivor process; and replaying any commands that were not complete based upon the recovered open handles when the crash was detected to the storage cluster application. - View Dependent Claims (15, 16, 17)
-
Specification