Methods and apparatuses for data protection
First Claim
1. A method, comprising:
- allocating respective storage space for multiple versions of each of multiple items of data;
making multiple data changes to said multiple items of data where more than one of said items of data is changed multiple times, wherein, changes made to a same item of data have respective transaction identifiers such that a previous change to said same item of data is recognizable as having been made earlier than a later change to said same item of data through analysis of the previous and later changes'"'"' respective transaction identifiers, and wherein, changes made to said same item of data are kept so as to create different kept versions of said same item of data, and wherein, storage space needed to keep all of said different kept versions exceeds storage space allocated for said same item of data'"'"'s multiple versions, said method therefore further including writing a newest version of said same item of data over an oldest version of said same item of data;
flushing said items of data from a cache to a non volatile storage;
marking said flushing of said items of data as a consistent point of said multiple items of data including assigning a global transaction identifier to said items of data;
making additional changes to some but not all of said items of data after said flushing and keeping versions thereof, wherein, said changes to some but not all of said items of data have associated transaction identifiers that are recognizable as being made after said assignment of said global transaction identifier, and, recording on a list respective transaction identifiers for said changes made to some but not all of said items of data in a list;
experiencing a crash before a next global transaction identifier is assigned for said items of data; and
,in response to said crash, referring to said list to remove said versions thereof so as to restore said set of data items to said consistent point such that said versions thereof are not recognized.
0 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatuses for data protection against corrupted write operations. In one embodiment, a technique of crash protection uses an atomically updated global transaction indicator to switch a set of data structures from one set of versions to another set. In one embodiment, the storage space for multiple versions of a data structure is allocated at the time the data structure is created on the storage device so that the multiple versions of the data structure can be kept close by. Between two consistent points, some data structures are modified to create new versions without changing the corresponding previous versions; and unmodified data structures are not copied. At a consistent point, the global transaction indicator is updated to indicate a set of consistent versions of the data structures, such as metadata of a file system.
85 Citations
18 Claims
-
1. A method, comprising:
-
allocating respective storage space for multiple versions of each of multiple items of data; making multiple data changes to said multiple items of data where more than one of said items of data is changed multiple times, wherein, changes made to a same item of data have respective transaction identifiers such that a previous change to said same item of data is recognizable as having been made earlier than a later change to said same item of data through analysis of the previous and later changes'"'"' respective transaction identifiers, and wherein, changes made to said same item of data are kept so as to create different kept versions of said same item of data, and wherein, storage space needed to keep all of said different kept versions exceeds storage space allocated for said same item of data'"'"'s multiple versions, said method therefore further including writing a newest version of said same item of data over an oldest version of said same item of data; flushing said items of data from a cache to a non volatile storage; marking said flushing of said items of data as a consistent point of said multiple items of data including assigning a global transaction identifier to said items of data; making additional changes to some but not all of said items of data after said flushing and keeping versions thereof, wherein, said changes to some but not all of said items of data have associated transaction identifiers that are recognizable as being made after said assignment of said global transaction identifier, and, recording on a list respective transaction identifiers for said changes made to some but not all of said items of data in a list; experiencing a crash before a next global transaction identifier is assigned for said items of data; and
,in response to said crash, referring to said list to remove said versions thereof so as to restore said set of data items to said consistent point such that said versions thereof are not recognized. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A machine readable storage medium containing stored program code that when processed by a central processing unit of a computing system causes a method to be performed, said method comprising:
-
allocating respective storage space for multiple versions of each of multiple items of data; making multiple data changes to said multiple items of data where more than one of said items of data is changed multiple times, wherein, changes made to a same item of data have respective transaction identifiers such that a previous change to said same item of data is recognizable as having been made earlier than a later change to said same item of data through analysis of the previous and later changes'"'"' respective transaction identifiers, and wherein, changes made to said same item of data are kept so as to create different kept versions of said same item of data, and wherein, storage space needed to keep all of said different kept versions exceeds storage space allocated for said same item of data'"'"'s multiple versions, said method therefore further including writing a newest version of said same item of data over an oldest version of said same item of data; flushing said items of data from a cache to a non volatile storage; marking said flushing of said items of data as a consistent point of said multiple items of data including assigning a global transaction identifier to said items of data; making additional changes to some but not all of said items of data after said flushing and keeping versions thereof, wherein, said changes to some but not all of said items of data have associated transaction identifiers that are recognizable as being made after said assignment of said global transaction identifier, and, recording on a list respective transaction identifiers for said changes made to some but not all of said items of data in a list; experiencing a crash before a next global transaction identifier is assigned for said items of data; and
,in response to said crash, referring to said list to remove said versions thereof so as to restore said set of data items to said consistent point such that said versions thereof are not recognized. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computing system having a storage medium and a central processing unit, said storage medium containing stored program code that when processed by said central processing unit causes a method to be performed, said method comprising:
-
allocating respective storage space for multiple versions of each of multiple items of data; making multiple data changes to said multiple items of data where more than one of said items of data is changed multiple times, wherein, changes made to a same item of data have respective transaction identifiers such that a previous change to said same item of data is recognizable as having been made earlier than a later change to said same item of data through analysis of the previous and later changes'"'"' respective transaction identifiers, and wherein, changes made to said same item of data are kept so as to create different kept versions of said same item of data, and wherein, storage space needed to keep all of said different kept versions exceeds storage space allocated for said same item of data'"'"'s multiple versions, said method therefore further including writing a newest version of said same item of data over an oldest version of said same item of data; flushing said items of data from a cache to a non volatile storage; marking said flushing of said items of data as a consistent point of said multiple items of data including assigning a global transaction identifier to said items of data; making additional changes to some but not all of said items of data after said flushing and keeping versions thereof, wherein, said changes to some but not all of said items of data have associated transaction identifiers that are recognizable as being made after said assignment of said global transaction identifier, and, recording on a list respective transaction identifiers for said changes made to some but not all of said items of data in a list; experiencing a crash before a next global transaction identifier is assigned for said items of data; and
,in response to said crash, referring to said list to remove said versions thereof so as to restore said set of data items to said consistent point such that said versions thereof are not recognized. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification