Log Structured Content Addressable Deduplicating Storage
18 Assignments
0 Petitions
Accused Products
Abstract
A log structured content addressable deduplicated data storage system may be used to store deduplicated data. Data to be stored is partitioned into data segments. Each unique data segment is associated with a label. The storage system maintains a transaction log. Mutating storage operations are initiated by storing transaction records in the transaction log. Additional transaction records are stored in the log when storage operations are completed. Upon restarting an embodiment of the data storage system, the transaction records from the transaction logs are replayed to recreate the state of the data storage system. The data storage system updates file system metadata with transaction information while a storage operation associated with the file is being processed. This transaction information serves as atomically updated transaction commit points, allowing fully internally consistent snapshots of deduplicated volumes to be taken at any time.
-
Citations
37 Claims
-
1-12. -12. (canceled)
-
13. A method of modifying data in a data storage system, the method comprising:
-
identifying a storage label and storage data associated with a storage operation; generating a first transaction record associated with the storage operation; storing the first transaction record in a transaction log data structure; modifying a portion of the data storage system associated with the storage label and the storage data based on the storage operation; identifying a file system file associated with the portion of the data storage system; performing an atomic file system operation on the file system file to indicate a committal of the storage operation; generating a second transaction record, wherein the second transaction is adapted to indicate that the storage operation is complete; and storing the second transaction record in the transaction log data structure. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A method of restoring a data storage system, the method comprising:
-
accessing a transaction log including transaction records associated with storage system operations; identifying incomplete storage system operations from the transaction records; analyzing file system attributes associated with the incomplete storage system operations to identify at least a portion of the incomplete storage system records that have been previously committed; adding a portion of the transaction records associated with the portion of the incomplete storage system records to a list of storage operations to be reprocessed; and reprocessing a portion of the storage system operations corresponding with the transaction records included in the list of storage operations to reconstruct a prior state of the data storage system. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36, 37)
-
Specification