Long term data protection system and method
First Claim
Patent Images
1. A method for accessing a storage system comprising:
- accessing a data object, the data object being divisible into one or more partitions, the partitions comprising data from the data object, the partitions referred to as input partitions; and
for each input partition, if there are no other partitions among other data objects in the storage system that are identical to the input partition, then producing one or more replicas of the input partition.
3 Assignments
0 Petitions
Accused Products
Abstract
A file to be written can be partitioned into one or more partitions. Each such input partition is identified with a hash code and a group ID. Replica(s) of an input partition can be created to ensure a certain number of identical partitions. When a file is accessed, each partition that is read out can be checked for corruption. Corrupted readout partitions are then replaced with a validated replacement partition, whether the replacement partition is from a replica or from another file.
26 Citations
34 Claims
-
1. A method for accessing a storage system comprising:
-
accessing a data object, the data object being divisible into one or more partitions, the partitions comprising data from the data object, the partitions referred to as input partitions; and for each input partition, if there are no other partitions among other data objects in the storage system that are identical to the input partition, then producing one or more replicas of the input partition. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for accessing a storage system comprising:
-
receiving data for a first file, to be stored in the storage system; providing partition data from the first file which constitutes a first partition of the first file; if a number of second partitions in the storage system is less than a first predetermined value, then producing a number of replicas of the first partition sufficient to increase the number of second partitions to a second predetermined value, wherein each second partition comprises data belonging to a file in the storage system and is identical to the partition data; and if the number of second partitions is greater than a third predetermined value and if there are one or more replicas of the first partition, then deleting one or more of the replicas, wherein the number of second partitions is reduced; and repeating for additional partition data comprising the first file. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A method for accessing a storage system comprising:
-
receiving a request to store a file; storing the file on the storage system; identifying one or more partitions which collectively constitute the file, the partitions referred to as input partitions; storing partition information that is associated with the file, wherein the partition information associates the file with each of its input partitions; and for each input partition, if there are no identical partitions, then if the number of replicas of the input partition is less than a threshold value, then producing at least one replica of the input partition and storing the replica on the storage system, wherein an identical partition is a partition, other than the input partition, of a file that is stored in the storage system whose content is identical to content of the input partition. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A data storage system comprising:
-
a storage component; and a data processing component in data communication with the storage component, the data processing component for receiving access requests from users, the access requests for accessing data that is stored in the storage component or for storing data to the storage component, the data processing component configured to perform the method steps of; accessing a first partition of a file, the first partition comprising a first portion of data that constitutes the file; if the first partition does not have a corresponding identical partition in the storage component, then creating at least one replica; and repeating for a second partition of the file, the second partition comprising a second portion of the data. - View Dependent Claims (29, 30, 31, 32)
-
-
33. A data storage system comprising:
-
first means for producing a partition ID for each partition comprising a file stored in the storage system, a partition comprising data from the file, the first means producing a first partition ID for a first partition of a first file; second means for identifying one or more identical partitions among other files in the storage system based on a first partition ID; and third means for creating a replica of the first partition in response to the second means making a determination that there are no identical partitions, wherein the first means, the second means, and the third means operate on every partition comprising the first file. - View Dependent Claims (34)
-
Specification