SCALABLE SECONDARY STORAGE SYSTEMS AND METHODS
First Claim
1. A method for managing data on a secondary storage system comprising:
- distributing data blocks to different data containers located in a plurality of different physical storage nodes in a node network to generate redundant chains of data containers in the nodes;
detecting an addition of active storage nodes to the network;
automatically splitting at least one chain of containers in response to detecting the addition; and
transferring at least a portion of data split from the at least one chain of containers from one of said storage nodes to another of said storage nodes to enhance system robustness against node failures.
2 Assignments
0 Petitions
Accused Products
Abstract
Exemplary systems and methods in accordance with embodiments of the present invention may provide a plurality of data services by employing splittable, mergable and transferable redundant chains of data containers. The chains and containers may be automatically split and/or merged in response to changes in storage node network configurations and may be stored in erasure coded fragments distributed across different storage nodes. Data services provided in a distributed secondary storage system utilizing redundant chains of containers may include global deduplication, dynamic scalability, support for multiple redundancy classes, data location, fast reading and writing of data and rebuilding of data due to node or disk failures.
135 Citations
20 Claims
-
1. A method for managing data on a secondary storage system comprising:
-
distributing data blocks to different data containers located in a plurality of different physical storage nodes in a node network to generate redundant chains of data containers in the nodes; detecting an addition of active storage nodes to the network; automatically splitting at least one chain of containers in response to detecting the addition; and transferring at least a portion of data split from the at least one chain of containers from one of said storage nodes to another of said storage nodes to enhance system robustness against node failures. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A secondary storage system comprising:
-
a network of physical storage nodes, wherein each storage node includes a storage medium configured to store fragments of data blocks in a chain of data containers that is redundant with respect to chains of data containers in other storage nodes; and a storage server configured to detect an addition of active storage nodes to the network, to automatically split at least one chain of containers on said storage medium in response to detecting the addition, and to transfer at least a portion of data split from the at least one chain of containers to a different storage node to enhance system robustness against node failures. - View Dependent Claims (12, 13, 14, 15, 16)
-
- 17. The system of 16, wherein the automatic split comprises extending at least one of said prefixes to generate at least one additional subset of storage nodes.
-
19. A method for managing data on a secondary storage system comprising:
-
distributing data blocks to different data containers located in a plurality of different physical storage nodes in a node network to generate redundant chains of data containers in the nodes; detecting a change in the number of active storage nodes in the network; and automatically merging at least one data container located in one of said storage nodes with another data container located in a different storage node in response to detecting the change to ensure manageability of the containers. - View Dependent Claims (20)
-
Specification