DISTRIBUTED STORAGE SYSTEM AND CONTROL METHOD FOR DISTRIBUTED STORAGE SYSTEM
First Claim
1. A distributed storage system including a plurality of nodes, which are connected to each other and each of which includes a storage medium, for controlling storage of data in logical storage areas composed of the storage medium,wherein the plurality of nodes:
- provide a host system with a virtual storage area across the logical storage areas controlled by the respective nodes;
manage allocation of a plurality of virtual partial areas obtained by dividing the virtual storage area and a plurality of partial logical storage areas obtained by dividing the logical storage area; and
distribute data written to the virtual storage area and store the data in the logical storage areas;
wherein a first node among the plurality of nodes judges whether the same data as first data, which is written to a first virtual partial area managed by the first node from among the plurality of virtual partial areas, exists in the virtual partial area managed by another node among the plurality of nodes;
wherein when the same data as the first data exists in the other node, the first node executes inter-node deduplication for changing allocation of either one of logical partial areas for the first virtual partial area or the virtual partial area of the other node to which the same data is written, to the other logical partial area; and
wherein when the first node calculates a predicted value of I/O load on the first node after execution of the inter-node deduplication of the first virtual partial area and the predicted value satisfies a predetermined condition, the first node executes the inter-node deduplication of a second virtual partial area managed by the first node from among the plurality of virtual partial areas obtained by dividing the virtual storage area.
1 Assignment
0 Petitions
Accused Products
Abstract
Regarding a distributed storage system including a plurality of nodes, a first node among the plurality of nodes judges whether the same data as first data, which is written to a first virtual partial area managed by the first node from among a plurality of virtual partial areas, exists in the virtual partial area managed by another node among the plurality of nodes; when the same data as the first data exists in the other node, the first node executes inter-node deduplication for changing allocation of either one of logical partial areas for the first virtual partial area or the virtual partial area of the other node to which the same data is written, to the other logical partial area; and when I/O load on the first node after execution of the inter-node deduplication of the first virtual partial area and the predicted value is less than a first threshold, the first node executes the inter-node deduplication of a second virtual partial area managed by the first node from among the plurality of virtual partial areas obtained by dividing the virtual storage area.
5 Citations
13 Claims
-
1. A distributed storage system including a plurality of nodes, which are connected to each other and each of which includes a storage medium, for controlling storage of data in logical storage areas composed of the storage medium,
wherein the plurality of nodes: - provide a host system with a virtual storage area across the logical storage areas controlled by the respective nodes;
manage allocation of a plurality of virtual partial areas obtained by dividing the virtual storage area and a plurality of partial logical storage areas obtained by dividing the logical storage area; and
distribute data written to the virtual storage area and store the data in the logical storage areas;wherein a first node among the plurality of nodes judges whether the same data as first data, which is written to a first virtual partial area managed by the first node from among the plurality of virtual partial areas, exists in the virtual partial area managed by another node among the plurality of nodes; wherein when the same data as the first data exists in the other node, the first node executes inter-node deduplication for changing allocation of either one of logical partial areas for the first virtual partial area or the virtual partial area of the other node to which the same data is written, to the other logical partial area; and wherein when the first node calculates a predicted value of I/O load on the first node after execution of the inter-node deduplication of the first virtual partial area and the predicted value satisfies a predetermined condition, the first node executes the inter-node deduplication of a second virtual partial area managed by the first node from among the plurality of virtual partial areas obtained by dividing the virtual storage area. - View Dependent Claims (2, 3, 4, 5, 6, 7)
- provide a host system with a virtual storage area across the logical storage areas controlled by the respective nodes;
-
8. A control method for a distributed storage system including a plurality of nodes, which are connected to each other and each of which includes a storage medium, for controlling storage of data in logical storage areas composed of the storage medium, the control method for the distributed storage system, comprising:
-
providing a host system with a virtual storage area across the logical storage areas controlled by the plurality of nodes respectively, managing allocation of a plurality of virtual partial areas obtained by dividing the virtual storage area and a plurality of partial logical storage areas obtained by dividing the logical storage area, and distributing data written to the virtual storage area and storing the data in the logical storage areas; judging whether the same data as first data, which is written to a first virtual partial area managed by a first node from among the plurality of virtual partial areas, exists in the virtual partial area managed by another node among the plurality of nodes; executing inter-node deduplication, when the same data as the first data exists in the other node, for changing allocation of either one of logical partial areas for the first virtual partial area or the virtual partial area of the other node to which the same data is written, to the other logical partial area; and executing the inter-node deduplication of a second virtual partial area managed by the first node from among the plurality of virtual partial areas obtained by dividing the virtual storage area when a predicted value of I/O load on the first node after execution of the inter-node deduplication of the first virtual partial area is calculated and the predicted value satisfies a predetermined condition. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A distributed storage system including a first node connected to one or more nodes, each of which has a storage medium, for controlling storage of data in a logical storage area composed of the storage medium, the first node and the one or more nodes constituting the distributed storage system,
wherein the first node: - provides a host system with a virtual storage area across the logical storage area controlled by the one or more nodes and the logical storage area controlled by the first node;
manages allocation of a plurality of virtual partial areas obtained by dividing the virtual storage area and a plurality of partial logical storage areas obtained by dividing the logical storage area; and
distributes data written to the virtual storage area with the one or more nodes and stores the data in the logical storage area;wherein the first node judges whether the same data as first data, which is written to a first virtual partial area managed by the first node from among the plurality of virtual partial areas, exists in the virtual partial area managed by another node among the plurality of nodes; wherein when the same data as the first data exists in the other node, the first node executes inter-node deduplication for changing allocation of either one of logical partial areas for the first virtual partial area or the virtual partial area of the other node to which the same data is written, to the other logical partial area; and wherein when the first node calculates a predicted value of I/O load on the first node after execution of the inter-node deduplication of the first virtual partial area and the predicted value satisfies a predetermined condition, the first node executes the inter-node deduplication of a second virtual partial area managed by the first node from among the plurality of virtual partial areas obtained by dividing the virtual storage area.
- provides a host system with a virtual storage area across the logical storage area controlled by the one or more nodes and the logical storage area controlled by the first node;
Specification