Storing data in a distributed file system
First Claim
1. A hardware device for storing data in a distributed file system, the distributed file system including a plurality of nonvolatile deduplication storage devices, the device comprising:
at least one memory storing computer-executable instructions; and
at least one processor configured to access the at least one memory and execute the computer-executable instructions to:
receive a request to store first data in the distributed file system;
determine a characteristic of the first data;
identify one of the plurality of nonvolatile deduplication storage devices of the distributed file system as a deduplication storage device for the first data based on the determined characteristic of the first data matching a characteristic of second data stored in the identified deduplication device;
determine that the first data is redundant to the second data based on the determined characteristic of the first data matching the characteristic of the second data; and
in response at least in part to determining that the first data is redundant to the second data, store the first data in the identified deduplication storage device such that the first data and the second data are redundantly stored in the identified deduplication storage device and are deduplicatable within the identified deduplication storage device after the first data is redundantly stored.
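The placement steps recited above can be illustrated with a minimal sketch. The claim does not say what the "characteristic" is or how devices are indexed, so everything here is an assumption made for illustration: the characteristic is taken to be a SHA-256 content fingerprint, each device is an in-memory list, and the names `characteristic`, `DedupDevice`, and `store` are hypothetical. The fallback used when no device holds matching data is also not part of the claim.

```python
import hashlib
from dataclasses import dataclass, field


def characteristic(data: bytes) -> str:
    """Assumed characteristic: a SHA-256 fingerprint of the content."""
    return hashlib.sha256(data).hexdigest()


@dataclass
class DedupDevice:
    """Hypothetical stand-in for one deduplication storage device."""
    name: str
    # A list (not a set) so that redundant copies can coexist on the device.
    objects: list = field(default_factory=list)

    def characteristics(self) -> set:
        """Characteristics of all data currently stored on this device."""
        return {characteristic(obj) for obj in self.objects}


def store(devices: list, first_data: bytes) -> DedupDevice:
    """Route first_data to the device already holding matching second data."""
    c = characteristic(first_data)
    for device in devices:
        if c in device.characteristics():
            # Match found: the first data is treated as redundant to the
            # second data and is stored redundantly on the same device,
            # leaving both copies available for later deduplication there.
            device.objects.append(first_data)
            return device
    # Assumption (not in the claim): with no match, use the emptiest device.
    target = min(devices, key=lambda d: len(d.objects))
    target.objects.append(first_data)
    return target
```

Under these assumptions, storing the same payload twice places both copies on one device, which is what makes them deduplicatable locally rather than across the distributed file system.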
Abstract
A device for storing data in a distributed file system, the distributed file system including a plurality of deduplication storage devices, includes a determination unit configured to determine a characteristic of first data to be stored in the distributed file system; an identification unit configured to identify one of the deduplication storage devices of the distributed file system as a deduplication storage device for the first data based on the characteristic of the first data; and a storing unit configured to store the first data in the identified deduplication storage device such that the first data and second data being redundant to the first data are deduplicatable within the identified deduplication storage device.
11 Citations
20 Claims
1. A hardware device for storing data in a distributed file system, the distributed file system including a plurality of nonvolatile deduplication storage devices, the device comprising:
at least one memory storing computer-executable instructions; and
at least one processor configured to access the at least one memory and execute the computer-executable instructions to:
receive a request to store first data in the distributed file system;
determine a characteristic of the first data;
identify one of the plurality of nonvolatile deduplication storage devices of the distributed file system as a deduplication storage device for the first data based on the determined characteristic of the first data matching a characteristic of second data stored in the identified deduplication device;
determine that the first data is redundant to the second data based on the determined characteristic of the first data matching the characteristic of the second data; and
in response at least in part to determining that the first data is redundant to the second data, store the first data in the identified deduplication storage device such that the first data and the second data are redundantly stored in the identified deduplication storage device and are deduplicatable within the identified deduplication storage device after the first data is redundantly stored.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. A method, implemented by one or more computing systems, for storing data in a distributed file system, the distributed file system including a plurality of nonvolatile deduplication storage devices, the method comprising:
receiving a request to store first data in the distributed file system;
determining a characteristic of the first data;
identifying one of the plurality of nonvolatile deduplication storage devices of the distributed file system as a deduplication storage device for the first data based on the determined characteristic of the first data matching a characteristic of second data stored in the identified deduplication device;
determining that the first data is redundant to the second data based on the determined characteristic of the first data matching the characteristic of the second data; and
in response at least in part to determining that the first data is redundant to the second data, storing the first data in the identified deduplication storage device such that the first data and the second data are redundantly stored in the identified deduplication storage device and are deduplicatable within the identified deduplication storage device after the first data is stored.
Dependent claims: 15, 16, 17, 18, 19
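The final step of the method leaves redundant copies on a single device so that they are "deduplicatable" there after storage. The claim does not describe the deduplication pass itself, so the following is only a sketch under assumptions: redundancy is detected by SHA-256 fingerprint, the device's contents are modeled as a list of byte strings, and the function name `deduplicate` is hypothetical.

```python
import hashlib


def deduplicate(objects: list[bytes]) -> tuple[dict[str, bytes], list[str]]:
    """Collapse redundant copies held by one device.

    Returns a store with one instance per fingerprint, plus one fingerprint
    reference per original object (in the original order), so every object
    can still be reconstructed from the single retained instance.
    """
    unique: dict[str, bytes] = {}
    refs: list[str] = []
    for obj in objects:
        fp = hashlib.sha256(obj).hexdigest()  # assumed redundancy test
        unique.setdefault(fp, obj)            # keep only the first copy
        refs.append(fp)                       # later copies become references
    return unique, refs
```

Because matching data was routed to the same device beforehand, a pass like this only needs to inspect one device's contents and does not require any cross-device comparison.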
20. A nontransitory computer readable storage medium having computer readable instructions stored thereon that, when executed by a computer, implement a method for storing data in a distributed file system, the distributed file system including a plurality of nonvolatile deduplication storage devices, the method comprising:
receiving a request to store first data in the distributed file system;
determining a characteristic of the first data;
identifying one of the plurality of nonvolatile deduplication storage devices of the distributed file system as deduplication storage device for the first data based on the determined characteristic of the first data matching a characteristic of second data stored in the identified deduplication device;
determining that the first data is redundant to the second data based on the determined characteristic of the first data matching the characteristic of the second data; and
in response at least in part to determining that the first data is redundant to the second data, storing the first data in the identified deduplication storage device such that the first data and the second data are redundantly stored in the identified deduplication storage device and are deduplicatable within the identified deduplication storage device after the first data is stored.
Specification