STORED DATA DEDUPLICATION METHOD, STORED DATA DEDUPLICATION APPARATUS, AND DEDUPLICATION PROGRAM
Abstract
A method comprising: dividing data to be stored in a storage device into data fragments; recording the data by using configurations of the divided data fragments; judging whether identical data fragments exist among the data fragments; when it is judged that identical data fragments exist, storing one of the identical data fragments in a storage area of the storage device, and generating and recording data-fragment attribute information indicating an attribute unique to the stored data fragment; upon receipt of a request to read data stored in the storage area of the storage device, acquiring the configurations of the data fragments forming the read-target data, reading the corresponding data fragments from the storage area of the storage device, and restoring the data; acquiring and coupling the recorded data fragments to generate concatenation target data targeted for judgment on whether chunk concatenation is possible or not, and detecting whether the concatenation target data has a repeated data pattern.
17 Claims
1. A stored-data deduplication method for eliminating a duplicate data fragment from a storage area in a storage device, the duplicate data fragment being a duplicate of one of data fragments constituting data stored in the storage device, the method comprising:
- dividing the data to be stored in the storage device into the data fragments;
- recording the data by using configurations of the divided data fragments;
- judging whether identical data fragments exist in the data fragments;
- when it is judged that the identical data fragments exist, storing one of the identical data fragments in the storage area of the storage device, and generating and recording data-fragment attribute information which is information indicating an attribute unique to the data fragment stored;
- upon receipt of a request to read the data stored in the storage area of the storage device, acquiring the configurations of the data fragments forming the read-target data, reading the corresponding data fragments from the storage area of the storage device, and restoring the data;
- acquiring and coupling the recorded data fragments to generate concatenation target data targeted for judgment on whether chunk concatenation is possible or not, and detecting whether the concatenation target data has a repeated data pattern which is repetition of a particular data pattern; and
- using as a concatenated data fragment a sequence of a plurality of the data fragments having the detected repeated data pattern, generating from the concatenated data fragment concatenated-data fragment attribute information indicating an attribute of the concatenated data fragment, and recording the concatenated-data fragment attribute information.

Dependent claims: 2, 3, 4, 5, 6, 7, 8
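
The dividing, recording, judging, storing, and restoring steps of claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. The fixed chunk size, the use of a SHA-256 digest as the "data-fragment attribute information", and the names `DedupStore`, `write`, and `read` are all assumptions made for illustration; the claim does not prescribe any of them.

```python
import hashlib


class DedupStore:
    """Illustrative (hypothetical) in-memory deduplicating store."""

    def __init__(self, chunk_size=4):
        # Assumption: fixed-size chunking; the claim only says "dividing".
        self.chunk_size = chunk_size
        # fingerprint -> fragment bytes; one stored copy per identical fragment.
        self.chunks = {}
        # data name -> ordered list of fingerprints (the "configuration").
        self.recipes = {}

    def write(self, name, data):
        recipe = []
        for i in range(0, len(data), self.chunk_size):
            fragment = data[i:i + self.chunk_size]
            # Assumption: a SHA-256 digest stands in for the unique attribute.
            fingerprint = hashlib.sha256(fragment).hexdigest()
            # Store the fragment only if an identical one is not already stored.
            self.chunks.setdefault(fingerprint, fragment)
            recipe.append(fingerprint)
        self.recipes[name] = recipe

    def read(self, name):
        # Restore the data by reading fragments in the recorded order.
        return b"".join(self.chunks[fp] for fp in self.recipes[name])


store = DedupStore(chunk_size=4)
store.write("example", b"ABCDABCDEFGH")
restored = store.read("example")
```

In this example the input splits into three fragments, two of which are identical, so the recorded configuration holds three entries while only two fragments occupy the storage area.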
9. A stored-data deduplication apparatus for eliminating a duplicate data fragment from a storage area in a storage device, the duplicate data fragment being a duplicate of one of data fragments constituting data stored in the storage device, the apparatus comprising a processor, a memory, and units implemented when the processor executes a corresponding program on the memory, the units being:
- a data division unit configured to divide the data to be stored in the storage device into the data fragments;
- a data registration unit configured to record the data by using configurations of the divided data fragments;
- a data matching unit configured to judge whether identical data fragments exist in the data fragments, and when it is judged that the identical data fragments exist, store one of the identical data fragments in the storage area of the storage device, and generate and record data-fragment attribute information which is information indicating an attribute unique to the data fragment stored;
- a data restoration unit configured to, upon receipt of a request to read the data stored in the storage area of the storage device, acquire the configurations of the data fragments forming the read-target data, read the corresponding data fragments from the storage area of the storage device, and restore the data;
- a data analysis unit configured to acquire and couple the recorded data fragments to generate concatenation target data targeted for judgment on whether chunk concatenation is possible or not, and detect whether the concatenation target data has a repeated data pattern which is repetition of a particular data pattern; and
- a data update unit configured to use as a concatenated data fragment a sequence of a plurality of the data fragments having the detected repeated data pattern, generate from the concatenated data fragment concatenated-data fragment attribute information indicating an attribute of the concatenated data fragment, and record the concatenated-data fragment attribute information.

Dependent claims: 10, 11, 12, 13, 14, 15, 16
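
The roles of the data analysis unit and the data update unit can be sketched roughly as follows. This is one possible reading, not the method disclosed in the specification: it assumes a "repeated data pattern" means a run of consecutive fragments that occurs at least twice in a recorded fragment sequence, and it assumes a SHA-256 digest serves as the concatenated-data fragment attribute information. The function names `find_repeated_run` and `concatenate_run` and the run length of 2 are hypothetical.

```python
import hashlib
from collections import Counter


def find_repeated_run(recipe, run_len=2):
    """Return the first run of run_len consecutive fragment IDs that
    occurs at least twice in recipe, or None if no run repeats.
    Overlapping occurrences are counted; a fuller version might exclude them."""
    last_start = len(recipe) - run_len
    runs = [tuple(recipe[i:i + run_len]) for i in range(last_start + 1)]
    counts = Counter(runs)
    for run in runs:
        if counts[run] >= 2:
            return run
    return None


def concatenate_run(chunks, run):
    """Couple the fragments of a repeated run into one concatenated fragment
    and derive its attribute (assumed here to be a SHA-256 digest)."""
    data = b"".join(chunks[fragment_id] for fragment_id in run)
    return hashlib.sha256(data).hexdigest(), data


# Hypothetical stored fragments and one recorded fragment sequence.
chunks = {"a": b"AB", "b": b"CD", "c": b"EF"}
recipe = ["a", "b", "c", "a", "b"]

run = find_repeated_run(recipe)
attribute, concatenated = concatenate_run(chunks, run)
```

Here the run `("a", "b")` appears twice, so its two fragments are coupled into the single concatenated fragment `b"ABCD"`, whose digest would then be recorded alongside the per-fragment attributes.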
17. A stored-data deduplication program used for eliminating a duplicate data fragment from a storage area in a storage device, the duplicate data fragment being a duplicate of one of data fragments constituting data stored in the storage device, the program causing a computer to execute:
- dividing the data to be stored in the storage device into the data fragments;
- recording the data by using configurations of the divided data fragments;
- judging whether identical data fragments exist in the data fragments;
- when it is judged that the identical data fragments exist, storing one of the identical data fragments in the storage area of the storage device, and generating and recording data-fragment attribute information which is information indicating an attribute unique to the data fragment stored;
- upon receipt of a request to read the data stored in the storage area of the storage device, acquiring the configurations of the data fragments forming the read-target data, reading the corresponding data fragments from the storage area of the storage device, and restoring the data;
- acquiring and coupling the recorded data fragments to generate concatenation target data targeted for judgment on whether chunk concatenation is possible or not, and detecting whether the concatenation target data has a repeated data pattern which is repetition of a particular data pattern; and
- using as a concatenated data fragment a sequence of a plurality of the data fragments having the detected repeated data pattern, generating from the concatenated data fragment concatenated-data fragment attribute information indicating an attribute of the concatenated data fragment, and recording the concatenated-data fragment attribute information.
Specification