DUAL LAYER DEDUPLICATION FOR APPLICATION SPECIFIC FILE TYPES IN AN INFORMATION PROCESSING SYSTEM
First Claim
1. An apparatus comprising:
- at least one processing device comprising a processor coupled to a memory, the at least one processing device being configured:
to obtain a file;
to determine a type of the file;
to determine a structure of the file based at least in part on the determined type of the file;
to identify at least one location in the file based at least in part on the determined structure;
to divide the file at the identified at least one location into a plurality of chunks; and
to provide the plurality of chunks to a block deduplication module of a storage system, the block deduplication module configured to perform a deduplication process based at least in part on the plurality of chunks.
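The determine-type, determine-structure, identify-location and divide steps recited above can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: PNG is chosen here only as an example of a file type with a well-defined internal structure (an 8-byte signature followed by length-prefixed chunks), and the names determine_type, determine_structure, png_boundaries, STRUCTURES, divide, chunk_file and png_chunk are all hypothetical. Type annotations assume Python 3.9 or later.

```python
import struct
import zlib
from typing import Callable, Optional

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def determine_type(data: bytes) -> str:
    """Determine the file type from its leading magic bytes."""
    return "png" if data.startswith(PNG_SIGNATURE) else "unknown"


def png_boundaries(data: bytes) -> list[int]:
    """Return the byte offset at which each PNG chunk starts."""
    offsets = []
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        offsets.append(pos)
        (length,) = struct.unpack(">I", data[pos:pos + 4])
        pos += 12 + length  # 4-byte length, 4-byte type, payload, 4-byte CRC
    return offsets


# Hypothetical registry: file type -> function that understands its structure.
STRUCTURES = {"png": png_boundaries}


def determine_structure(file_type: str) -> Optional[Callable[[bytes], list[int]]]:
    """Look up the structure parser for a file type, if one is known."""
    return STRUCTURES.get(file_type)


def divide(data: bytes, offsets: list[int]) -> list[bytes]:
    """Split the file at the given offsets, dropping empty pieces."""
    bounds = [0, *offsets, len(data)]
    return [data[a:b] for a, b in zip(bounds, bounds[1:]) if a < b]


def chunk_file(data: bytes) -> list[bytes]:
    """Type -> structure -> locations -> chunks; unknown types stay whole."""
    locate = determine_structure(determine_type(data))
    offsets = locate(data) if locate else []
    return divide(data, offsets)


def png_chunk(kind: bytes, payload: bytes) -> bytes:
    """Build one PNG chunk (helper used only to make a sample file)."""
    crc = zlib.crc32(kind + payload)
    return struct.pack(">I", len(payload)) + kind + payload + struct.pack(">I", crc)


sample = (
    PNG_SIGNATURE
    + png_chunk(b"IHDR", bytes(13))
    + png_chunk(b"IDAT", b"abc")
    + png_chunk(b"IEND", b"")
)
chunks = chunk_file(sample)
print([len(c) for c in chunks])
```

The resulting chunks concatenate back to the original file, and a file of an unrecognized type is returned as a single chunk. In a real system the list of chunks would then be handed to the storage system's block deduplication module.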
Abstract
An apparatus in one embodiment comprises at least one processing device comprising a processor coupled to a memory. The at least one processing device is configured to obtain a file and to determine a type of the file. A structure of the file may be determined based at least in part on the determined type of the file and at least one location in the file may be identified based at least in part on the determined structure. The file may be divided at the identified at least one location into a plurality of chunks and the plurality of chunks may be provided to a block deduplication module of a storage system where the block deduplication module is configured to perform a deduplication process based at least in part on the plurality of chunks.
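The abstract does not say how the block deduplication module works internally. One common approach, assumed here purely for illustration, is a content-addressed store that keys each chunk by a cryptographic hash so that identical chunks are stored once. The class BlockDedupStore and its put and get methods are hypothetical names, and SHA-256 is an assumed choice of hash.

```python
import hashlib


class BlockDedupStore:
    """Toy content-addressed store: identical chunks are kept only once."""

    def __init__(self) -> None:
        self.blocks: dict[str, bytes] = {}

    def put(self, chunks: list[bytes]) -> list[str]:
        """Store the chunks and return the list of digests (a 'recipe')."""
        recipe = []
        for chunk in chunks:
            digest = hashlib.sha256(chunk).hexdigest()
            self.blocks.setdefault(digest, chunk)  # keep the first copy only
            recipe.append(digest)
        return recipe

    def get(self, recipe: list[str]) -> bytes:
        """Rebuild a file from its recipe."""
        return b"".join(self.blocks[d] for d in recipe)


store = BlockDedupStore()
header_a, header_b = b"meta: v1", b"meta: v2 (edited)"
shared = b"payload" * 100

# Two files whose headers differ in length but whose payload is identical.
recipe_a = store.put([header_a, shared])
recipe_b = store.put([header_b, shared])
print(len(store.blocks))
```

Because the two files were divided at the header/payload boundary, the shared payload is stored once even though the headers differ in length. With fixed-size blocks, the longer header would shift every following byte, and the payload blocks of the two files would generally no longer match.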
20 Claims
1. An apparatus comprising:
- at least one processing device comprising a processor coupled to a memory, the at least one processing device being configured:
to obtain a file;
to determine a type of the file;
to determine a structure of the file based at least in part on the determined type of the file;
to identify at least one location in the file based at least in part on the determined structure;
to divide the file at the identified at least one location into a plurality of chunks; and
to provide the plurality of chunks to a block deduplication module of a storage system, the block deduplication module configured to perform a deduplication process based at least in part on the plurality of chunks.
11. A method comprising:
obtaining a file;
determining a type of the file;
determining a structure of the file based at least in part on the determined type of the file;
identifying at least one location in the file based at least in part on the determined structure;
dividing the file at the identified at least one location into a plurality of chunks; and
providing the plurality of chunks to a block deduplication module of a storage system, the block deduplication module configured to perform a deduplication process based at least in part on the plurality of chunks,
wherein the method is implemented by at least one processing device comprising a processor coupled to a memory.
20. A computer program product comprising a non-transitory processor-readable storage medium having stored therein program code of one or more software programs, wherein the program code when executed by at least one processing device causes said at least one processing device:
to obtain a file;
to determine a type of the file;
to determine a structure of the file based at least in part on the determined type of the file;
to identify at least one location in the file based at least in part on the determined structure;
to divide the file at the identified at least one location into a plurality of chunks; and
to provide the plurality of chunks to a block deduplication module of a storage system, the block deduplication module configured to perform a deduplication process based at least in part on the plurality of chunks.
Specification