METHODS AND APPARATUS FOR PERFORMING EFFICIENT DATA DEDUPLICATION BY METADATA GROUPING
First Claim
Patent Images
1. A storage system comprising:
- a data storage volume;
a memory storing metadata associated with a data storage volume;
a network interface configured to connect the storage system with a host computer; and
a central processing unit;
wherein said storage system calculates an identifier from data received from the host computer, and determines if the data is stored in the data storage volume by the identifier and said metadata.
1 Assignment
0 Petitions
Accused Products
Abstract
The system is composed of: identifier generation program or logic, identifier confirm program or logic, plural identifier table and metadata mapping table. Data streams or data blocks, files are stored in the data storage system with metadata. The metadata includes additional information of the data and files. For example application, creator, timestamp, OS type, and the like. Data storage system or backup appliance with this invention can have plural groups which are related to the metadata. Each group has an identifier table so that eliminating duplicated data is executed within the group.
47 Citations
20 Claims
-
1. A storage system comprising:
-
a data storage volume; a memory storing metadata associated with a data storage volume; a network interface configured to connect the storage system with a host computer; and a central processing unit; wherein said storage system calculates an identifier from data received from the host computer, and determines if the data is stored in the data storage volume by the identifier and said metadata. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A storage system comprising:
-
a data storage volume; a memory storing metadata associated with a data storage volume; a network interface configured to connect the storage system with a host computer; and a central processing unit; wherein said storage system calculates an identifier from data in an object received from the host computer, and determines if the data is stored in the data storage volume by the identifier, said metadata stored in the memory and metadata in said object. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A method performed in a storage system comprising a plurality of data storage units, the plurality of data storage units being divided into a plurality of chunks forming a chunk pool;
- a network interface configured to connect the storage system with a host computer; and
a storage controller comprising a central processing unit and a memory, the method comprising;i. provisioning a data storage volume and making the data storage volume available to the host computer via the network interface; ii. upon receipt of a write command directed to the data storage volume from the host computer, calculating an identifier corresponding to the data associated with the write command; iii. grouping the identifier based on metadata into at least one identifier group; iv. confirming uniqueness of the identifier within the identifier group associated with the metadata; and v. if the identifier is unique within the identifier group, allocating at least one chunk from the chunk pool to the data storage volume and storing the data associated with the write command in the allocated at least one chunk.
- a network interface configured to connect the storage system with a host computer; and
Specification