Data deduplication by separating data from meta data
Abstract
Provided are techniques for data deduplication. A chunk of data and a mapping of boundaries between file data and meta data in the chunk of data are received. The mapping is used to split the chunk of data into a file data stream and a meta data stream and to store file data from the file data stream in a first file and to store meta data from the meta data stream in a second file, wherein the first file and the second file are separate files. The file data in the first file is deduplicated.
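As a rough illustration of the flow described in the abstract, the following Python sketch splits one chunk using a boundary mapping and deduplicates only the file data. The mapping layout (a list of `(start, end, kind)` tuples), the function names, the in-memory byte strings standing in for the first and second files, and the fixed-size SHA-256 block store are all assumptions made for this example; the source does not specify any of them.

```python
import hashlib
from typing import Dict, List, Tuple

# Hypothetical mapping entry: (start offset, end offset (exclusive), "file" or "meta").
Region = Tuple[int, int, str]


def split_chunk(chunk: bytes, mapping: List[Region]) -> Tuple[bytes, bytes]:
    """Split a co-mingled chunk into a file data stream and a meta data stream."""
    file_stream = bytearray()
    meta_stream = bytearray()
    for start, end, kind in mapping:
        if kind == "file":
            file_stream.extend(chunk[start:end])
        else:
            meta_stream.extend(chunk[start:end])
    return bytes(file_stream), bytes(meta_stream)


def deduplicate(data: bytes, store: Dict[str, bytes], block_size: int = 4) -> List[str]:
    """Assumed dedup scheme: store each distinct fixed-size block once, keyed by its hash."""
    refs = []
    for i in range(0, len(data), block_size):
        block = data[i:i + block_size]
        digest = hashlib.sha256(block).hexdigest()
        store.setdefault(digest, block)  # keep only the first copy of a block
        refs.append(digest)
    return refs


# One received chunk: two meta data headers interleaved with file data.
chunk = b"HDR1AAAABBBBHDR2AAAA"
mapping = [(0, 4, "meta"), (4, 12, "file"), (12, 16, "meta"), (16, 20, "file")]

file_data, meta_data = split_chunk(chunk, mapping)  # stand-ins for the two separate files
store: Dict[str, bytes] = {}
refs = deduplicate(file_data, store)  # only the file data is deduplicated
```

In this sketch the meta data stream is kept as-is and never passes through `deduplicate`, so headers that differ from one backup to the next do not disturb block matching in the file data.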
136 Citations
17 Claims
1. A method, comprising:
inserting meta data into a file containing file data to form co-mingled data;
sending chunks of the co-mingled data and a mapping of boundaries that identifies offsets of which portions of each chunk of data are the meta data and which portions of each chunk of data are the file data to a data deduplication system; and
under control of the data deduplication system,
receiving a chunk of data storing the co-mingled file data and meta data and the mapping;
using the mapping to split the chunk of data into a file data stream and a meta data stream and to store the file data from the file data stream in a first file and to store the meta data from the meta data stream in a second file, wherein the first file and the second file are separate files; and
deduplicating the file data in the first file without deduplicating the meta data in the second file.
Dependent claims: 2, 3, 4, 5, 6
7. A computer program product comprising a computer-readable medium including computer readable instructions, wherein the computer readable instructions when executed by a processor on a computer causes the computer to:
insert meta data into a file containing file data to form co-mingled data;
send chunks of the co-mingled data and a mapping of boundaries that identifies offsets of which portions of each chunk of data are the meta data and which portions of each chunk of data are the file data to a data deduplication system; and
under control of the data deduplication system,
receive a chunk of data storing the co-mingled file data and meta data and the mapping;
use the mapping to split the chunk of data into a file data stream and a meta data stream and to store the file data from the file data stream in a first file and to store the meta data from the meta data stream in a second file, wherein the first file and the second file are separate files; and
deduplicate the file data in the first file without deduplicating the meta data in the second file.
Dependent claims: 8, 9, 10, 11, 12
13. A system, comprising:
hardware logic performing operations, the operations comprising:
inserting meta data into a file containing file data to form co-mingled data;
sending chunks of the co-mingled data and a mapping of boundaries that identifies offsets of which portions of each chunk of data are the meta data and which portions of each chunk of data are the file data to a data deduplication system; and
hardware logic of the data deduplication system performing operations, the operations comprising:
receiving a chunk of data storing the co-mingled file data and meta data and the mapping;
using the mapping to split the chunk of data into a file data stream and a meta data stream and to store the file data from the file data stream in a first file and to store the meta data from the meta data stream in a second file, wherein the first file and the second file are separate files; and
deduplicating the file data in the first file without deduplicating the meta data in the second file.
Dependent claims: 14, 15, 16, 17
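The sending side recited in the independent claims (inserting meta data into the file, then sending chunks together with boundary offsets for each chunk) could look roughly like the Python sketch below. The helper names `comingle` and `chunk_with_mapping`, the choice to insert one meta data record before every fixed-size run of file data, and the `(start, end, kind)` tuple layout of the mapping are hypothetical and chosen only for illustration; the claims do not prescribe them.

```python
from typing import Iterator, List, Tuple

# Hypothetical mapping entry: (start offset, end offset (exclusive), "file" or "meta").
Region = Tuple[int, int, str]


def comingle(file_data: bytes, meta: bytes, every: int) -> Tuple[bytes, List[Region]]:
    """Insert a meta data record before each `every`-byte run of file data."""
    out = bytearray()
    regions: List[Region] = []
    for i in range(0, len(file_data), every):
        piece = file_data[i:i + every]
        regions.append((len(out), len(out) + len(meta), "meta"))
        out.extend(meta)
        regions.append((len(out), len(out) + len(piece), "file"))
        out.extend(piece)
    return bytes(out), regions


def chunk_with_mapping(
    data: bytes, regions: List[Region], chunk_size: int
) -> Iterator[Tuple[bytes, List[Region]]]:
    """Yield (chunk, mapping) pairs; mapping offsets are relative to the chunk start."""
    for base in range(0, len(data), chunk_size):
        end = min(base + chunk_size, len(data))
        mapping: List[Region] = []
        for start, stop, kind in regions:
            lo, hi = max(start, base), min(stop, end)
            if lo < hi:  # this region overlaps the current chunk
                mapping.append((lo - base, hi - base, kind))
        yield data[base:end], mapping


# Build co-mingled data from 8 bytes of file data and a 2-byte meta data record.
comingled, regions = comingle(b"AAAABBBB", b"HD", every=4)
chunks = list(chunk_with_mapping(comingled, regions, chunk_size=5))
```

Because chunk boundaries need not line up with the meta data boundaries, a region that straddles two chunks is clipped and reported in both mappings, which lets the receiver split each chunk without seeing its neighbours.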
Specification