Multi source wire deduplication
First Claim
1. A method for data de-duplication in a replication environment, the replication environment having a production site, a splitter, and a replication site, wherein the replication site has a journal, the method comprising:
- determining a digest for each chunk of data of a set of data chunks;
determining for each chunk of the set of data chunks whether the digest is in an index on the production site;
determining which offsets are to be evicted from a cache on the replication site;
replacing data chunks in the set of data chunks that are in the index with an offset; and
transmitting the set of data chunks, the offsets and an eviction list to the replication site.
9 Assignments
0 Petitions
Accused Products
Abstract
A method, system, and program product for data deduplication in a replication environment, the replication environment having a production site, a splitter, and a replication site, wherein the replication site has a journal, comprising determining a digest for each chunk of data of a set of data chunks, determining for each chunk whether the digest is in an index on the production site, determining for which offsets are to be evicted from the cache on the replication site, replacing the chunks in set of chunks that are in the index with an offset, and transmitting the set of chunks, offsets and an eviction list to the replication site.
-
Citations
20 Claims
-
1. A method for data de-duplication in a replication environment, the replication environment having a production site, a splitter, and a replication site, wherein the replication site has a journal, the method comprising:
-
determining a digest for each chunk of data of a set of data chunks; determining for each chunk of the set of data chunks whether the digest is in an index on the production site; determining which offsets are to be evicted from a cache on the replication site; replacing data chunks in the set of data chunks that are in the index with an offset; and transmitting the set of data chunks, the offsets and an eviction list to the replication site. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system for data deduplication in a replication environment, the system comprising:
-
a production site having an index, a replication site having a data cache; and computer-executable logic operating in memory, wherein the computer-executable program logic is configured for execution of the following; determining a digest for each chunk of data of a set of data chunks; determining for each chunk of the set of data chunks whether the digest is in an index on the production site; determining for which offsets are to be evicted from the cache on the replication site; replacing data chunks in the set of data chunks that are in the index with an offset; and transmitting the set of data chunks, the offsets and an eviction list to the replication site. - View Dependent Claims (13, 14, 15)
-
-
16. A computer program product comprising:
-
a non-transitory computer readable medium encoded with computer executable program code for data de-duplication in a replication environment, the replication environment having a production site, a splitter, and a replication site, wherein the replication site has a journal, the code enabling; determining a digest for each chunk of data of a set of data chunks; determining for each chunk of the set of data chunks whether the digest is in an index on the production site; determining which offsets are to be evicted from a cache on the replication site; replacing data chunks in the set of data chunks that are in the index with an offset; and transmitting the set of data chunks, the offsets and an eviction list to the replication site. - View Dependent Claims (17, 18, 19, 20)
-
Specification