×

Method and system for processing checksum of a data stream to optimize deduplication

  • US 9,063,664 B1
  • Filed: 12/18/2012
  • Issued: 06/23/2015
  • Est. Priority Date: 12/18/2012
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for deduplicating data, comprising:

  • receiving at a storage system over a network from a client a first data stream having a plurality of data regions and a plurality of checksums for verifying integrity of the data regions embedded therein, the first data stream representing a file or a directory of one or more files of a file system associated with the client;

    scanning the first data stream to recognize a plurality of checksum markers that identify the checksums, wherein the checksum markers were inserted into the first data stream by the client prior to receiving the first data stream over the network;

    extracting the checksum markers and the checksums from the first data stream to generate second data stream without the checksum markers and associated checksum data therein; and

    deduplicating the second data stream into a plurality of deduplicated chunks.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×