×

Enhanced method and system for assuring integrity of deduplicated data

  • US 20090271454A1
  • Filed: 04/29/2008
  • Published: 10/29/2009
  • Est. Priority Date: 04/29/2008
  • Status: Active Grant
First Claim
Patent Images

1. An enhanced method in a computer system for assuring integrity of deduplicated data in a data system, comprising:

  • performing deduplication upon a data object by dividing the data object into a set of one or more data chunks, and for each data chunk;

    inputting the data chunk into a hash function;

    performing the hash function upon a predetermined size of the data chunk to produce an intermediate hash value of the data chunk;

    performing the hash function on the remainder of the data chunk;

    computing a hash value for the data chunk used for determining whether a duplicate of the data chunk exists in the data system; and

    deduplicating the data chunk based on the computed hash value for each data chunk;

    generating an original object signature of the data object by computing a checksum from a collection of the intermediate hash values produced for the predetermined size of each data chunk within the data object;

    storing the original object signature in an index;

    assembling the deduplicated data object into a reassembled state responsive to said data object being accessed;

    dividing the reassembled data object into a set of one or more data chunks, and for each data chunk;

    inputting the data chunk into the hash function; and

    performing the hash function on the predetermined size of the data chunk to produce an intermediate hash value of the data chunk;

    generating a reassembled object signature of the reassembled data object by computing a checksum from a collection of the intermediate hash values produced for the predetermined size of each data chunk within the reassembled data object;

    comparing the reassembled object signature with the original object signature associated with the data object stored in the index; and

    providing the reassembled data object if the reassembled object signature matches the original object signature.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×