×

Data compression and storage techniques

  • US 8,832,045 B2
  • Filed: 05/02/2012
  • Issued: 09/09/2014
  • Est. Priority Date: 04/07/2006
  • Status: Active Grant
First Claim
Patent Images

1. A method for use in computerized data storage, wherein a computerized system is operative to utilize computer readable media to back-up a data set, comprising:

  • generating hash signatures including an identifier hash associated with identifying data and a content hash associated with content of individual portions of an initial data set;

    transferring the initial data set to a storage location via a network interface;

    at a time subsequent to transferring the initial data set, performing a back-up of a subsequent data set associated with the initial data set, wherein performing the back-up comprises;

    generating hash signatures including an identifier hash associated with identifying data and a content hash associated with content of individual portions of the subsequent data;

    comparing the identifier hashes of corresponding portions of the initial data set and the subsequent data set and, upon failing to match identifier hashes, comparing content hashes of said corresponding portions to determine if a corresponding content hash exists for the initial data set and to identify changed portions of the subsequent data set;

    obtaining corresponding portions of the initial dataset that correspond to the changed portions of the subsequent data set;

    preloading a dictionary-based compression engine with one of the corresponding portions of the initial data set, wherein the one corresponding portion of the initial data set is loaded in the dictionary-based compression engine and defines an individual dictionary block;

    compressing a corresponding one of the changed portions of the subsequent data set using the dictionary-based compression engine as loaded with the corresponding portion of the initial data set as a dictionary, wherein a compressed data portion is generated; and

    storing the compressed data portion to the storage location via the network interface to define a back-up version of the subsequent data set.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×