×

METHOD AND APPARATUS FOR BLOCK SIZE OPTIMIZATION IN DE-DUPLICATION

  • US 20090313248A1
  • Filed: 06/11/2008
  • Published: 12/17/2009
  • Est. Priority Date: 06/11/2008
  • Status: Active Grant
First Claim
Patent Images

1. A method of determining sizing of chunk portions in data de-duplication, comprising:

  • chunking input data into a first plurality of data segments each having a first size;

    assigning an identifier to each of the first plurality of data segments;

    assigning an index to each of said identifiers;

    creating a suffix structure and a longest common prefix structure from the indexes;

    detecting repeated sequences of indexes and non-repeated indexes from the suffix structure and the longest common prefix structure;

    determining a second size based on said detected repeated sequences and non-repeated indexes; and

    chunking the input data into a second plurality of data segments each having the second size.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×