×

Data de-duplication in a dispersed storage network utilizing data characterization

  • US 8,458,233 B2
  • Filed: 09/17/2010
  • Issued: 06/04/2013
  • Est. Priority Date: 11/25/2009
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer implemented method comprises:

  • receiving, from a requesting device, a data storage request that includes data for storage;

    determining whether substantially identical data is currently stored in a dispersed storage network (DSN) memory; and

    when the substantially identical data is not stored in the DSN memory;

    encoding at least a portion of the data using an error coding dispersal storage function to produce a set of encoded data slices;

    sending the set of encoded data slices to the DSN memory for storage therein; and

    generating a unique retrieval matrix for the requesting device, wherein the unique retrieval matrix identifies a sub-set of encoded data slices of the set of encoded data slices for subsequent retrieval of the at least a portion of the data; and

    when the substantially identical data is stored in the DSN memory;

    generating a second unique retrieval matrix for the requesting device, wherein the second unique retrieval matrix identifies a second sub-set of encoded data slices of the set of encoded data slices for subsequent retrieval of the at least a portion of the data.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×