×

Scalable chunk store for data deduplication

  • US 10,394,757 B2
  • Filed: 11/18/2010
  • Issued: 08/27/2019
  • Est. Priority Date: 11/18/2010
  • Status: Active Grant
First Claim
Patent Images

1. A method, comprising:

  • parsing a data stream into a sequence of data chunks;

    determining whether any of the sequence of data chunks are stored in a chunk container that includes a plurality of data chunks;

    storing, in a contiguous arrangement and in a same sequence in the chunk container as in the data stream, data chunks of the sequence of data chunks determined to not be stored in the chunk container;

    generating a stream map that is a data structure that describes a mapping between a structure of the data stream and an optimized structure of the data chunks stored in the chunk container to enable data chunks referenced in the stream map to be located in the chunk container, the optimized structure including data chunks that have been deduplicated, the stream map including metadata for each data chunk of the sequence; and

    including, in the metadata for each of the data chunks stored in the contiguous arrangement, a same locality indicator value that indicates the contiguous arrangement and indicates that each of the data chunks stored in the contiguous arrangement is associated with the generated stream map.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×