Data aware deduplication object storage (DADOS)

  • US 10,761,758 B2
  • Filed: 12/21/2016
  • Issued: 09/01/2020
  • Est. Priority Date: 12/21/2015
  • Status: Active Grant
First Claim

1. A data aware deduplicating object store, comprising:

  • a collection of storage devices, wherein each device includes a processor and non-transitory storage medium storing instructions that cause the processor to perform corresponding functions;

    a consistent hashing logic configured to:

    receive a first item having a first item identifier;

    break data of the item into at least two chunks;

    perform a first hash function on each chunk to generate a recipe comprising respective chunk identifiers of the at least two chunks;

    perform a second hash function on the item identifier to determine a metadata location on a metadata ring of a dual ring architecture;

    store the recipe in the metadata location;

    provide respective chunk identifiers and chunks for storing in a bulk ring of the dual ring architecture; and

    a deduplication logic configured to:

    receive the chunk identifiers and chunks from the consistent hashing logic;

    access a plurality of Bloom filter shards, wherein each Bloom filter shard stores information about chunks of data stored in an associated key/value data store of the bulk ring, to perform deduplication on each chunk based on the chunk identifier for the chunk to determine whether the chunk is a duplicate chunk already present in the bulk ring;

    increment a respective reference count for each respective chunk identifier based on the recipe including the respective chunk, wherein the reference count facilitates garbage collection or data reclamation; and

    when the chunk of data is a duplicate chunk, refrain from providing the duplicate chunk to the bulk ring for storing.
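
The flow recited in claim 1 can be illustrated with a minimal, single-process Python sketch. It is not the patented implementation. Every name here (`Ring`, `BloomShard`, `DedupStore`, `CHUNK_SIZE`, `chunks_written`) is hypothetical, and several choices are assumptions made only for illustration: fixed-size chunking, SHA-256 as the first hash function, BLAKE2b as the second hash function used for ring placement, one Bloom filter shard per bulk-ring node, and plain dictionaries standing in for the key/value data stores.

```python
import hashlib
from bisect import bisect_right
from collections import defaultdict

CHUNK_SIZE = 4  # assumed, tiny on purpose so short inputs yield several chunks


def ring_position(key: str) -> int:
    """Assumed 'second hash function': maps a key to a position on a ring."""
    return int.from_bytes(hashlib.blake2b(key.encode(), digest_size=8).digest(), "big")


class Ring:
    """Consistent-hash ring with virtual nodes (illustrative only)."""

    def __init__(self, nodes, vnodes=8):
        points = sorted((ring_position(f"{n}#{v}"), n) for n in nodes for v in range(vnodes))
        self._keys = [p for p, _ in points]
        self._nodes = [n for _, n in points]

    def locate(self, key: str) -> str:
        # First point clockwise from the key's position, wrapping around.
        i = bisect_right(self._keys, ring_position(key)) % len(self._keys)
        return self._nodes[i]


class BloomShard:
    """Small Bloom filter summarising the chunk ids held by one bulk node."""

    def __init__(self, bits=1024, hashes=3):
        self.bits, self.hashes, self.field = bits, hashes, 0

    def _positions(self, key: str):
        for i in range(self.hashes):
            digest = hashlib.sha256(f"{i}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.bits

    def add(self, key: str) -> None:
        for p in self._positions(key):
            self.field |= 1 << p

    def might_contain(self, key: str) -> bool:
        return all((self.field >> p) & 1 for p in self._positions(key))


class DedupStore:
    def __init__(self, meta_nodes, bulk_nodes):
        self.meta_ring, self.bulk_ring = Ring(meta_nodes), Ring(bulk_nodes)
        self.metadata = {n: {} for n in meta_nodes}  # node -> {item_id: recipe}
        self.bulk = {n: {} for n in bulk_nodes}      # node -> {chunk_id: chunk}
        self.blooms = {n: BloomShard() for n in bulk_nodes}
        self.refcount = defaultdict(int)             # chunk_id -> references from recipes
        self.chunks_written = 0

    def put(self, item_id: str, data: bytes):
        # Break the item into chunks and hash each one to build the recipe.
        chunks = [data[i:i + CHUNK_SIZE] for i in range(0, len(data), CHUNK_SIZE)]
        recipe = [hashlib.sha256(c).hexdigest() for c in chunks]
        # Hash the item identifier to pick the metadata-ring location for the recipe.
        self.metadata[self.meta_ring.locate(item_id)][item_id] = recipe
        for chunk_id, chunk in zip(recipe, chunks):
            node = self.bulk_ring.locate(chunk_id)
            self.refcount[chunk_id] += 1  # one reference per recipe entry
            # A Bloom filter can report false positives, so this sketch confirms
            # a hit against the key/value store before skipping the write.
            if self.blooms[node].might_contain(chunk_id) and chunk_id in self.bulk[node]:
                continue  # duplicate: do not send the chunk to the bulk ring again
            self.bulk[node][chunk_id] = chunk
            self.blooms[node].add(chunk_id)
            self.chunks_written += 1
        return recipe

    def get(self, item_id: str) -> bytes:
        recipe = self.metadata[self.meta_ring.locate(item_id)][item_id]
        return b"".join(self.bulk[self.bulk_ring.locate(c)][c] for c in recipe)
```

In this sketch, storing two items that share some chunks writes each shared chunk only once while raising its reference count, and `get` reassembles either item from its recipe. A garbage collector could reclaim a chunk once its count falls to zero, but deletion is not sketched here.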
