×

High availability distributed deduplicated storage system

  • US 9,633,033 B2
  • Filed: 01/10/2014
  • Issued: 04/25/2017
  • Est. Priority Date: 01/11/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method of performing a storage operation in a distributed, deduplicated storage system, comprising:

  • receiving at a first secondary storage computing device of a plurality of secondary storage computing devices a request from a client computing device to backup a file comprising a plurality of data blocks and stored in primary storage,wherein a first deduplication database computing device of a plurality of deduplication database computing devices communicatively coupled to the first secondary storage computing device is configured to store a first subset of signature blocks based at least in part on a data block distribution policy and is designated as a failover deduplication database computing device for a second deduplication database computing device of the plurality of deduplication database computing devices that is configured to store, based at least in part on the data block distribution policy, a second subset of the signature blocks that is different from and does not overlap with the first subset of the signature blocks,wherein the plurality of deduplication database computing devices store the signature blocks corresponding to data blocks stored in secondary storage, wherein the data blocks stored in the secondary storage correspond to data blocks stored in primary storage, at least one signature block of the signature blocks comprising a signature of at least one data block of the plurality of data blocks, location information of the at least one data block in the secondary storage, and a reference count value indicative of a quantity of one or more references in the secondary storage to the at least one data block;

    in response to the request and using one or more processors, calculating a signature of a particular data block of the plurality of data blocks using a signature function;

    identifying the second deduplication database computing device as the deduplication database computing device assigned to store the signature of the particular data block;

    determining that the second deduplication database computing device is unavailable; and

    querying the first deduplication database computing device for the signature of the particular data block,the method further comprising at least one of;

    based at least in part on an indication from the first deduplication database computing device that the signature of the particular data block does not reside in the first deduplication database computing device, store the signature in a failover index, cause at least one storage device of the of the secondary storage to store a copy of the particular data block, and request the first deduplication database computing device to store the signature of the particular data block and a location of the copy of the particular data block, orbased at least in part on an indication from the first deduplication database computing device that the signature of the particular data block resides in the first deduplication database computing device, cause at least one storage device of the of the secondary storage to store a reference to a copy of the particular data block that is stored in the secondary storage.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×