High availability distributed deduplicated storage system

  • US 9,665,591 B2
  • Filed: 01/10/2014
  • Issued: 05/30/2017
  • Est. Priority Date: 01/11/2013
  • Status: Active Grant
First Claim

1. A computer-implemented method of performing a storage operation in a distributed, deduplicated storage system, the method comprising:

  • during a period of availability of a first deduplication database computing device of a plurality of deduplication database computing devices and a second deduplication database computing device of the plurality of deduplication database computing devices,
      the plurality of deduplication database computing devices storing signature blocks corresponding to a plurality of data blocks stored in one or more secondary storage devices of secondary storage, the plurality of data blocks corresponding to data blocks received from primary storage,
      at least one signature block of the signature blocks comprising a signature of at least one data block of the plurality of data blocks, location information of the at least one data block in the one or more secondary storage devices, and a reference count value indicative of a quantity of one or more references in the secondary storage to the at least one data block,
      the first deduplication database computing device configured to store a first subset of the signature blocks based at least in part on a data block distribution policy and designated as a failover deduplication database computing device for the second deduplication database computing device that is configured to store, based at least in part on the data block distribution policy, a second subset of the signature blocks that is different from and does not overlap with the first subset of the signature blocks;

    receiving at a secondary storage computing device comprising a failover index and communicatively coupled to the plurality of deduplication database computing devices, a first set of one or more signatures corresponding to one or more data blocks stored in primary storage;

    identifying, based at least in part on the data block distribution policy, the second deduplication database computing device as the deduplication database computing device assigned to store the first set of one or more signatures;

    determining, based at least in part on a query of the failover index, that at least one signature of the first set of one or more signatures matches at least one signature of a second set of one or more signatures that was stored in the first deduplication database computing device during a previous period of unavailability of the second deduplication database computing device;

    querying the first deduplication database computing device for the at least one signature of the first set of one or more signatures;

    receiving from the first deduplication database computing device a location of a copy of a data block corresponding to the at least one signature of the first set of one or more signatures; and

    storing in the secondary storage the location of the copy of the data block corresponding to the at least one signature of the first set of one or more signatures.
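
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. All class, method, and field names (`SignatureBlock`, `DedupDatabase`, `SecondaryStorageNode`, `failover_of`, `stored_locations`) are hypothetical, the modulo-based distribution policy is an assumption chosen only for brevity, and incrementing the reference count on a match is an assumption about how that value might be maintained.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class SignatureBlock:
    """Signature, location and reference count, as recited in the claim."""
    signature: str
    location: str
    ref_count: int = 1


@dataclass
class DedupDatabase:
    """Hypothetical stand-in for one deduplication database computing device."""
    name: str
    available: bool = True
    blocks: Dict[str, SignatureBlock] = field(default_factory=dict)

    def lookup(self, signature: str) -> Optional[SignatureBlock]:
        return self.blocks.get(signature)


class SecondaryStorageNode:
    """Hypothetical secondary storage computing device holding a failover index."""

    def __init__(self, ddbs: List[DedupDatabase], failover_of: Dict[int, int]):
        self.ddbs = ddbs
        self.failover_of = failover_of            # assigned index -> failover index
        self.failover_index: Dict[str, int] = {}  # signature -> index that holds it
        self.stored_locations: Dict[str, str] = {}

    def assigned(self, signature: str) -> int:
        # Assumed distribution policy: hex signature modulo the number of databases.
        return int(signature, 16) % len(self.ddbs)

    def store(self, signature: str, location: str) -> None:
        """Record a new block, redirecting to the failover database if needed."""
        idx = self.assigned(signature)
        if not self.ddbs[idx].available:
            idx = self.failover_of[idx]
            self.failover_index[signature] = idx  # remember the redirection
        self.ddbs[idx].blocks[signature] = SignatureBlock(signature, location)

    def resolve(self, signature: str) -> Optional[str]:
        """Return the location of an existing copy, or None if the block is new."""
        # Consult the failover index first, then fall back to the assigned database.
        idx = self.failover_index.get(signature, self.assigned(signature))
        block = self.ddbs[idx].lookup(signature)
        if block is None:
            return None
        block.ref_count += 1                      # assumption: one more reference
        self.stored_locations[signature] = block.location
        return block.location


if __name__ == "__main__":
    first, second = DedupDatabase("ddb-1"), DedupDatabase("ddb-2")
    node = SecondaryStorageNode([first, second], failover_of={1: 0})
    second.available = False                      # previous period of unavailability
    node.store("01", "disk0/chunk7")              # lands in the first database
    second.available = True                       # both databases available again
    print(node.resolve("01"))
```

In this sketch, signature `01` is assigned to the second database by the assumed policy, but because it was written while that database was unavailable, the failover index directs the later lookup to the first database, which returns the stored location.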
