×

Self-healing by hash-based deduplication

  • US 9,928,248 B2
  • Filed: 07/08/2013
  • Issued: 03/27/2018
  • Est. Priority Date: 07/08/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method for self-healing in a hash-based deduplication system using a processor device in a computing environment, the method comprising:

  • maintaining deduplication digests of data with a corresponding list of the deduplication digests in a table of contents (TOC) for the self-healing of data that is one of lost and unreadable within each one of a plurality of user-level stored entities, the user-level stored entities each comprising a file portion of a virtual tape cartridge, such that only the deduplication digests corresponding to the data stored on a given virtual tape cartridge are listed in the TOC of the given virtual tape cartridge;

    comparing input data digests to the TOC if directed to data that is one of lost and unreadable and using the input data digests to repair the one of lost and unreadable data;

    storing by each one of the plurality of user-level stored entities a deduplication digest of data belonging to each one of the plurality of user-level stored entities and the corresponding list of the deduplication digest in the TOC, wherein the TOC is a list of references to storage blocks in a common storage area where each entry lists one of a block and range of the storage blocks and an offset and range within one of the block and list of blocks, and wherein the data in the one of the plurality of user-level stored entities is a concatenation of the data in the offset and range within one of the block and list of blocks that are listed in the list of references to storage blocks;

    determining if a digest-to-block mapping module contains an entry for a deduplication digest, the digest-to-block mapping module searching only for the deduplication digest within the given virtual tape cartridge of which the data for the entry resides;

    using the digest-to-block mapping module to look up a storage block containing the deduplication digest in the digest-to-block mapping module; and

    removing the deduplication digest from the digest-to-block mapping module when the storage block is found to be unreadable.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×