×

Systems and methods for classifying files as candidates for deduplication

  • US 8,849,768 B1
  • Filed: 03/08/2011
  • Issued: 09/30/2014
  • Est. Priority Date: 03/08/2011
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for classifying files as candidates for deduplication, at least a portion of the method being performed by a computing device comprising at least one processor, the method comprising:

  • identifying at least a portion of a file;

    detecting at least one event that is suggestive of a duplicate instance of the portion of the file already being stored within a storage device prior to determining whether the duplicate instance of the portion of the file is already stored within the storage device, the event comprising an attempt by a user of the computing device to mark the file to indicate that the file is likely to be duplicated on a plurality of computing devices that are backed up at the storage device;

    in response to detecting the event, classifying the file as a candidate for deduplication such that the file'"'"'s candidate-for-deduplication classification indicates that the duplicate instance of the portion of the file is likely already stored within the storage device;

    maintaining the file'"'"'s candidate-for-deduplication classification for use in prompting a determination on whether the duplicate instance of the portion of the file is already stored within the storage device by maintaining, within the file, an attribute associated with the file that indicates that the file is a candidate for deduplication;

    during deduplication or backup of data within a storage system;

    identifying, within the file, the attribute associated with the file;

    determining, based on the attribute associated with the file, that the file is a candidate for deduplication;

    in response to determining that the file is a candidate for deduplication, determining whether the portion of the file is already stored within the storage device.

View all claims
  • 7 Assignments
Timeline View
Assignment View
    ×
    ×