×

Method for de-duplicating data and apparatus therefor

  • US 9,851,917 B2
  • Filed: 03/07/2014
  • Issued: 12/26/2017
  • Est. Priority Date: 03/07/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method for data de-duplication, performed in an apparatus for data de-duplication, comprising:

  • obtaining access property including access time on data, modification on the data, a number of sequential accesses on the data, and a number of random accesses on the data based on input request or output request for the data;

    calculating a first difference between a current access time on the data and a previous modification time on the data;

    determining a fourth de-duplication unit having a lowest de-duplication probability as the de-duplication unit of the data when the first difference is equal to or less than a predefined first threshold;

    calculating a second difference between the current access time on the data and the previous access time on the data when the first difference is in excess of the first threshold;

    determining a first de-duplication unit having a highest de-duplication probability as the de-duplication unit of the data when the second difference is in excess of a predefined second threshold;

    determining a second de-duplication unit having a lower de-duplication probability than the first de-duplication unit as the de-duplication unit of the data when the second difference is equal to or less than the second threshold and the number of random accesses on the data is equal to and more than the number of sequential accesses on the data;

    determining a third de-duplication unit having a lower probability of being de-duplicated than the second de-duplication unit as the de-duplication unit of the data when the second difference is equal to or less than the second threshold and the number of random accesses on the data is less than the number of sequential accesses on the data;

    generating at least one data block of the data based on the determined de-duplication unit according to the access property, wherein the determined de-duplication unit is one of the first de-duplication unit, second de-duplication unit, third de-duplication unit, and fourth de-duplication unit;

    generating unique identifier for the at least one data block; and

    performing de-duplication on the data based on whether the unique identifier is in an index table or not.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×