×

DE-DUPLICATION DEPLOYMENT PLANNING

  • US 20140358870A1
  • Filed: 09/03/2013
  • Published: 12/04/2014
  • Est. Priority Date: 06/03/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • dividing an address space of files into multiple containers;

    performing a file metadata scan, including obtaining attributes for files in each container;

    aggregating the file attributes into characterizations for each attribute dimension, and generating a content feature summary for each container incorporating the characterizations;

    measuring a content similarity prediction measurement between containers from the generated content feature summary; and

    assigning files from each container to a de-duplication domain based on the computed content similarity prediction measurement.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×