×

Efficient content meta-data collection and trace generation from deduplicated storage

  • US 8,667,032 B1
  • Filed: 12/22/2011
  • Issued: 03/04/2014
  • Est. Priority Date: 12/22/2011
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for collecting meta-data from a deduplication data storage system, the method comprising:

  • collecting a set of file recipes for a set of files stored in the deduplication data storage system, each file recipe in the set of file recipes including a fingerprint for each unique data chunk that constitutes a file, wherein each fingerprint identifies each corresponding unique data chunk;

    collecting meta-data for a set of unique data chunks for the collected set of files by a data collection engine, wherein the meta-data describes the unique data chunks;

    anonymizing the collected set of file recipes and the meta-data by an anonymizing engine; and

    storing the anonymized set of file recipes and the anonymized meta-data in a data collection storage unit for content data set analysis without the content data set.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×