De-duplicating distributed file system using cloud-based object store

  • US 10,191,914 B2
  • Filed: 03/31/2015
  • Issued: 01/29/2019
  • Est. Priority Date: 03/31/2015
  • Status: Active Grant
First Claim

1. A method of storing file system data, comprising:

    receiving, at a processor, a request to store a file comprising a plurality of chunks of file data;

    determining, by the processor, to store at least a first subset of the plurality of chunks of file data based at least in part on a chunk identifier, wherein a second subset of the plurality of chunks of file data are already stored at a remote storage;

    determining, by the processor, a deduplication chunk size for the plurality of chunks of file data of the first subset, wherein the deduplication chunk size facilitates achieving a desired deduplication performance when storing the first subset of the plurality of chunks of file data, and wherein the deduplication chunk size is larger than a chunk size of a chunk included in the first subset of the plurality of chunks;

    selecting, by the processor, which chunks of the first subset of the plurality of chunks of file data to combine into a single stored object that satisfies the deduplication chunk size associated with the desired deduplication performance;

    combining, by the processor, the selected chunks of the first subset of the plurality of chunks of file data into the single stored object satisfying the deduplication chunk size; and

    providing, by the processor, the single stored object that includes the combined selected chunks of the first subset of the plurality of chunks of file data to the remote storage, wherein the remote storage is configured to store the provided single stored object, and wherein storing the single stored object achieves a better deduplication performance than would be achieved if the combined selected chunks were stored individually.
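
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation and rests on several assumptions that do not come from the claim text: the chunk identifier is taken to be a SHA-256 content hash, the remote storage is stood in for by an in-memory dictionary, and new chunks are packed greedily in arrival order until the combined object reaches the target size. All function and parameter names (chunk_id, select_new_chunks, pack_chunks, store_file, target_size) are hypothetical.

```python
import hashlib
from typing import Dict, List, Set, Tuple


def chunk_id(data: bytes) -> str:
    # Assumption: the chunk identifier is a SHA-256 content hash.
    return hashlib.sha256(data).hexdigest()


def select_new_chunks(chunks: List[bytes], known_ids: Set[str]) -> List[Tuple[str, bytes]]:
    # Keep only chunks whose identifier is not already stored remotely
    # (the "first subset"); duplicates within the same file are skipped too.
    new_chunks: List[Tuple[str, bytes]] = []
    seen: Set[str] = set()
    for data in chunks:
        cid = chunk_id(data)
        if cid in known_ids or cid in seen:
            continue
        seen.add(cid)
        new_chunks.append((cid, data))
    return new_chunks


def pack_chunks(new_chunks: List[Tuple[str, bytes]], target_size: int) -> List[List[Tuple[str, bytes]]]:
    # Greedy packing (an assumption): append chunks until the group
    # reaches target_size, then start a new group.
    groups: List[List[Tuple[str, bytes]]] = []
    current: List[Tuple[str, bytes]] = []
    current_size = 0
    for cid, data in new_chunks:
        current.append((cid, data))
        current_size += len(data)
        if current_size >= target_size:
            groups.append(current)
            current, current_size = [], 0
    if current:
        groups.append(current)  # the trailing group may be undersized
    return groups


def store_file(
    chunks: List[bytes],
    known_ids: Set[str],
    remote: Dict[str, bytes],
    target_size: int,
) -> Dict[str, Tuple[str, int, int]]:
    # Returns a map: chunk id -> (object key, offset, length) for new chunks.
    index: Dict[str, Tuple[str, int, int]] = {}
    new_chunks = select_new_chunks(chunks, known_ids)
    for group in pack_chunks(new_chunks, target_size):
        blob = b"".join(data for _, data in group)
        object_key = chunk_id(blob)
        remote[object_key] = blob  # stand-in for an upload to remote storage
        offset = 0
        for cid, data in group:
            index[cid] = (object_key, offset, len(data))
            offset += len(data)
            known_ids.add(cid)
    return index
```

In this sketch, a file of five 4-byte chunks in which one chunk is already stored remotely and one repeats within the file yields three new chunks. With target_size set to 8, the first two are combined into one stored object and the third is written as a smaller trailing object. The returned index records where each chunk sits inside its combined object so it can be read back later.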
