Network optimized deduplication of virtual machine snapshots

  • US 10,282,112 B2
  • Filed: 02/20/2015
  • Issued: 05/07/2019
  • Est. Priority Date: 11/04/2014
  • Status: Active Grant
First Claim

1. A method for operating a data management system, comprising:

  • acquiring a first snapshot of a first virtual machine;

    storing the first snapshot using a first storage device of a first type;

    generating a first plurality of hash values corresponding to a first signature by sampling a first plurality of data blocks within the first snapshot, the first plurality of data blocks includes a first region of data blocks wherein at least two or more data blocks are spaced at a fixed distance from each other and a second region of data blocks wherein at least two or more data blocks are spaced at increasingly greater distances from each other, the second region does not overlap with the first region;

    acquiring a second snapshot of a second virtual machine subsequent to acquiring the first snapshot of the first virtual machine, the first virtual machine and the second virtual machine comprise different virtual machines;

    generating a second plurality of hash values corresponding to a second signature by sampling a second plurality of data blocks within the second snapshot;

    determining a matching score between the first signature and the second signature by comparing the first plurality of hash values to the second plurality of hash values;

    generating a dependent base file for the second virtual machine based on the matching score, wherein the dependent base file comprises data differences between the first snapshot of the first virtual machine and the second snapshot of the second virtual machine;

    storing the dependent base file using a second storage device of a second type different from the first type;

    and generating a third snapshot of the second virtual machine using the dependent base file for the second virtual machine and the first snapshot of the first virtual machine, wherein the generating the third snapshot includes concurrently reading the dependent base file for the second virtual machine from the second storage device of the second type while reading the first snapshot of the first virtual machine from the first storage device of the first type.
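
The sampling and comparison steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the 4096-byte block size, the SHA-256 hash, the doubling gap schedule, and the names `sample_indices`, `signature`, and `matching_score` are all assumptions chosen for illustration. The claim only requires one region of samples at a fixed spacing, a second non-overlapping region at increasingly greater spacings, and a score derived from comparing the two sets of hash values.

```python
import hashlib

BLOCK_SIZE = 4096  # assumed block size in bytes (not specified in the claim)


def sample_indices(num_blocks, fixed_count=8, fixed_stride=4):
    """Return block indices to sample (hypothetical schedule).

    Region 1: blocks spaced at a fixed distance from each other.
    Region 2: starts past region 1; the gap doubles after every sample.
    """
    region1 = [i * fixed_stride for i in range(fixed_count)
               if i * fixed_stride < num_blocks]
    region2 = []
    gap = fixed_stride * 2
    idx = (region1[-1] if region1 else 0) + gap
    while idx < num_blocks:
        region2.append(idx)
        gap *= 2          # each gap is larger than the previous one
        idx += gap
    return region1 + region2


def signature(snapshot: bytes):
    """Hash each sampled block; the list of hashes is the signature."""
    num_blocks = len(snapshot) // BLOCK_SIZE
    return [
        hashlib.sha256(snapshot[i * BLOCK_SIZE:(i + 1) * BLOCK_SIZE]).hexdigest()
        for i in sample_indices(num_blocks)
    ]


def matching_score(sig_a, sig_b):
    """Fraction of sample positions whose hashes agree (assumed metric)."""
    if not sig_a or not sig_b:
        return 0.0
    matches = sum(1 for a, b in zip(sig_a, sig_b) if a == b)
    return matches / max(len(sig_a), len(sig_b))
```

With these assumed defaults, a 200-block snapshot is sampled at blocks 0, 4, ..., 28 (fixed spacing) and then at 36, 52, 84, 148 (gaps of 16, 32, 64). Two snapshots that differ in only one sampled block would therefore score 11/12 under this particular metric.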

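The last three steps (building a dependent base file from the differences, storing it on a different device, and regenerating a snapshot by reading both sources concurrently) might be sketched as follows. This is a sketch under stated assumptions, not the claimed system: the two storage devices are modeled as two ordinary file paths, the dependent base file is a pickled dictionary of changed blocks, and `make_dependent_base`, `apply_dependent_base`, and `rebuild_snapshot` are hypothetical names. The claim's condition that the file is generated "based on the matching score" is left to the caller here.

```python
import pickle
from concurrent.futures import ThreadPoolExecutor

BLOCK_SIZE = 4096  # assumed block size in bytes


def make_dependent_base(base: bytes, target: bytes) -> dict:
    """Record only the blocks of `target` that differ from `base`."""
    diffs = {}
    for off in range(0, len(target), BLOCK_SIZE):
        block = target[off:off + BLOCK_SIZE]
        if block != base[off:off + BLOCK_SIZE]:
            diffs[off] = block
    return {"length": len(target), "blocks": diffs}


def apply_dependent_base(base: bytes, dep: dict) -> bytes:
    """Overlay the recorded differences on the base snapshot."""
    length = dep["length"]
    out = bytearray(base[:length].ljust(length, b"\0"))
    for off, block in dep["blocks"].items():
        out[off:off + len(block)] = block
    return bytes(out)


def read_file(path):
    with open(path, "rb") as f:
        return f.read()


def rebuild_snapshot(base_path, dep_path):
    """Read the base snapshot and the dependent base file concurrently.

    The two paths stand in for the two storage devices of different types.
    """
    with ThreadPoolExecutor(max_workers=2) as pool:
        base_future = pool.submit(read_file, base_path)
        dep_future = pool.submit(read_file, dep_path)
        base = base_future.result()
        # pickle is used only to keep the sketch short; do not unpickle
        # untrusted data in a real system.
        dep = pickle.loads(dep_future.result())
    return apply_dependent_base(base, dep)
```

In use, the caller would write the first snapshot to one path and `pickle.dumps(make_dependent_base(first, second))` to another, then call `rebuild_snapshot` with both paths to recover the second virtual machine's snapshot.
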
  • 2 Assignments