Synchronized data deduplication

  • US 10,540,327 B2
  • Filed: 08/23/2017
  • Issued: 01/21/2020
  • Est. Priority Date: 07/08/2009
  • Status: Active Grant
First Claim

1. A computer-implemented data deduplication method, the method comprising:

  • with one or more computing systems of a shared storage system that maintains a deduplicated data store and that is in networked communication with a plurality of client computing systems that are physically separate from the shared storage system and share the deduplicated data store;

    determining whether a first data segment included in data generated by an application executing on a first computing system of the plurality of client computing systems is already stored in the shared storage system;

    if the first data segment is not already stored in the shared storage system, updating a central reference table of the shared storage system to include an entry corresponding to the first data segment;

    determining a first subset of the references in the central reference table for inclusion in a first updated partial instantiation of the central reference table based on one or more of data segment size information and data segment utilization frequency information, the first subset including a reference to the first data segment;

    transmitting the first updated partial instantiation of the central reference table from the shared storage system to a second computing system of the plurality of client computing systems such that, subsequent to said transmitting, a partial instantiation of the central reference table local to the second computing system includes the entry corresponding to the first data segment;

    determining a second subset of the references in the central reference table for inclusion in a second updated partial instantiation of the central reference table based on one or more of data segment size information and data segment utilization frequency information, the second subset different than the first subset; and

    transmitting the second updated partial instantiation of the central reference table to a third computing system of the plurality of client computing systems such that, subsequent to transmission of the first and second updated partial instantiations, a partial instantiation of the central reference table local to the third computing system is different from the partial instantiation of the central reference table local to the second computing system, and does not include the entry corresponding to the first data segment.
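
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `SharedStore`, `Entry`, `ingest`, and `partial_table`, the use of SHA-256 digests as references, and the simple threshold policy are all assumptions made for illustration; the claim only requires that each subset be chosen from segment size and/or utilization frequency information.

```python
import hashlib
from dataclasses import dataclass


@dataclass
class Entry:
    """Hypothetical central-table entry: segment size and a use counter."""
    size: int
    uses: int = 0


class SharedStore:
    """Hypothetical shared storage system holding the central reference table."""

    def __init__(self):
        self.table = {}  # reference (digest) -> Entry

    def ingest(self, segment: bytes) -> str:
        # Assumption: a segment is identified by its SHA-256 digest.
        ref = hashlib.sha256(segment).hexdigest()
        entry = self.table.get(ref)
        if entry is None:
            # Segment not already stored: add an entry to the central table.
            entry = Entry(size=len(segment))
            self.table[ref] = entry
        entry.uses += 1
        return ref

    def partial_table(self, min_size: int = 0, min_uses: int = 0) -> set:
        # Select a subset of references by size and utilization frequency.
        return {
            ref
            for ref, e in self.table.items()
            if e.size >= min_size and e.uses >= min_uses
        }


store = SharedStore()

# A large segment that has already been seen twice.
big_ref = store.ingest(b"x" * 4096)
store.ingest(b"x" * 4096)

# First data segment, generated on the first client and new to the store.
first_ref = store.ingest(b"new segment from client 1")

# Subset for the second client: everything used at least once.
second_client_table = store.partial_table(min_uses=1)

# Subset for the third client: only segments of at least 1 KiB.
third_client_table = store.partial_table(min_size=1024)
```

With these example thresholds, the table sent to the second client contains the reference to the first data segment, while the table sent to the third client does not, so the two local partial instantiations differ as the final claim element describes.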
