×

Parallelizing and deduplicating backup data

  • US 10,303,656 B2
  • Filed: 08/13/2015
  • Issued: 05/28/2019
  • Est. Priority Date: 08/13/2015
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for deduplicating data, comprising:

  • selecting a grid server in a plurality of grid servers for deduplicating a segment of data in a plurality of segments of data contained within a data stream, wherein a data deduplication system communicatively coupled to the plurality of servers is configured to split the data stream into the plurality of segments and select the grid server for deduplicating the segment of data;

    forwarding the segment of data to the selected grid server for deduplication; and

    deduplicating, using the plurality of grid servers, a zone contained within the forwarded segment of data using a listing of a plurality of zone stamps, each zone stamp in the listing of the plurality of zone stamps representing a zone in a plurality of zones previously deduplicated by at least one server in the plurality of grid servers, the deduplicating includingdetermining, using the listing of the plurality of zone stamps, by a first grid server in the plurality of grid servers that a second grid server in the plurality of grid servers previously deduplicated a first zone in the plurality of zones having a first zone stamp matching to a second zone stamp of a second zone being processed by the first grid server, andtransmitting, by the first grid server, the second zone to the second grid server for deduplication.

View all claims
  • 6 Assignments
Timeline View
Assignment View
    ×
    ×