
Locating potentially identical objects across multiple computers based on stochastic partitioning of workload

  • US 20050216538A1
  • Filed: 05/20/2005
  • Published: 09/29/2005
  • Est. Priority Date: 06/06/2001
  • Status: Active Grant
First Claim

1. A method, implemented in a computer that is part of a plurality of computers in a network, comprising:

  • selecting a portion of file information corresponding to a file stored on one of the plurality of computers;

    comparing, for each of the plurality of computers, the selected portion to a portion of a computer identifier associated with the computer;

    identifying which of the computer identifiers have portions matching the selected portion of the file information;

    communicating, for identification of potentially identical files stored on the plurality of computers, the file information to each of the computers associated with a computer identifier having a portion matching the selected portion of the file information; and

    wherein a value W represents the size of the portion of the file information, wherein a value M represents a count of computers that the one computer is aware of in the network, wherein a value R is a system configuration value calculated based on an average number of computers that a particular file identifier should be communicated to, wherein lg is a base 2 logarithm function, wherein floor brackets indicate the largest integer that is no greater than the enclosed value, and wherein the value W is determined as follows:

    $$W = \left\lfloor \lg \frac{M}{R} \right\rfloor.$$
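The steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the formula reads W = floor(lg(M / R)), that file information and computer identifiers are fixed-width integers, and that the "portion" is the W most significant bits. The names `portion_width`, `top_bits`, and `matching_computers`, the 16-bit width, and the clamping of W at zero are all hypothetical choices made for illustration.

```python
import math


def portion_width(m: int, r: float) -> int:
    """Return W = floor(lg(M / R)).

    Clamping at zero is an assumption of this sketch (the claim does not
    say what happens when M < R).
    """
    if m <= 0 or r <= 0:
        raise ValueError("M and R must be positive")
    return max(0, math.floor(math.log2(m / r)))


def top_bits(value: int, total_bits: int, width: int) -> int:
    """Return the `width` most significant bits of a `total_bits`-bit value.

    Using the leading bits as the 'portion' is an assumption of this sketch.
    """
    if width == 0:
        return 0
    return value >> (total_bits - width)


def matching_computers(file_info: int, computer_ids: list[int],
                       r: float, total_bits: int = 16) -> list[int]:
    """Return the identifiers whose leading W bits match those of file_info."""
    w = portion_width(len(computer_ids), r)  # M = number of known computers
    selected = top_bits(file_info, total_bits, w)
    return [cid for cid in computer_ids
            if top_bits(cid, total_bits, w) == selected]


# Hypothetical example: 8 known computers, R = 2, so W = floor(lg(8 / 2)) = 2.
ids = [0x0001, 0x4000, 0x8000, 0xC000, 0x3FFF, 0x7FFF, 0xBFFF, 0xFFFF]
targets = matching_computers(0x4ABC, ids, r=2)
print([hex(t) for t in targets])
```

In this example the file information would be communicated to the two computers whose identifiers share its leading two bits, which agrees with R describing the average number of recipients per file identifier when identifiers are evenly spread.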

  • 1 Assignment