Locating potentially identical objects across multiple computers based on stochastic partitioning of workload
Abstract
Potentially identical objects (e.g., files) are located across multiple computers based on stochastic partitioning of workload. For each of a plurality of objects stored on a plurality of computers in a network, a portion of object information corresponding to the object is selected. The object information can be generated in a variety of manners (e.g., based on hashing the object, based on characteristics of the object, and so forth). Any of a variety of portions of the object information can be used (e.g., the least significant bits of the object information). A stochastic partitioning process is then used to identify which of the plurality of computers to communicate the object information to for identification of potentially identical objects on the plurality of computers.
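The selection and partitioning steps described in the abstract can be illustrated with a minimal sketch. It rests on assumptions not stated in the source: SHA-256 stands in for the hashing step, the imprint is taken as the 16 least significant bits, and the imprint is mapped to a computer by a simple modulo. The function names `object_info`, `imprint`, and `target_computer` are hypothetical.

```python
import hashlib


def object_info(data: bytes) -> int:
    """Derive object information by hashing the object (SHA-256 is an assumption)."""
    return int.from_bytes(hashlib.sha256(data).digest(), "big")


def imprint(info: int, bits: int = 16) -> int:
    """Select a portion of the object information: its least significant bits."""
    return info & ((1 << bits) - 1)


def target_computer(imp: int, computers: list[str]) -> str:
    """Pick the computer that should receive the object information.

    Modulo mapping is an illustrative stand-in for the partitioning process.
    """
    return computers[imp % len(computers)]


computers = ["pc-a", "pc-b", "pc-c"]  # hypothetical computer names
first = target_computer(imprint(object_info(b"same contents")), computers)
second = target_computer(imprint(object_info(b"same contents")), computers)
```

Because identical objects yield identical object information, both copies are routed to the same computer, which is then in a position to notice the potential duplicate.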
8 Claims
1. A method comprising:
receiving file information corresponding to a file stored at a computer;
comparing the received file information to a file information database;
checking whether the received file information matches any of the file information in the database;
determining that two potentially identical files exist if the received file information matches any of the file information in the database; and
forwarding the received file information to another computer for storage in a file information database at the other computer.
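The acts of claim 1 might be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: the file information database is modeled as an in-memory dictionary from file information to the set of computers that reported it, the receiving computer also stores the information locally, and forwarding goes to a single peer. The class `Node` and its fields are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """A computer holding a file information database (hypothetical model)."""
    name: str
    database: dict[str, set[str]] = field(default_factory=dict)
    peer: "Node | None" = None  # other computer that also stores the information

    def receive(self, file_info: str, source: str, forward: bool = True) -> set[str]:
        """Handle file information reported for a file stored at `source`.

        Returns the other computers already recorded with matching file
        information; a non-empty result means potentially identical files exist.
        """
        # Compare the received information to the database and check for a match.
        matches = set(self.database.get(file_info, set())) - {source}
        # Assumption: the receiving computer records the information locally too.
        self.database.setdefault(file_info, set()).add(source)
        # Forward for storage at the other computer; forward=False stops a loop.
        if forward and self.peer is not None:
            self.peer.receive(file_info, source, forward=False)
        return matches


primary, backup = Node("primary"), Node("backup")
primary.peer = backup
no_match = primary.receive("hash-1", "pc-1")
match = primary.receive("hash-1", "pc-2")
```

Here the second report of `hash-1` returns `pc-1` as a potential duplicate holder, and the peer's database ends up with both entries.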
3. One or more computer-readable media having stored thereon a plurality of instructions that, when executed by one or more processors, causes the one or more processors to perform the following acts:
receiving, from a requesting computer, a request for an imprint to computer mapping, wherein the imprint comprises a portion of object information corresponding to an object;
accessing a mapping database to identify one or more computers associated with the imprint; and
returning an identification of at least one of the one or more computers to the requesting computer.
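The imprint to computer mapping lookup of claim 3 could look roughly like the sketch below. It assumes the mapping database is a plain dictionary keyed by integer imprint and that the caller may cap how many computers are returned; the names `MappingDatabase` and `lookup_computers` are hypothetical.

```python
MappingDatabase = dict[int, list[str]]


def lookup_computers(mapping_db: MappingDatabase, imprint: int, limit: int = 1) -> list[str]:
    """Answer a request for an imprint to computer mapping.

    Accesses the mapping database to find the computers associated with the
    imprint and returns up to `limit` of them to the requesting computer.
    An unknown imprint yields an empty list (an assumption of this sketch).
    """
    computers = mapping_db.get(imprint, [])
    return computers[:limit]


mapping_db: MappingDatabase = {
    0x00A1: ["pc-a", "pc-b"],  # hypothetical imprints and computer names
    0x00B2: ["pc-c"],
}
answer = lookup_computers(mapping_db, 0x00A1)
```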
Specification