Locating potentially identical objects across multiple computers based on stochastic partitioning of workload
First Claim
1. A method comprising:
- identifying an imprint for an object stored at a computer, wherein the imprint is a set of bits of object information corresponding to the object;
accessing an imprint to computer mapping;
identifying one or more computers to receive the object information based at least in part on the accessed mapping; and
sending the object information to at least one of the identified one or more computers.
1 Assignment
0 Petitions
Accused Products
Abstract
Potentially identical objects (e.g., files) are located across multiple computers based on stochastic partitioning of workload. For each of a plurality of objects stored on a plurality of computers in a network, a portion of object information corresponding to the object is selected. The object information can be generated in a variety of manners (e.g., based on hashing the object, based on characteristics of the object, and so forth). Any of a variety of portions of the object information can be used (e.g., the least significant bits of the object information). A stochastic partitioning process is then used to identify which of the plurality of computers to communicate the object information to for identification of potentially identical objects on the plurality of computers.
122 Citations
13 Claims
-
1. A method comprising:
-
identifying an imprint for an object stored at a computer, wherein the imprint is a set of bits of object information corresponding to the object;
accessing an imprint to computer mapping;
identifying one or more computers to receive the object information based at least in part on the accessed mapping; and
sending the object information to at least one of the identified one or more computers. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
Specification