Locating potentially identical objects across multiple computers based on stochastic partitioning of workload
Abstract
Potentially identical objects (e.g., files) are located across multiple computers based on stochastic partitioning of workload. For each of a plurality of objects stored on a plurality of computers in a network, a portion of object information corresponding to the object is selected. The object information can be generated in a variety of manners (e.g., based on hashing the object, based on characteristics of the object, and so forth). Any of a variety of portions of the object information can be used (e.g., the least significant bits of the object information). A stochastic partitioning process is then used to identify which of the plurality of computers to communicate the object information to for identification of potentially identical objects on the plurality of computers.
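The partitioning described above can be sketched roughly as follows. This is an illustrative sketch, not the patented implementation: the use of SHA-256 as the object information, the rule that the imprint width is floor(log2(n)) for n known computers, and the function names `object_info`, `imprint_width`, `imprint`, and `recipients` are all assumptions made for the example. The abstract only says that a portion of the object information (e.g., its least significant bits) is selected and used to decide which computers receive it.

```python
import hashlib


def object_info(data: bytes) -> int:
    """Object information as an integer. Hashing with SHA-256 is an assumption;
    the abstract allows other ways of generating it."""
    return int.from_bytes(hashlib.sha256(data).digest(), "big")


def imprint_width(known_computers: int) -> int:
    """Number of imprint bits. Assumed rule: floor(log2(n)), so the width
    grows as the computer learns of more computers in the network."""
    return max(known_computers, 1).bit_length() - 1


def imprint(value: int, width: int) -> int:
    """Select the `width` least significant bits of `value`."""
    return value & ((1 << width) - 1)


def recipients(info: int, computer_ids: list[int]) -> list[int]:
    """Computers whose identifier bits match the object's imprint."""
    width = imprint_width(len(computer_ids))
    code = imprint(info, width)
    return [cid for cid in computer_ids if imprint(cid, width) == code]


if __name__ == "__main__":
    ids = list(range(8))  # hypothetical computer identifiers
    print(recipients(object_info(b"same bytes"), ids))
```

Because the recipients depend only on the object information and the known identifiers, two computers holding identical objects and sharing the same view of the network report them to the same computers, which can then flag the objects as potentially identical.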
17 Claims
1. A method implemented in a computer, the method comprising:
generating an imprint identification code for an object stored at the computer, wherein the imprint identification code comprises a first set of bits of object information corresponding to the object, and wherein the number of bits in the first set of bits is based at least in part on a count of computers that the computer is aware of in the network;
identifying one or more additional computers each having a computer identifier that includes a second set of bits that match the imprint identification code; and
sending the object information to each of the identified one or more additional computers.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. One or more computer storage media having stored thereon a plurality of instructions that, when executed by one or more processors of a computer that is one of a plurality of computers in a network, causes the one or more processors to perform the following acts:
generating an imprint identification code for an object stored at the computer, wherein the imprint identification code comprises a first set of bits of object information corresponding to the object, and wherein the number of bits in the first set of bits is based at least in part on a count of computers of the plurality of computers that the computer is aware of in the network;
identifying one or more additional computers of the plurality of computers each having a computer identifier that includes a second set of bits that match the imprint identification code; and
sending the object information to each of the identified one or more additional computers.
Dependent claims: 11, 12, 13
14. A computing device comprising:
a processor;
a storage device, coupled to the processor, to store instructions that, when executed by the processor, cause the processor to:
generate an imprint identification code for an object stored at the computing device, wherein the imprint identification code comprises a first set of bits of object information corresponding to the object, and wherein the number of bits in the first set of bits is based at least in part on an average number of computing devices of the plurality of computing devices that a particular object identifier should be communicated to;
identify one or more additional computing devices each having a computing device identifier that includes a second set of bits that match the imprint identification code; and
send the object information to each of the identified one or more additional computing devices.
Dependent claims: 15, 16, 17
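Claim 14 ties the imprint width to the average number of computing devices that should receive a given object identifier. One way this relationship could work is sketched below. The formula floor(log2(n / r)) and the names `width_for_target` and `expected_recipients` are assumptions for illustration, not language from the claims, and the estimate assumes device identifiers are uniformly distributed.

```python
import math


def width_for_target(known_devices: int, target_recipients: float) -> int:
    """Assumed rule: choose the largest width w such that, on average,
    at least `target_recipients` devices match a w-bit imprint."""
    if known_devices < 1 or target_recipients <= 0:
        raise ValueError("need at least one device and a positive target")
    ratio = known_devices / target_recipients
    if ratio < 1:
        return 0  # too few devices: a zero-bit imprint matches every device
    return math.floor(math.log2(ratio))


def expected_recipients(known_devices: int, width: int) -> float:
    """With uniformly distributed identifiers, about n / 2**w devices match."""
    return known_devices / (1 << width)


if __name__ == "__main__":
    for n in (1000, 1024, 5000):
        w = width_for_target(n, 2)
        print(n, w, expected_recipients(n, w))
```

Rounding the width down keeps the expected number of recipients at or above the target when enough devices are known, at the cost of sometimes sending to nearly twice as many devices as requested.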
Specification