ADAPTIVE FILE PLACEMENT IN A DISTRIBUTED FILE SYSTEM
First Claim
1. A computer-implemented method of peer-to-peer caching, the method comprising:
- requesting, at a first storage device, a copy of a file that is stored on a second storage device but not the first storage device;
receiving the copy of the file at the first storage device from the second storage device in response to said requesting;
storing the copy of the file on the first storage device; and
publishing, to a metadata server that is separate from a machine that controls the first storage device, information that indicates that the copy of the file is available on the first storage device.
3 Assignments
0 Petitions
Accused Products
Abstract
In a distributed system that includes multiple machines, a scheduler attempts to schedule a task on a machine that is not currently overloaded with work. If a task is scheduled on a machine that does not yet have copies of the portions of the data set on which the task needs to operate, then that machine obtains copies of those portions from other machines that already have them. Whenever a “source” machine ships a copy of a portion to another “destination” machine in the distributed system, the destination machine persistently stores that copy on the destination machine'"'"'s persistent storage mechanism. The copy also remains on the source machine. Thus, portions of the data set are automatically replicated whenever those portions are shipped between machines of the distributed system. Each machine in the distributed system has access to “global” information that indicates which machines have which portions of the data set.
70 Citations
21 Claims
-
1. A computer-implemented method of peer-to-peer caching, the method comprising:
-
requesting, at a first storage device, a copy of a file that is stored on a second storage device but not the first storage device; receiving the copy of the file at the first storage device from the second storage device in response to said requesting; storing the copy of the file on the first storage device; and publishing, to a metadata server that is separate from a machine that controls the first storage device, information that indicates that the copy of the file is available on the first storage device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21)
-
Specification