Distributed hosting of web content using partial replication
First Claim
1. A method for distributing content of a web site among multiple computing devices within a computer cluster comprising a host computing device and one or more peer computing devices, the content comprising a plurality of files, the method comprising:
- calculating a relative importance for each of the plurality of files, wherein calculating the relative importance comprises calculating a file weight (FW) based on a type weight (T) and a site weight (S) assigned to each of the plurality of files such that
FW = S × T;
creating a plurality of subsets of the content, the subsets being determined based on the relative importance of each of the plurality of files, wherein creating the plurality of subsets is further based on an individual replication amount specified for each computing device, the individual replication amount being based upon a storage amount on the computing device for holding one of the subsets and is determined by;
where D(λ_i) represents a total amount to be replicated to a peer computing device i, λ_i represents the file weight for an mth file of the web site and P_m represents a file size of the mth file of the web site; and
distributing one of the plurality of subsets to each of the multiple computing devices.
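As an illustration of the file-weight step recited in the claim, the following minimal Python sketch computes FW = S × T for a few files and ranks them by the result. The file names, the type-weight table, and the site-weight values are hypothetical; the claim specifies only the product relation, not how T and S are assigned.

```python
# Hypothetical type weights keyed by file extension (illustrative values only).
TYPE_WEIGHTS = {".html": 2.0, ".css": 1.5, ".jpg": 1.0}


def file_weight(site_weight: float, type_weight: float) -> float:
    """Return FW = S x T, the relation recited in the claim."""
    return site_weight * type_weight


# Hypothetical files: (name, extension, site weight S assigned to the file).
files = [
    ("index.html", ".html", 4.0),
    ("style.css", ".css", 2.0),
    ("photo.jpg", ".jpg", 0.5),
]

# File weight for every file of the (hypothetical) web site.
weights = {name: file_weight(s, TYPE_WEIGHTS[ext]) for name, ext, s in files}

# Rank files from most to least important by file weight.
ranking = sorted(weights, key=weights.get, reverse=True)
```

A higher file weight marks a file as relatively more important, which is the quantity the subsequent subset-creation step relies on.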
Abstract
The mechanisms described herein are directed at hosting content of a web site on multiple computing devices. A relative importance for each file associated with the web site is calculated. This relative importance is used to calculate several subsets of the content which are distributed to several devices within a computer cluster, such as a server array, peer-to-peer network, and the like. The subsets may include coded messages created using an erasure coding scheme on packets containing portions of one or more files. Upon retrieving a file, a fixed number of distinct coded messages are retrieved from the devices based on the erasure coding scheme. The file is re-created with these distinct messages. Because multiple devices hold the content, the web site may be retrieved significantly faster and the reliability is increased without consuming a large amount of storage space or bandwidth of any one computing device.
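The retrieval property described in the abstract (any fixed number of distinct coded messages suffices to re-create a file) can be sketched with the simplest possible erasure code. The abstract does not name a particular scheme, so the single XOR parity code below is an assumption chosen for brevity: it produces k + 1 messages from k packets and tolerates the loss of any one. The function names `encode` and `decode` are hypothetical.

```python
from functools import reduce


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length byte strings."""
    return bytes(x ^ y for x, y in zip(a, b))


def encode(data: bytes, k: int) -> list[tuple[int, bytes]]:
    """Split data into k equal packets and append one XOR parity message."""
    size = -(-len(data) // k)  # ceiling division
    padded = data.ljust(size * k, b"\0")
    packets = [padded[i * size:(i + 1) * size] for i in range(k)]
    parity = reduce(xor_bytes, packets)
    # Messages 0..k-1 are the data packets; message k is the parity.
    return list(enumerate(packets + [parity]))


def decode(messages: list[tuple[int, bytes]], k: int, length: int) -> bytes:
    """Re-create the data from any k distinct coded messages."""
    got = dict(messages)
    if len(got) < k:
        raise ValueError("need k distinct coded messages")
    missing = [i for i in range(k) if i not in got]
    if missing:
        # With k of the k + 1 messages in hand, at most one data packet is
        # missing; XOR of everything held (including parity) restores it.
        got[missing[0]] = reduce(xor_bytes, [got[i] for i in got])
    return b"".join(got[i] for i in range(k))[:length]


data = b"hello web site"
coded = encode(data, k=3)  # four messages, e.g. one per device
# Suppose the device holding message 1 is unavailable.
recovered = decode([coded[0], coded[2], coded[3]], k=3, length=len(data))
```

A practical deployment would presumably use a code that yields many more distinct messages than k, so that different devices can hold different messages; the sketch only demonstrates the "any k distinct messages" property.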
15 Claims
1. A method for distributing content of a web site among multiple computing devices within a computer cluster comprising a host computing device and one or more peer computing devices, the content comprising a plurality of files, the method comprising:
- calculating a relative importance for each of the plurality of files, wherein calculating the relative importance comprises calculating a file weight (FW) based on a type weight (T) and a site weight (S) assigned to each of the plurality of files such that
FW = S × T;
creating a plurality of subsets of the content, the subsets being determined based on the relative importance of each of the plurality of files, wherein creating the plurality of subsets is further based on an individual replication amount specified for each computing device, the individual replication amount being based upon a storage amount on the computing device for holding one of the subsets and is determined by;
where D(λ_i) represents a total amount to be replicated to a peer computing device i, λ_i represents the file weight for an mth file of the web site and P_m represents a file size of the mth file of the web site; and
distributing one of the plurality of subsets to each of the multiple computing devices.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A system comprising:
- a processor; and
a memory into which a plurality of instructions are loaded, the plurality of instructions performing a method comprising;
assigning a type weight to a file associated with a web site;
assigning a site weight to the file;
calculating a file weight for the file based on the type weight and the site weight;
calculating a replication ratio for the file, wherein the replication ratio is determined by;
w_{m,i} = max{1, FW_m × λ_i},
where w_{m,i} represents a peer replication ratio for a mth file for a peer i, λ_i represents a peer replication factor calculated for the peer i and FW_m represents the file weight for the mth file in the web site; and
sending the file to one of a plurality of computing devices based on the replication ratio.
Dependent claims: 14, 15
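The replication-ratio relation in claim 13 can be evaluated directly. The short Python sketch below applies the formula exactly as it is written in the claim text; the file names, file weights, and the peer replication factor are hypothetical values chosen only to show the arithmetic.

```python
def replication_ratio(file_weight: float, peer_factor: float) -> float:
    """Return w_{m,i} = max{1, FW_m x lambda_i}, as written in claim 13."""
    return max(1.0, file_weight * peer_factor)


# Hypothetical file weights FW_m for three files of a web site.
file_weights = {"index.html": 8.0, "style.css": 3.0, "photo.jpg": 0.5}

# Hypothetical peer replication factor lambda_i for one peer i.
peer_factor = 0.25

# Peer replication ratio for each file on this peer.
ratios = {
    name: replication_ratio(fw, peer_factor)
    for name, fw in file_weights.items()
}
```

With these assumed inputs, only the file whose product FW_m × λ_i exceeds 1 receives a ratio above 1; the other two are held at 1 by the max term.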
Specification