System and method for workload-aware request distribution in cluster-based network servers

US 7,117,242 B2
Filed: 06/20/2001
Issued: 10/03/2006
Est. Priority Date: 06/20/2001
Status: Expired due to Fees



Abstract

A method and system for workload-aware request distribution in cluster-based network servers. The present invention provides a web server cluster having a plurality of nodes wherein each node comprises a distributor component, a dispatcher component and a server component. In another embodiment, the present invention provides a method for managing request distribution to a set of files stored on a web server cluster. A request for a file is received at a first node of a plurality of nodes, each node comprising a distributor component, a dispatcher component and a server component. If the request is for a core file, the request is processed at the first node (e.g., processed locally). If the request is for a partitioned file, it is determined whether the request is assigned to be processed locally at the first node or at another node (e.g., processed remotely). If the request is for neither a core file nor a partitioned file, the request is processed at the first node. In one embodiment, the present invention provides a method for identifying a set of frequently accessed files on a server cluster comprising a number of nodes. Embodiments of the present invention operate to maximize the number of requests served from the total cluster memory of a web server cluster and to minimize the forwarding overhead and disk access overhead by identifying the subset of core files to be processed at any node and by identifying the subset of partitioned files to be processed by different nodes in the cluster.


18 Claims

1. A server cluster having a plurality of nodes, the server cluster comprising:
- a set of base files, wherein said base files are a set of frequently accessed files fitting into a cluster memory of said server cluster, and wherein the base files are logically partitioned into a set of core files having a core size, a set of partitioned files having a partitioned size, and a set of on disk files, and wherein a total of the partitioned size added to the product of the number of said plurality of nodes multiplied by the core size is no greater than the cluster memory; and
  
  wherein each node of said plurality of nodes comprises:
  
  at least said set of core files stored locally thereto;
  
  a distributor component for distributing a request to a specific node of said plurality of nodes;
  
  a dispatcher component comprising routing information for said plurality of nodes and replicated across said plurality of nodes, wherein said routing information indicates which said node of said plurality of nodes is for processing said request, said dispatcher component coupled to said distributor component; and
  
  a server component for processing said request, said server component coupled to said dispatcher component;
  
  wherein said plurality of nodes are coupled to a network.
- Dependent claims (2, 3, 4, 5, 14, 15, 16, 17, 18):
  - 2. The server cluster as recited in claim 1 wherein said server cluster is a web server cluster.
  - 3. The server cluster as recited in claim 1 wherein said cluster memory is a combined random access memory of each of said nodes of said server cluster.
  - 4. The server cluster as recited in claim 1 wherein each of said plurality of nodes further comprises a set of partitioned files.
  - 5. The server cluster as recited in claim 1 wherein said set of core files comprises a set of most frequently accessed files of said set of base files.
  - 14. The server cluster as recited in claim 1 wherein each file of said set of partitioned files is assigned to a particular one of said plurality of nodes for processing thereof.
  - 15. The server cluster as recited in claim 14 wherein said routing information indicates which of said plurality of nodes is assigned for processing each of said set of partitioned files.
  - 16. The server cluster as recited in claim 1 wherein said dispatcher component of a node that receives said request (“the receiving node”) comprises logic for determining whether said request is a request to access a core file, and if determined that said request is a request to access said core file then said dispatcher component of the receiving node assigns the server component of the receiving node to process said request.
  - 17. The server cluster as recited in claim 16 wherein said dispatcher component of the receiving node further comprises logic for determining whether said request is a request to access a partitioned file, and if determined that said request is a request to access said partitioned file then said dispatcher component of the receiving node determines one of said plurality of nodes assigned to process the requested partitioned file.
  - 18. The server cluster as recited in claim 17 wherein if said dispatcher component determines that the receiving node is assigned to process the requested partitioned file, the dispatcher component assigns the server component of the receiving node to process said request.
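
The memory constraint recited in claim 1 (partitioned size plus number of nodes times core size, no greater than cluster memory) reduces to a one-line check. The following is a minimal sketch of that arithmetic; the function name, the megabyte units, and the sample figures are illustrative assumptions and do not come from the patent.

```python
def fits_cluster_memory(num_nodes: int, core_size: int,
                        partitioned_size: int, cluster_memory: int) -> bool:
    """Return True when N * core_size + partitioned_size <= cluster_memory.

    Core files are stored on every node, so they are counted N times;
    partitioned files are counted once for the whole cluster.
    """
    return num_nodes * core_size + partitioned_size <= cluster_memory


# Hypothetical 4-node cluster with 512 MB of RAM per node (2048 MB in total).
print(fits_cluster_memory(4, 200, 1000, 2048))  # 4*200 + 1000 = 1800 -> True
print(fits_cluster_memory(4, 300, 1000, 2048))  # 4*300 + 1000 = 2200 -> False
```

The check makes the trade-off visible: each additional unit of core size consumes N units of cluster memory, whereas a unit of partitioned size consumes only one.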

6. A server cluster having a plurality of nodes, wherein each node of said plurality of nodes comprises:
- a distributor component for distributing a request to a specific node of said plurality of nodes;
  
  a dispatcher component comprising routing information for said plurality of nodes and replicated across said plurality of nodes, wherein said routing information indicates which said node of said plurality of nodes is for processing said request, said dispatcher component coupled to said distributor component;
  
  a server component for processing said request, said server component coupled to said dispatcher component;
  
  wherein said plurality of nodes are coupled to a network, and wherein each of said plurality of nodes further comprises a set of core files and a set of partitioned files; and
  
  wherein said set of core files is identified by the steps of:
  
  a) logically partitioning said base files into a first subset of files having a first size, a second subset of files having a second size, and a third subset of files having a third size, wherein said base files comprising each of said first subset of files, said second subset of files, and said third subset of files are ordered in decreasing frequency of access;
  
  b) identifying said first subset of files and said second subset of files wherein the total of said second size added to the product of said number of nodes multiplied by said first size is less than said cluster memory; and
  
  c) minimizing a total overhead due to the base files wherein said total overhead equals an overhead of said first subset of files plus an overhead of said second subset of files plus said overhead of said third subset of files.

7. A method for managing request distribution to a set of files stored on a server, said method comprising the steps of:
- a) receiving a request for a file at a first node of a plurality of nodes, each of said nodes comprising a distributor component for distributing a request to a specific node of said plurality of nodes, a dispatcher component comprising routing information for said plurality of nodes and replicated across said plurality of nodes, and a server component for processing said request;
  
  b) provided said request is for a core file, processing said request at said first node;
  
  c) provided said request is for a partitioned file, determining whether said request is assigned to be processed by said first node;
  
  c1) provided said request is for a partitioned file assigned to be processed by said first node, processing said request at said first node;
  
  c2) provided said request is for a partitioned file assigned to be processed by another node of said plurality of nodes, forwarding said request to a specific node of said plurality of nodes as indicated by said dispatcher component of said first node and processing said request at said specific node; and
  
  d) provided said request is not for a said core file or a said partitioned file, processing said request at said first node;
  
  wherein each of said plurality of nodes further comprises a set of core files comprising said core file and a set of partitioned files comprising said partitioned file, and wherein said set of core files is identified by the steps of:
  
  a) logically partitioning said base files into said set of core files having a core size, said set of partitioned files having a partitioned size, and a set of on disk files having an on disk size, wherein said base files comprising each of said set of core files, said set of partitioned files, and said set of on disk files are ordered in decreasing frequency of access;
  
  b) identifying said set of core files and said set of partitioned files wherein the total of said partitioned size added to the product of number of said plurality of nodes multiplied by said core size is less than memory of said plurality of nodes; and
  
  c) minimizing a total overhead due to the base files wherein said total overhead equals an overhead of said core set of files plus an overhead of said partitioned set of files plus said overhead of said on disk set of files.
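
Steps b) through d) of the request-handling part of claim 7 amount to a three-way routing decision made at the receiving node. The sketch below is one hypothetical way to express it; the `Dispatcher` class, its field names, the integer node identifiers, and the file paths are all assumptions made for illustration, not structures defined in the patent.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


@dataclass
class Dispatcher:
    """Routing information, assumed to be replicated identically on every node."""
    core_files: Set[str] = field(default_factory=set)
    partition_owner: Dict[str, int] = field(default_factory=dict)

    def target_node(self, receiving_node: int, path: str) -> int:
        """Return the id of the node that should process the request."""
        # Step b: core files are stored on every node, so process locally.
        if path in self.core_files:
            return receiving_node
        # Steps c, c1, c2: a partitioned file is processed by its assigned
        # node, which may or may not be the receiving node.
        if path in self.partition_owner:
            return self.partition_owner[path]
        # Step d: neither core nor partitioned, so process locally.
        return receiving_node


dispatcher = Dispatcher(
    core_files={"/index.html"},
    partition_owner={"/a.png": 0, "/b.png": 2},
)
for path in ("/index.html", "/b.png", "/rare.html"):
    print(path, "->", dispatcher.target_node(receiving_node=1, path=path))
```

Because the routing information is the same on every node, any node can make this decision without consulting a central front end; only the partitioned-file branch can result in a forward to another node.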

8. A method for identifying a set of frequently accessed files on a server cluster comprising a number of nodes, said method comprising the steps of:
- a) defining a set of base files, wherein said base files are a set of frequently accessed files fitting into the cluster memory of said server cluster, said base files ordered in decreasing frequency of access;
  
  b) logically partitioning said base files into a first subset of files having a first size, a second subset of files having a second size, and a third subset of files having a third size, wherein said base files comprising each of said first subset of files, said second subset of files, and said third subset of files are ordered in decreasing frequency of access;
  
  c) identifying said first subset of files and said second subset of files wherein the total of said second size added to the product of said number of nodes multiplied by said first size is not greater than said cluster memory; and
  
  d) minimizing a total overhead due to the base files wherein said total overhead equals an overhead of said first subset of files plus an overhead of said second subset of files plus said overhead of said third subset of files.
- Dependent claims (9, 10, 11, 12):
  - 9. The method as recited in claim 8 wherein said server cluster is a web server cluster.
  - 10. The method as recited in claim 8 wherein said cluster memory is a combined random access memory of each of said nodes of said web server cluster.
  - 11. The method as recited in claim 8 wherein said first subset of files is a set of core files and wherein said first size is a core size.
  - 12. The method as recited in claim 8 wherein said second subset of files is a set of partitioned files and wherein said second size is a partitioned size.
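
Claim 8 states what is minimized (the sum of three overheads) and under which constraint, but the claims do not give a formula for each overhead. The sketch below therefore uses an assumed cost model, chosen only for illustration: core requests cost nothing, a partitioned request is forwarded with probability (N - 1)/N at a fixed `forward_cost`, and an on-disk request pays a fixed `disk_cost`. The function name, parameters, and sample numbers are hypothetical. Since the base files are ordered by decreasing access frequency, the first subset is taken as a prefix of the list and the second subset as the run that follows it, and the search simply tries every feasible pair of split points.

```python
from itertools import accumulate
from typing import List, Tuple


def choose_split(sizes: List[int], freqs: List[float], num_nodes: int,
                 cluster_memory: int, forward_cost: float,
                 disk_cost: float) -> Tuple[int, int, float]:
    """Return (core_end, part_end, overhead) for the cheapest feasible split.

    files[:core_end] form the first (core) subset, files[core_end:part_end]
    the second (partitioned) subset, and files[part_end:] the third subset.
    Sizes are assumed non-negative and cluster_memory >= 0.
    """
    n = len(sizes)
    size_prefix = [0] + list(accumulate(sizes))
    freq_prefix = [0.0] + list(accumulate(freqs))
    # Assumed model: a request for a partitioned file lands on a node that
    # is not its owner with probability (N - 1) / N and is then forwarded.
    forward_share = (num_nodes - 1) / num_nodes
    best = None
    for core_end in range(n + 1):
        core_size = size_prefix[core_end]
        for part_end in range(core_end, n + 1):
            part_size = size_prefix[part_end] - core_size
            if num_nodes * core_size + part_size > cluster_memory:
                break  # a larger part_end can only increase part_size
            oh_core = 0.0  # assumed: core requests are served locally
            oh_part = ((freq_prefix[part_end] - freq_prefix[core_end])
                       * forward_share * forward_cost)
            oh_disk = (freq_prefix[n] - freq_prefix[part_end]) * disk_cost
            total = oh_core + oh_part + oh_disk
            if best is None or total < best[2]:
                best = (core_end, part_end, total)
    return best


# Four hypothetical files of equal size, most frequently accessed first.
result = choose_split(sizes=[10, 10, 10, 10], freqs=[50.0, 30.0, 15.0, 5.0],
                      num_nodes=2, cluster_memory=40,
                      forward_cost=1.0, disk_cost=2.0)
print(result)  # (1, 3, 32.5): one core file, two partitioned, one on disk
```

With a different cost model the optimum shifts: a higher `disk_cost` favours keeping more files partitioned in memory, while a higher `forward_cost` favours replicating more of them as core files.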

13. A method for determining a set of Files_core, said method comprising the steps of:
- a) defining a set of BaseFiles as a set of frequently accessed files fitting into a ClusterRAM, said BaseFiles ordered in decreasing frequency of access;
  
  b) logically partitioning said BaseFiles into a Files_part, a Files_core and a Files_{on disk} wherein BaseFiles = Files_part + Files_core + Files_{on disk};
  
  c) identifying said set Files_part and said set Files_core according to N × Size_core + Size_part ≤ ClusterRAM; and
  
  d) minimizing OH_BaseFiles according to OH_BaseFiles = OH_part + OH_core + OH_{on disk}.
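
Read together, steps b) through d) of claim 13 describe a constrained minimization. It can be restated in conventional notation as follows, assuming that N denotes the number of nodes in the cluster (claim 13 does not define N itself; the reading follows the parallel wording of claims 1, 7 and 8).

```latex
\begin{aligned}
\text{minimize}\quad   & OH_{BaseFiles} = OH_{part} + OH_{core} + OH_{on\,disk} \\
\text{subject to}\quad & N \cdot Size_{core} + Size_{part} \le ClusterRAM, \\
                       & BaseFiles = Files_{part} + Files_{core} + Files_{on\,disk}.
\end{aligned}
```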


Current Assignee: Hewlett-Packard Development Company, L.P. (HP Inc.)
Original Assignee: Hewlett-Packard Development Company, L.P. (HP Inc.)
Inventors: Karlsson, Magnus; Cherkasova, Ludmila
Primary Examiner(s): Follansbee, John
Assistant Examiner(s): Joo, Joshua
Application Number: US09/886,672
Publication Number: US 20030028636A1
Time in Patent Office: 1,931 Days
Field of Search: 709/203, 709/214-215, 709/238, 707/200, 707/201, 707/202, 707/203, 707/204, 707/205
US Class Current: 709/203
CPC Class Codes:
H04L 67/1001   for accessing one among a p...
H04L 67/10015   Access to distributed or re...
H04L 67/1017   based on a round robin mech...
H04L 67/1019   Random or heuristic server ...
H04L 67/63   Routing a service request d...
H04L 69/329   in the application layer [O...
H04L 9/40   Network security protocols
