Runtime load balancing of work across a clustered computing system using current service performance levels

US 20050256971A1
Filed: 06/27/2005
Published: 11/17/2005
Est. Priority Date: 08/14/2003
Status: Abandoned Application

First Claim

Patent Images

1. A computer-implemented method for determining how much work to route to computing nodes in a computing system that comprises a plurality of nodes that each hosts a server instance that provides a service that performs work, the method comprising:

based on a current moving average of a performance metric, from each of a plurality of server instances that provide a particular service, that is associated with the particular service, computing a performance grade for each of the plurality of server instances; and

computing, based on the respective performance grades, a percentage of work to route to each of the plurality of server instances.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Runtime load balancing of work across a clustered computing system involves servers calculating, and clients utilizing, current service performance grades of each instance in the system. A performance grade for an instance is based on performance metrics for that instance, where the computation used may vary by policy. Examples of possible policies include: (a) using estimated bandwidth as a performance grade, (b) using spare capacity as a performance grade, or (c) using response time as a performance grade. Clients distribute work requests across servers in the system as the requests arrive. Work requests can be distributed according to performance grades, and/or flags associated with the performance grades. Automatically and intelligently directing work requests to the best server instances, based on real-time service performance metrics, minimizes the need to manually relocate work within the clustered system.

105 Citations

View as Search Results

18 Claims

1. A computer-implemented method for determining how much work to route to computing nodes in a computing system that comprises a plurality of nodes that each hosts a server instance that provides a service that performs work, the method comprising:
- based on a current moving average of a performance metric, from each of a plurality of server instances that provide a particular service, that is associated with the particular service, computing a performance grade for each of the plurality of server instances; and
  
  computing, based on the respective performance grades, a percentage of work to route to each of the plurality of server instances.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
- - 2. The method of claim 1, further comprising:
    - publishing the percentages to one or more subscribing clients; and
      
      routing work, by at least one of the subscribing clients and based on the percentages, to particular nodes in the computing system.
  - 3. The method of claim 2, wherein the step of publishing includes posting events to an event queue, wherein each event is associated with one particular service.
  - 4. The method of claim 2, wherein the step of publishing the performance grades includes periodically publishing the percentages, wherein the period is based on a rate at which requests for the service are received.
  - 5. The method of claim 2, further comprising:
    - publishing a flag to the one or more subscribing clients, in association with each percentage, wherein the flag indicates any one from the group consisting of (a) the performance grade was computed for this instance, (b) the service on this instance is violating a service level agreement associated with the service, (c) the performance grade was not computed for this instance, and (d) the performance grade was not computed for this instance, but work can be routed to this instance; and
      
      routing work, by at least one of the subscribing clients and based on the percentage and associated flags, to nodes in the computing system.
  - 6. The method of claim 1, wherein the step of computing performance grades includes computing a performance grade based on a performance metric associated with response time of the service as provided by the server instance.
  - 7. The method of claim 1, wherein the step of computing performance grades includes computing a performance grade based on a performance metric associated with throughput of the respective node on which the server instances is executing.
  - 8. The method of claim 1, wherein the step of computing performance grades includes computing a performance grade based on a performance metric associated with spare capacity of the respective node on which the server instances is executing.
  - 9. The method of claim 1, wherein the step of computing performance grades includes applying one or more weighting factors to the respective moving averages of the performance metric;
    - wherein the weighting factors are based, at least in part, on any one or more from the group consisting of available CPU processing, available 10 processing, and available network communication processing within the computing system.
  - 10. A machine-readable medium carrying one or more sequences of instructions which, when executed by one or more processors, causes the one or more processors to perform the method recited in claim 1.
  - 11. A machine-readable medium carrying one or more sequences of instructions which, when executed by one or more processors, causes the one or more processors to perform the method recited in claim 2.
  - 12. A machine-readable medium carrying one or more sequences of instructions which, when executed by one or more processors, causes the one or more processors to perform the method recited in claim 3.
  - 13. A machine-readable medium carrying one or more sequences of instructions which, when executed by one or more processors, causes the one or more processors to perform the method recited in claim 4.
  - 14. A machine-readable medium carrying one or more sequences of instructions which, when executed by one or more processors, causes the one or more processors to perform the method recited in claim 5.
  - 15. A machine-readable medium carrying one or more sequences of instructions which, when executed by one or more processors, causes the one or more processors to perform the method recited in claim 6.
  - 16. A machine-readable medium carrying one or more sequences of instructions which, when executed by one or more processors, causes the one or more processors to perform the method recited in claim 7.
  - 17. A machine-readable medium carrying one or more sequences of instructions which, when executed by one or more processors, causes the one or more processors to perform the method recited in claim 8.
  - 18. A machine-readable medium carrying one or more sequences of instructions which, when executed by one or more processors, causes the one or more processors to perform the method recited in claim 9.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Oracle International Corporation (Oracle Corporation)
Original Assignee
Oracle International Corporation (Oracle Corporation)
Inventors
Simmons, Charles, Pruscino, Angelo, Colrain, Carol, Semler, Daniel, Pommerenk, Stefan

Application Number

US11/168,967
Publication Number

US 20050256971A1
Time in Patent Office

Days
Field of Search
US Class Current

709/238
CPC Class Codes

G06F 2209/508   Monitor

G06F 9/505   considering the load

G06F 9/5083   Techniques for rebalancing ...

Runtime load balancing of work across a clustered computing system using current service performance levels

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

105 Citations

18 Claims

Specification

Solutions

Use Cases

Quick Links

Runtime load balancing of work across a clustered computing system using current service performance levels

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

105 Citations

18 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links