Mechanism for facilitating dynamic load balancing at application servers in an on-demand services environment
First Claim
Patent Images
1. A computer-implemented method comprising:
- polling a plurality of application servers for status, wherein polling includes periodically pinging each of the plurality of application servers;
in response to the polling, receiving status reports corresponding to the plurality of application servers, wherein each status report includes data analysis relating to an availability or a capability of each of the plurality of application servers, wherein the status reports are generated based on sets of details provided, in real-time, by a plurality virtual machines associated with the plurality of application servers, wherein the sets of details are collected, in real-time, by one or more virtual machines associated with each application server;
assigning priority levels to the plurality of application servers based on the status reports, wherein each application server is assigned a single priority level based on its corresponding status report, wherein the assigned priority levels are dynamically adjusted based on the real-time collection of the sets of details by the plurality of virtual machines, wherein, based on the real-time collection of the sets of details, a high priority level of the assigned priority levels is assigned to one or more application servers that, in comparison with other applications, have more available space in a heap or performed fewer full garbage collections during a last or any particular interval of time; and
facilitating dynamic load balancing at the plurality of application servers based on the assigned priority levels.
1 Assignment
0 Petitions
Accused Products
Abstract
In accordance with embodiments, there are provided mechanisms and methods for facilitating dynamic load balancing at application servers in an on-demand services environment. In one embodiment and by way of example, a method includes polling a plurality of application servers for status, receiving status from each of the plurality of application servers, assigning a priority level to each of the plurality of application servers based on its corresponding status, and facilitating load balancing at the plurality of application servers based on their corresponding priority levels.
-
Citations
18 Claims
-
1. A computer-implemented method comprising:
-
polling a plurality of application servers for status, wherein polling includes periodically pinging each of the plurality of application servers; in response to the polling, receiving status reports corresponding to the plurality of application servers, wherein each status report includes data analysis relating to an availability or a capability of each of the plurality of application servers, wherein the status reports are generated based on sets of details provided, in real-time, by a plurality virtual machines associated with the plurality of application servers, wherein the sets of details are collected, in real-time, by one or more virtual machines associated with each application server; assigning priority levels to the plurality of application servers based on the status reports, wherein each application server is assigned a single priority level based on its corresponding status report, wherein the assigned priority levels are dynamically adjusted based on the real-time collection of the sets of details by the plurality of virtual machines, wherein, based on the real-time collection of the sets of details, a high priority level of the assigned priority levels is assigned to one or more application servers that, in comparison with other applications, have more available space in a heap or performed fewer full garbage collections during a last or any particular interval of time; and facilitating dynamic load balancing at the plurality of application servers based on the assigned priority levels. - View Dependent Claims (2, 3, 10, 11, 12)
-
-
4. A system comprising:
-
a computing device having a memory to store instructions, and a processing device to execute the instructions, wherein the instructions cause the processing device to; poll a plurality of application servers for status, wherein polling includes periodically pinging each of the plurality of application servers; in response to the polling, receive status reports corresponding to the plurality of application servers, wherein each status report includes data analysis relating to an availability or a capability of each of the plurality of application servers, wherein the status reports are generated based on sets of details provided, in real-time, by a plurality virtual machines associated with the plurality of application servers, wherein the sets of details are collected, in real-time, by one or more virtual machines associated with each application server; assign priority levels to the plurality of application servers based on the status reports, wherein each application server is assigned a single priority level based on its corresponding status report, wherein the assigned priority levels are dynamically adjusted based on the real-time collection of the sets of details by the plurality of virtual machines, wherein, based on the real-time collection of the sets of details, a high priority level of the assigned priority levels is assigned to one or more application servers that, in comparison with other applications, have more available space in a heap or performed fewer full garbage collections during a last or any particular interval of time; and facilitate dynamic load balancing at the plurality of application servers based on the assigned priority levels. - View Dependent Claims (5, 6, 13, 14, 15)
-
-
7. At least one non-transitory machine-readable medium having stored thereon instructions which, when executed by a machine, cause the machine to:
-
poll a plurality of application servers for status, wherein polling includes periodically pinging each of the plurality of application servers; in response to the polling, receive status reports corresponding to the plurality of application servers, wherein each status report includes data analysis relating to an availability or a capability of each of the plurality of application servers, wherein the status reports are generated based on sets of details provided, in real-time, by a plurality virtual machines associated with the plurality of application servers, wherein the sets of details are collected, in real-time, by one or more virtual machines associated with each application server; assign priority levels to the plurality of application servers based on the status reports, wherein each application server is assigned a single priority level based on its corresponding status report, wherein the assigned priority levels are dynamically adjusted based on the real-time collection of the sets of details by the plurality of virtual machines, wherein, based on the real-time collection of the sets of details, a high priority level of the assigned priority levels is assigned to one or more application servers that, in comparison with other applications, have more available space in a heap or performed fewer full garbage collections during a last or any particular interval of time; and facilitate dynamic load balancing at the plurality of application servers based on the assigned priority levels. - View Dependent Claims (8, 9, 16, 17, 18)
-
Specification