Adaptive scaling of workloads in a distributed computing environment
First Claim
1. A method for computing and implementing automatic scaling operations in a distributed computing environment, by at least one processor, comprising:
- determining a current allocation of a specific resource used by a specific application instance of an application;
configuring a high watermark as a low bound for a high tier of consumption of the specific resource used by the specific application instance;
configuring a low watermark as a high bound for a low tier of consumption of the specific resource used by the specific application instance;
computing and implementing an increase in allocation of the specific resource for the specific application instance, when sustained consumption of the specific resource at the high tier is identified;
computing and implementing a decrease in allocation of the specific resource for the specific application instance, when the sustained consumption of the specific resource at the low tier is identified, thereby optimizing application efficiency and resource utilization in the distributed computing environment; and
identifying sustained consumption when no more than an outlying percentage of samples for consumption of the specific resource are outside of either one of the high or the low tier for a duration of a qualifying time period;
wherein the qualifying time period is one of fixed and implemented as a sliding window over time.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments for computing and implementing automatic scaling operations in a distributed computing environment. A current allocation of a specific resource used by a specific application instance of an application. A high watermark is configured as a low bound for a high tier of consumption of the specific resource used by the specific application instance, and a low watermark is configured as a high bound for a low tier of consumption of the specific resource used by the specific application instance. An increase in allocation of the specific resource for the specific application instance is computed and implemented when sustained consumption of the specific resource at the high tier is identified, and a decrease in allocation of the specific resource for the specific application instance is computed and implemented when the sustained consumption of the specific resource at the low tier is identified.
-
Citations
21 Claims
-
1. A method for computing and implementing automatic scaling operations in a distributed computing environment, by at least one processor, comprising:
-
determining a current allocation of a specific resource used by a specific application instance of an application; configuring a high watermark as a low bound for a high tier of consumption of the specific resource used by the specific application instance; configuring a low watermark as a high bound for a low tier of consumption of the specific resource used by the specific application instance; computing and implementing an increase in allocation of the specific resource for the specific application instance, when sustained consumption of the specific resource at the high tier is identified; computing and implementing a decrease in allocation of the specific resource for the specific application instance, when the sustained consumption of the specific resource at the low tier is identified, thereby optimizing application efficiency and resource utilization in the distributed computing environment; and identifying sustained consumption when no more than an outlying percentage of samples for consumption of the specific resource are outside of either one of the high or the low tier for a duration of a qualifying time period; wherein the qualifying time period is one of fixed and implemented as a sliding window over time. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for computing and implementing automatic scaling operations in a distributed computing environment, the system comprising:
-
at least one processor coupled to a memory storing computer readable program instructions that, when executed, cause the at least one processor to; determine a current allocation of a specific resource used by a specific application instance of an application; configure a high watermark as a low bound for a high tier of consumption of the specific resource used by the specific application instance; configure a low watermark as a high bound for a low tier of consumption of the specific resource used by the specific application instance; compute and implement an increase in allocation of the specific resource for the specific application instance, when sustained consumption of the specific resource at the high tier is identified; compute and implement a decrease in allocation of the specific resource for the specific application instance, when the sustained consumption of the specific resource at the low tier is identified, thereby optimizing application efficiency and resource utilization in the distributed computing environment; and identify sustained consumption when no more than an outlying percentage of samples for consumption of the specific resource are outside of either one of the high or the low tier for a duration of a qualifying time period; wherein the qualifying time period is one of fixed and implemented as a sliding window over time. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer program product for computing and implementing automatic scaling operations in a distributed computing environment, by at least one processor, the computer program product embodied on a non-transitory computer-readable storage medium having computer-readable program code portions stored therein, the computer-readable program code portions comprising:
-
an executable portion that determines a current allocation of a specific resource used by a specific application instance of an application; an executable portion that configures a high watermark as a low bound for a high tier of consumption of the specific resource used by the specific application instance; an executable portion that configures a low watermark as a high bound for a low tier of consumption of the specific resource used by the specific application instance; an executable portion that computes and implements an increase in allocation of the specific resource for the specific application instance, when sustained consumption of the specific resource at the high tier is identified; an executable portion that computes and implements a decrease in allocation of the specific resource for the specific application instance, when the sustained consumption of the specific resource at the low tier is identified, thereby optimizing application efficiency and resource utilization in the distributed computing environment; and an executable portion that identifies sustained consumption when no more than an outlying percentage of samples for consumption of the specific resource are outside of either one of the high or the low tier for a duration of a qualifying time period; wherein the qualifying time period is one of fixed and implemented as a sliding window over time. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification