Compound token buckets for burst-mode admission control
First Claim
Patent Images
1. A system, comprising:
- one or more computing devices configured to;
determine a peak burst rate at which work requests directed to a work target are to be accepted for execution during a burst mode of operation, and a peak burst window size indicative of a maximum duration for which work requests at the peak burst rate are to be accepted;
determine a sustained burst rate smaller than the peak burst rate, and a sustained burst window size greater than the peak burst window size, wherein the sustained burst window size is indicative of a maximum duration for which work requests directed to the work target at the sustained burst rate are to be accepted during the burst mode of operation;
configure a first maximum token population, a first initial token population, and a first token consumption policy for a peak-burst token bucket to be used for admission control of work requests directed at the work target, wherein the first maximum token population is based at least in part on the peak burst rate;
configure a second maximum token population, a second initial token population, and a second token consumption policy for a sustained-burst token bucket to be used for admission control of work requests directed at the work target, wherein the second maximum token population is based at least in part on the sustained burst rate; and
in response to receiving a work request directed at the work target during a burst mode of operation, determine to accept the work request for execution based at least in part on the respective token populations of the peak-burst token bucket and the sustained-burst token bucket.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and apparatus for compound token buckets usable for burst-mode admission control are disclosed. A peak burst rate and a sustained burst rate of work requests that are to be supported at a work target are determined. The maximum token populations of a peak-burst token bucket and a sustained-burst token bucket are configured, based on the peak burst rate and the sustained burst rate respectively. In response to receiving a work request directed at the work target, a determination to accept the work request for execution is made based at least in part on the token population of the peak-burst token bucket and/or the sustained-burst token bucket.
29 Citations
18 Claims
-
1. A system, comprising:
one or more computing devices configured to; determine a peak burst rate at which work requests directed to a work target are to be accepted for execution during a burst mode of operation, and a peak burst window size indicative of a maximum duration for which work requests at the peak burst rate are to be accepted; determine a sustained burst rate smaller than the peak burst rate, and a sustained burst window size greater than the peak burst window size, wherein the sustained burst window size is indicative of a maximum duration for which work requests directed to the work target at the sustained burst rate are to be accepted during the burst mode of operation; configure a first maximum token population, a first initial token population, and a first token consumption policy for a peak-burst token bucket to be used for admission control of work requests directed at the work target, wherein the first maximum token population is based at least in part on the peak burst rate; configure a second maximum token population, a second initial token population, and a second token consumption policy for a sustained-burst token bucket to be used for admission control of work requests directed at the work target, wherein the second maximum token population is based at least in part on the sustained burst rate; and in response to receiving a work request directed at the work target during a burst mode of operation, determine to accept the work request for execution based at least in part on the respective token populations of the peak-burst token bucket and the sustained-burst token bucket. - View Dependent Claims (2, 3, 4, 5)
-
6. A method, comprising:
performing, by one or more computing devices; determining a peak burst rate at which work requests directed to a work target are to be accepted during a burst mode of operation; determining a sustained burst rate at which work requests directed to the work target are to be accepted for up to a particular duration during the burst mode of operation; configuring a first maximum token population for a peak-burst token bucket to be used for admission control of work requests directed at the work target, and a second maximum token population for a sustained-burst token bucket to be used for admission control of work requests directed at the work target, wherein the first maximum token population is based at least in part on the peak burst rate and the second maximum token population is based at least in part on the sustained burst rate, wherein the first token maximum token population is based at least in part on a peak burst time window indicative of a maximum duration for which the work requests directed to the work target are to be accepted for execution by the work target at the peak burst rate, and wherein the second maximum token population is based at least in part on a sustained burst time window indicative of the particular duration; and in response to receiving a work request directed at the work target during a burst mode of operation, determining to accept the work request for execution based at least in part on a token population of at least one of;
the peak-burst token bucket and the sustained-burst token bucket.- View Dependent Claims (7, 8, 9, 10, 11, 12)
-
13. A non-transitory computer-accessible storage medium storing program instructions that when executed on one or more processors:
-
determine a peak burst rate at which work requests directed to a work target are to be accepted; determine a sustained burst rate at which work requests directed to the work target are to be accepted for up to a particular duration; configure a first maximum token population for a peak-burst token bucket to be used for admission control of work requests directed at the work target and a second maximum token population for a sustained-burst token bucket to be used for admission control of work requests directed at the work target, wherein the first maximum token population is based at least in part on the peak burst rate and the second maximum token population is based at least in part on the sustained burst rate, wherein the first token maximum token population is based at least in part on a peak burst time window indicative of a maximum duration for which the work requests directed to the work target are to be accepted for execution by the work target at the peak burst rate, and wherein the second maximum token population is based at least in part on a sustained burst time window indicative of the particular duration; and in response to receiving a work request directed at the work target, determine to accept the work request for execution based at least in part on a token population of at least one of;
the peak-burst token bucket and the sustained-burst token bucket. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification