Method and system for multi-tenant resource distribution
First Claim
1. A method of resource allocation in a distributed computing system comprising a plurality of tenants organized in a tiered hierarchy, the plurality of tenants including a root tenant associated with at least two sub-tenants, and each sub-tenant of the two sub-tenants associated with one or more corresponding leaf tenants, the method comprising:
- receiving, at a distributed resource manager, a plurality of requests from said plurality of tenants for allocation of quantities of resources to workloads of said tenants;
allocating, at the distributed resource manager, said quantities of resources to said workloads of said plurality of tenants in accordance with a distribution policy, the distribution policy defining a resource entitlement for each of said one or more corresponding leaf tenants and defining a hierarchical resource entitlement for each of the at least two sub-tenants, said resource entitlement for each of the plurality of tenants including a guaranteed quantity of resources for said each of the plurality of tenants; and
in response to determining, at the distributed resource manager, that a first quantity of resources allocated for workloads of a first sub-tenant of said plurality of tenants is less than the guaranteed quantity of resources for said first sub-tenant;
selecting, by the distributed resource manager, a second sub-tenant from among said plurality of tenants based on a comparison of said guaranteed quantity of resources for said second sub-tenant and a quantity of resources allocated to workloads of said second sub-tenant, andinterrupting processing of a workload of said second sub-tenant prior to completing processing of said workload of said second sub-tenant and re-allocating the quantity of resources allocated to said workload of said second sub-tenant to a workload of said workloads of said first sub-tenant.
2 Assignments
0 Petitions
Accused Products
Abstract
In a distributed computing network, requests for allocation of resources to tenant workloads and messages identifying resource availability are received and aggregated. Resources are allocated to the workloads in accordance with a distribution policy defining values for resource entitlements of the tenants. The values include pre-emption quantities. In response to determining that a quantity of resources allocated for workloads of a first tenant is less than the tenant'"'"'s pre-emption quantity, processing of another workload from a second tenant is interrupted to re-allocate resources from the second tenant'"'"'s workload to the first tenant'"'"'s workload.
20 Citations
23 Claims
-
1. A method of resource allocation in a distributed computing system comprising a plurality of tenants organized in a tiered hierarchy, the plurality of tenants including a root tenant associated with at least two sub-tenants, and each sub-tenant of the two sub-tenants associated with one or more corresponding leaf tenants, the method comprising:
-
receiving, at a distributed resource manager, a plurality of requests from said plurality of tenants for allocation of quantities of resources to workloads of said tenants; allocating, at the distributed resource manager, said quantities of resources to said workloads of said plurality of tenants in accordance with a distribution policy, the distribution policy defining a resource entitlement for each of said one or more corresponding leaf tenants and defining a hierarchical resource entitlement for each of the at least two sub-tenants, said resource entitlement for each of the plurality of tenants including a guaranteed quantity of resources for said each of the plurality of tenants; and in response to determining, at the distributed resource manager, that a first quantity of resources allocated for workloads of a first sub-tenant of said plurality of tenants is less than the guaranteed quantity of resources for said first sub-tenant; selecting, by the distributed resource manager, a second sub-tenant from among said plurality of tenants based on a comparison of said guaranteed quantity of resources for said second sub-tenant and a quantity of resources allocated to workloads of said second sub-tenant, and interrupting processing of a workload of said second sub-tenant prior to completing processing of said workload of said second sub-tenant and re-allocating the quantity of resources allocated to said workload of said second sub-tenant to a workload of said workloads of said first sub-tenant. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A master server of a distributed computing system comprising a plurality of resource servers, the master server comprising:
-
a resource collection module for receiving messages identifying available resources associated with said resource servers; a demand collection module for receiving messages identifying resource requests from a plurality of tenants of said distributed computing system, the plurality of tenants organized in a tiered hierarchy, the plurality of tenants including a root tenant associated with at least two sub-tenants, and each sub-tenant of the two sub-tenants associated with one or more corresponding leaf tenants; a data structure comprising a distribution policy for said available resources, said distribution policy containing a resource entitlement for each of said one or more corresponding leaf tenants and defining a hierarchical resource entitlement for each of the at least two sub-tenants, said resource entitlement for each of the plurality of tenants including a guaranteed quantity of resources for said each of the plurality of tenants; a distributed resource manager for; determining that a first quantity of resources allocated for workloads of a first sub-tenant of said plurality of tenants is less than the guaranteed quantity of resources for said first sub-tenant; selecting a second sub-tenant from among said plurality of tenants based on a comparison of said guaranteed quantity of resources for said second sub-tenant and a quantity of resources allocated to workloads of said second sub-tenant; and interrupting processing a workload of said second sub-tenant prior to completing processing of said workload of said second sub-tenant and re-allocating the quantity of resources allocated to said workload of said second sub-tenant to a workload of said workloads of said first sub-tenant. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A non-transitory computer-readable medium storing instructions which when executed by at least one processor causes the at least one processor to:
-
receive a plurality of requests from a plurality of tenants for allocation of quantities of resources to workloads of said tenants, the plurality of tenants organized in a tiered hierarchy, the plurality of tenants including a root tenant associated with at least two sub-tenants, and each sub-tenant of the two sub-tenants associated with one or more corresponding leaf tenants; allocate said quantities of resources to said workloads of said plurality of tenants in accordance with a distribution policy, the distribution policy defining a resource entitlement for each of said one or more corresponding leaf tenants and defining a hierarchical resource entitlement for each of the at least two sub-tenants, said resource entitlement for each of the plurality of tenants including a guaranteed quantity of resources for said each of the plurality of tenants; and in response to determining that a first quantity of resources allocated for workloads of a first sub-tenant of said plurality of tenants is less than the guaranteed quantity of resources for said first sub-tenant; selecting a second sub-tenant from among said plurality of tenants based on a comparison of said guaranteed quantity of resources for said second sub-tenant and a quantity of resources allocated to workloads of said second sub-tenant, and interrupting processing of a workload of said second sub-tenant prior to completing processing of said workload of said second sub-tenant and re-allocating the quantity of resources allocated to said workload of said second sub-tenant to a workload of said workloads of said first sub-tenant.
-
Specification