Specifying a highly-resilient system in a disaggregated compute environment
First Claim
1. A method for assigning resources in a compute environment, comprising:
- providing a set of server resource pools, wherein a server resource pool comprises a set of resources of a common type;
for a given tenant, defining a server entity composed of one or more resources selected from one or more of the server resource pools, wherein the one or more resources are selected from the one or more of the server resource pools based on a projected workload and a resiliency requirement;
receiving information collected from monitoring health of the one or more resources in the server entity as an actual workload is processed for the given tenant; and
based on the monitoring indicating a change in health of a resource in the server entity, adjusting a composition of the server entity to attempt to maintain the resiliency requirement.
1 Assignment
0 Petitions
Accused Products
Abstract
Server resources in a data center are disaggregated into shared server resource pools. Servers are constructed dynamically, on-demand and based on workload requirements and a tenant'"'"'s resiliency requirements (e.g., as specified in an SLA), by allocating from these resource pools. A disaggregated compute system of this type keeps track of resources that are available in the shared server resource pools, and it manages those resources based on that information and the health of the resources. As a workload is processed by the server entity and component resources fail, the server entity composition is changed, e.g. by allocating other resources to the server entity, or by transitioning to other server entities, to ensure that a resiliency requirement is maintained.
-
Citations
25 Claims
-
1. A method for assigning resources in a compute environment, comprising:
-
providing a set of server resource pools, wherein a server resource pool comprises a set of resources of a common type; for a given tenant, defining a server entity composed of one or more resources selected from one or more of the server resource pools, wherein the one or more resources are selected from the one or more of the server resource pools based on a projected workload and a resiliency requirement; receiving information collected from monitoring health of the one or more resources in the server entity as an actual workload is processed for the given tenant; and based on the monitoring indicating a change in health of a resource in the server entity, adjusting a composition of the server entity to attempt to maintain the resiliency requirement. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. Apparatus for assigning resources in a compute environment, comprising:
-
one or more hardware processors; computer memory holding computer program instructions executed by the hardware processors and operative to; manage a set of server resource pools, wherein a server resource pool comprises a set of resources of a common type; for a given tenant, define a server entity composed of one or more resources selected from one or more of the server resource pools, wherein the one or more resources are selected from the one or more of the server resource pools based on a projected workload and a resiliency requirement; receive information collected from monitoring health of the one or more resources in the server entity as an actual workload is processed for the given tenant; and based on the monitoring indicating a change in health of a resource in the server entity, adjust a composition of the server entity to attempt to maintain the resiliency requirement. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer program product in a non-transitory computer readable medium for use in a data processing system for assigning resources in a compute environment, the computer program product holding computer program instructions executed in the data processing system and operative to:
-
manage a set of server resource pools, wherein a server resource pool comprises a set of resources of a common type; for a given tenant, define a server entity composed of one or more resources selected from one or more of the server resource pools, wherein the one or more resources are selected from the one or more of the server resource pools based on a projected workload and a resiliency requirement; receive information collected from monitoring health of the one or more resources in the server entity as an actual workload is processed for the given tenant; and based on the monitoring indicating a change in health of a resource in the server entity, adjust a composition of the server entity to attempt to maintain the resiliency requirement. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
-
25. A data center facility, comprising:
-
a set of server resource pools that comprise a compute pool, and a memory pool; a disaggregated compute system comprising processors selected from the compute pool, computer memories selected from the memory pool, and an optical interconnect, the disaggregated compute system being configured to meet a resiliency requirement associated with a tenant, the resiliency requirement being associated with a tenant'"'"'s service level agreement (SLA); and a resiliency manager executing in a hardware element and responsive to a failure in one or more resources in the disaggregated compute system as the tenant'"'"'s workload is processed to selectively adjust a composition of the disaggregate compute system to maintain the resiliency requirement.
-
Specification