Managing operational throughput for shared resources
Abstract
Usage of shared resources can be managed by enabling users to obtain different types of guarantees at different times for various types and/or levels of resource capacity. A user can select to have an amount or rate of capacity dedicated to that user. A user can also select reserved capacity for at least a portion of the requests, tasks, or program execution for that user, where the user has priority to that capacity but other users can utilize the excess capacity during other periods. Users can alternatively specify to use the excess capacity or other variable, non-guaranteed capacity. The capacity can be for any appropriate functional aspect of a resource, such as computational capacity, throughput, latency, bandwidth, and storage. Users can submit bids for various types and combinations of excess capacity, and winning bids can receive dedicated use of the excess capacity for at least a period of time.
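The bidding step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how winning bids are chosen, so the ranking rule here (highest price per unit first, all-or-nothing grants, one bid per user) is an assumption, and the names `Bid` and `allocate_excess` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Bid:
    user: str
    units: int    # capacity units requested (hypothetical unit)
    price: float  # price offered per unit (hypothetical)


def allocate_excess(excess_units: int, bids: list[Bid]) -> dict[str, int]:
    """Grant excess capacity to bids in descending price order.

    Assumptions (not from the source): a bid is granted in full or not
    at all, and each user submits at most one bid.
    """
    grants: dict[str, int] = {}
    remaining = excess_units
    # sorted() is stable, so equal-priced bids keep their submission order.
    for bid in sorted(bids, key=lambda b: b.price, reverse=True):
        if bid.units <= remaining:
            grants[bid.user] = bid.units
            remaining -= bid.units
    return grants
```

With 10 excess units and bids of 6 units at 1.0, 5 units at 2.0, and 4 units at 0.5, this sketch grants the 5-unit and 4-unit bids; the 6-unit bid no longer fits once the higher-priced bid has been served.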
20 Claims
1. A computer-implemented method for managing shared resources, comprising:
- receiving an instance request that includes a request parameter, the instance request associated with a user of a multi-tenant computing environment, the multi-tenant computing environment including a plurality of client devices in communication via a network with one or more servers and storage devices, the instance request indicating a data set and a performance specification for responding to data requests for the data set, the performance specification indicating a latency target to be met for responding to at least one of the data requests for the data set;
- determining an amount of latency of processing at least one of the data requests;
- determining that the amount of latency is greater than the latency target as indicated by the performance specification;
- identifying another storage device having additional capacity, the another storage device being associated with a resource usage request for the additional capacity;
- determining whether the request parameter satisfies the resource usage request;
- in response to determining that the request parameter satisfies the resource usage request, moving at least a subset of the data set from a current storage device to the another storage device; and
- in response to determining that the request parameter does not satisfy the resource usage request, denying the instance request.
Dependent claims: 2, 3, 4, 5, 6
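The control flow of claim 1 can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the claim does not define how a request parameter "satisfies" a resource usage request, so the sketch models the request parameter as a bid and the resource usage request as a minimum price. `StorageDevice`, `InstanceRequest`, `handle`, and the `"no-capacity"` outcome are hypothetical, and capacity is counted in data items for simplicity.

```python
from dataclasses import dataclass, field


@dataclass
class StorageDevice:
    name: str
    free_capacity: int  # hypothetical unit: number of data items
    min_bid: float      # assumption: stands in for the "resource usage request"
    data: set = field(default_factory=set)


@dataclass
class InstanceRequest:
    user: str
    data_set: set
    latency_target_ms: float  # the performance specification
    bid: float                # assumption: stands in for the "request parameter"


def handle(request, current, candidates, observed_latency_ms):
    """Return 'ok', 'moved', 'denied', or 'no-capacity'."""
    # The latency target is met: nothing to do.
    if observed_latency_ms <= request.latency_target_ms:
        return "ok"
    # Items of the data set that actually live on the current device.
    to_move = request.data_set & current.data
    # Identify another device with enough additional capacity.
    other = next(
        (d for d in candidates
         if d is not current and d.free_capacity >= len(to_move)),
        None,
    )
    if other is None:
        return "no-capacity"  # case not covered by the claim
    # Does the request parameter satisfy the resource usage request?
    if request.bid < other.min_bid:
        return "denied"
    current.data -= to_move
    other.data |= to_move
    current.free_capacity += len(to_move)
    other.free_capacity -= len(to_move)
    return "moved"
```

The sketch moves the whole overlapping subset at once; the claim only requires moving "at least a subset" of the data set, so a partial move would fit equally well.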
7. A computer system for managing shared resources, comprising:
- one or more processors; and
- memory, including instructions executable by the one or more processors to cause the computer system to at least:
  - receive an instance request that includes a request parameter, the instance request associated with a user of a multi-tenant computing environment, the multi-tenant computing environment including a plurality of client devices in communication via a network with one or more servers and storage devices, the instance request indicating a data set and a user-specified performance requirement for responding to data requests for the data set;
  - determine, based at least in part on the user-specified performance requirement for responding to the data requests in connection with the data set, whether one or more current data resources collectively storing the data set respond to the data requests in accordance with the user-specified performance requirement, the user-specified performance requirement indicating a latency target to be met for responding to the data requests;
  - determine that the one or more current data resources fail to respond to the data requests in accordance with the user-specified performance requirement based at least in part on the latency target as indicated by the user-specified performance requirement;
  - identify another storage device having additional capacity, the another storage device being associated with a resource usage request for the additional capacity;
  - determine whether the request parameter satisfies the resource usage request;
  - in response to determining that the request parameter satisfies the resource usage request, transfer data from at least one of the one or more current data resources to at least one second data resource; and
  - in response to determining that the request parameter does not satisfy the resource usage request, deny the instance request.
Dependent claims: 8, 9, 10, 11, 12
13. A non-transitory computer-readable storage media having collectively stored thereon executable instructions that, when executed by one or more processors of a computer system, cause the computer system to at least:
- receive an instance request that includes a request parameter, the instance request associated with a user of a multi-tenant computing environment, the multi-tenant computing environment including a plurality of client devices in communication via a network with one or more servers and storage devices, the instance request indicating a performance specification for responding to data requests, the performance specification indicating a latency target to be met for responding to the data requests;
- determine whether a current storage resource processing the data requests meets the performance specification based at least in part on an amount of latency of processing the data requests;
- identifying another storage device having additional capacity, the another storage device being associated with a resource usage request for the additional capacity;
- determining whether the request parameter satisfies the resource usage request;
- in response to determining that the request parameter satisfies the resource usage request, transfer responsibility for processing the data requests from the current storage resource to a second storage resource after determining the current storage resource fails to meet the performance specification, the second storage resource determined to meet the performance specification; and
- in response to determining that the request parameter does not satisfy the resource usage request, denying the instance request.
Dependent claims: 14, 15, 16, 17, 18, 19, 20
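Claim 13 differs from claim 1 in that it transfers responsibility for processing requests rather than moving data, and requires the second storage resource to meet the performance specification. A minimal sketch of that variant follows. It assumes responsibility is tracked in a routing table, that latency is judged by the mean of recent samples, and that the request parameter satisfies the resource usage request when it reaches a numeric threshold; none of these details come from the source, and `meets_spec` and `reassign` are hypothetical names.

```python
from statistics import mean


def meets_spec(latencies_ms, target_ms):
    """Assumption: a resource meets the spec if its mean sampled latency
    is at or below the target. A resource with no samples is treated as
    not meeting the spec."""
    return bool(latencies_ms) and mean(latencies_ms) <= target_ms


def reassign(routes, key, samples, target_ms, request_param, usage_threshold):
    """Update routes[key] in place; return the outcome as a string.

    routes:  maps a data-request key to the responsible resource name
    samples: maps each resource name to its recent latency samples (ms)
    """
    current = routes[key]
    if meets_spec(samples[current], target_ms):
        return "unchanged"
    # Pick the first other resource that is determined to meet the spec.
    second = next(
        (r for r in samples
         if r != current and meets_spec(samples[r], target_ms)),
        None,
    )
    if second is None:
        return "no-candidate"  # case not covered by the claim
    if request_param < usage_threshold:
        return "denied"
    routes[key] = second  # transfer responsibility; no data is copied here
    return "transferred"
```

Keeping the routing table separate from the stored data is a design choice of this sketch: it makes the transfer a single assignment, at the cost of assuming the second resource can already serve the requests.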
Specification