×

Load shedding techniques for distributed services with persistent client connections to ensure quality of service

  • US 10,346,367 B1
  • Filed: 04/30/2015
  • Issued: 07/09/2019
  • Est. Priority Date: 04/30/2015
  • Status: Active Grant
First Claim
Patent Images

1. A system, comprising:

  • a plurality of access nodes (ANs) of a file storage service implemented at a provider network, including a first AN in a first AN peer group (APG) collectively responsible for processing received client requests directed to at least a first file system instance;

    wherein the first AN implements a plurality of load shedding analysis iterations (LSAIs), wherein a particular LSAI of the plurality of LSAIs comprises;

    determining that a workload level associated with one or more persistent client connections (PCCs) meets a triggering condition for initiating load shedding, wherein each PCC of the one or more PCCs is established to process a plurality of client requests;

    selecting at least a first PCC of the one or more PCCs as a candidate for termination;

    examining a workload metric cache associated with the first APG, wherein the cache is populated based at least in part on one or more updates received from a workload information distributor, and wherein the cache comprises one or more metrics including at least a thread pool utilization metric of a different AN of the first APG;

    determining, based at least in part on said examining, that one or more ANs of the APG meet an available-capacity criterion; and

    initiating, based at least in part on said determining that one or more ANs of the APG meet the available-capacity criterion, a phased termination of the first PCC, wherein the phased termination comprises allowing completion of in-flight requests on the first PCC, and rejecting new requests on the first PCC before the phased termination of the first PCC is completed.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×