Resilience as a service
First Claim
1. A system, comprising:
- at least one processor running instances of two or more services, wherein instances of a first service are dependent upon instances of a second service;
a fault-injection manager that introduces a fault-inducing condition in the system;
a health monitoring manager that detects failures in the system; and
wherein the fault-injection manager identifies the first service as vulnerable to the fault-inducing condition when the fault-inducing condition causes an instance of the second service to fail,wherein the fault-injection manager introduces the fault-inducing condition in a guided manner, wherein a scope of the fault-inducing condition is increased in steps over time.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods disclosed herein are directed to creating a service directory of dependencies for services running on a system, wherein instances of a first service are dependent upon instances of a second service. The directory of dependencies comprises metadata associated with connections between the services. The system injects faults targeting all levels of the dependencies. The system is monitored to detect failures created by the faults. The injected faults are selected from transport layer faults, memory pressure, processor pressure, storage pressure, virtual machine restart, and virtual machine shut down. A domain name service is monitored to identify names that are resolved for the services. The service directory is then updated continuously with additional dependencies using information about the resolved names. The faults may be injected in a guided manner, wherein the scope of the faults is increased in steps over time to identify a failure point in the system.
10 Citations
20 Claims
-
1. A system, comprising:
-
at least one processor running instances of two or more services, wherein instances of a first service are dependent upon instances of a second service; a fault-injection manager that introduces a fault-inducing condition in the system; a health monitoring manager that detects failures in the system; and wherein the fault-injection manager identifies the first service as vulnerable to the fault-inducing condition when the fault-inducing condition causes an instance of the second service to fail, wherein the fault-injection manager introduces the fault-inducing condition in a guided manner, wherein a scope of the fault-inducing condition is increased in steps over time. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method, comprising:
-
creating a service directory of dependencies for services running on a system, wherein instances of a first service are dependent upon instances of a second service, the directory of dependencies further comprising metadata associated with connections between the services; continuously updating the service directory with additional dependencies, based on results of monitoring of a domain name service resolving names for the services; injecting faults in the system, the faults targeting a plurality of levels of the dependencies; and monitoring the system to detect failures created by the faults. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A system, comprising:
- a processor; and
a memory containing a program, which, when executed on the processor, performs an operation for identifying an impact of system fault conditions, the operation comprising; creating a service directory of dependencies for services running on the system, wherein instances of a first service are dependent upon instances of a second service, the directory of dependencies further comprising metadata associated with connections between the services; continuously updating the service directory with additional dependencies, based on results of monitoring of a domain name service resolving names for the services; injecting faults in the system, the faults targeting all levels of the dependencies; and monitoring the system to detect failures created by the faults. - View Dependent Claims (18, 19, 20)
- a processor; and
Specification