System and method for in-service diagnostics based on health signatures
First Claim
1. A method for providing diagnostics in an information handling system having a non-transitory computer readable storage medium, the method comprising:
sensing that the information handling system is resuming from a system crash; and
obtaining a crash health signature, wherein the crash health signature comprises crash data;
determining a type of crash based, at least in part, on the crash data of the crash health signature, wherein the crash health signature is indicative of at least one of memory corruption, hardware configuration, and software configuration;
tracking one or more parameters indicative of health of the information handling system;
storing, on the non-transitory computer readable storage medium, data related to the plurality of parameters;
generating one or more statistics based, at least in part, on the plurality of parameters;
selecting, based on at least one of, the crash data and at least one of the plurality of parameters, a diagnostic test to execute;
scheduling the selected diagnostic test;
executing the selected diagnostic test; and
obtaining a validation code from the selected diagnostic test.
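The steps of claim 1 can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the patented implementation: the `fault_code` key, the fault-code names, the placeholder test functions, the 85-degree threshold, and the convention that a validation code of 0 means "pass" are all hypothetical, and persistence to the storage medium is omitted for brevity.

```python
from collections import deque
from dataclasses import dataclass
from statistics import mean
from typing import Callable, Dict, List, Tuple


@dataclass
class CrashHealthSignature:
    # Hypothetical layout; the claim only says the signature comprises crash data.
    crash_data: Dict[str, str]


# Hypothetical fault codes mapped to the three crash types named in the claim.
CRASH_TYPES = {
    "MEM_FAULT": "memory_corruption",
    "HW_MISMATCH": "hardware_configuration",
    "DRIVER_CONFLICT": "software_configuration",
}


def determine_crash_type(sig: CrashHealthSignature) -> str:
    """Classify the crash from its crash data; 'unknown' if no rule matches."""
    return CRASH_TYPES.get(sig.crash_data.get("fault_code", ""), "unknown")


# Placeholder diagnostics; each returns a validation code (0 = pass, by assumption).
def memory_test() -> int:
    return 0


def hardware_inventory_test() -> int:
    return 0


def software_config_test() -> int:
    return 0


def thermal_test() -> int:
    return 0


TESTS_BY_CRASH_TYPE: Dict[str, Callable[[], int]] = {
    "memory_corruption": memory_test,
    "hardware_configuration": hardware_inventory_test,
    "software_configuration": software_config_test,
}


def select_test(crash_type: str, stats: Dict[str, float]) -> Callable[[], int]:
    """Select a test from the crash type, falling back to tracked statistics."""
    if crash_type in TESTS_BY_CRASH_TYPE:
        return TESTS_BY_CRASH_TYPE[crash_type]
    # Assumed threshold: a hot CPU with an unclassified crash triggers a thermal test.
    if stats.get("cpu_temp_c", 0.0) > 85.0:
        return thermal_test
    return hardware_inventory_test


def diagnose_on_resume(
    sig: CrashHealthSignature, samples: Dict[str, List[float]]
) -> Tuple[str, str, int]:
    """Classify the crash, compute statistics, then select, schedule, and run a test."""
    crash_type = determine_crash_type(sig)
    stats = {name: mean(vals) for name, vals in samples.items() if vals}
    schedule = deque([select_test(crash_type, stats)])  # trivial one-entry schedule
    test = schedule.popleft()
    return crash_type, test.__name__, test()
```

Calling `diagnose_on_resume(CrashHealthSignature({"fault_code": "MEM_FAULT"}), {"cpu_temp_c": [60.0, 62.0]})` returns the crash type, the name of the selected test, and its validation code.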
Abstract
In accordance with the present disclosure, a system and method are herein disclosed for providing in-service diagnostics based on health signatures of an information handling system. In one embodiment, the health of an information handling system is periodically probed by a software agent running as a service. The service polls a set of probes from interfaces on the host system and tracks parameters indicative of system and component health and performance. An analytics engine processes the data to identify anomalies and outliers, and to detect signatures that are precursors to potential service incidents. A rule set matches signatures to potential actions. Based on the detected signatures and rule sets, the service is operable to select, schedule, and execute one or more diagnostic tests. If an information handling system or component thereof fails a diagnostic test, the results are archived to persistent storage and communicated to an administrator.
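The probe, analytics, and rule-set flow described in the abstract can be illustrated with a minimal Python sketch. Everything here is an assumption for illustration: the abstract does not say how anomalies are detected, so a simple z-score outlier check stands in for the analytics engine, and the parameter names (`cpu_temp_c`, `disk_latency_ms`) and test names in `RULES` are hypothetical.

```python
from statistics import mean, stdev
from typing import Dict, List


def is_outlier(history: List[float], value: float, threshold: float = 3.0) -> bool:
    """Flag a sample whose z-score against the stored history exceeds the threshold."""
    if len(history) < 2:
        return False  # not enough data to estimate spread
    sd = stdev(history)
    if sd == 0:
        return value != history[0]
    return abs(value - mean(history)) / sd > threshold


# Hypothetical rule set: an anomalous parameter (the "signature") maps to a test.
RULES: Dict[str, str] = {
    "cpu_temp_c": "thermal_stress_test",
    "disk_latency_ms": "disk_surface_scan",
}


def detect_signatures(
    history: Dict[str, List[float]], latest: Dict[str, float]
) -> List[str]:
    """Return the names of parameters whose latest sample is an outlier."""
    return [
        name
        for name, value in latest.items()
        if is_outlier(history.get(name, []), value)
    ]


def actions_for(signatures: List[str]) -> List[str]:
    """Match detected signatures to diagnostic tests through the rule set."""
    return [RULES[s] for s in signatures if s in RULES]


history = {
    "cpu_temp_c": [50.0, 51.0, 49.0, 50.0, 52.0],
    "disk_latency_ms": [5.0, 6.0, 5.0, 6.0, 5.0],
}
latest = {"cpu_temp_c": 90.0, "disk_latency_ms": 5.5}
signatures = detect_signatures(history, latest)
tests_to_run = actions_for(signatures)
```

In this example only the temperature reading departs sharply from its history, so the rule set selects the single thermal test. A real service would poll probes on a timer, persist the history, and schedule the selected tests rather than just listing them.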
20 Claims
1. A method for providing diagnostics in an information handling system having a non-transitory computer readable storage medium, the method comprising:
sensing that the information handling system is resuming from a system crash; and
obtaining a crash health signature, wherein the crash health signature comprises crash data;
determining a type of crash based, at least in part, on the crash data of the crash health signature, wherein the crash health signature is indicative of at least one of memory corruption, hardware configuration, and software configuration;
tracking one or more parameters indicative of health of the information handling system;
storing, on the non-transitory computer readable storage medium, data related to the plurality of parameters;
generating one or more statistics based, at least in part, on the plurality of parameters;
selecting, based on at least one of, the crash data and at least one of the plurality of parameters, a diagnostic test to execute;
scheduling the selected diagnostic test;
executing the selected diagnostic test; and
obtaining a validation code from the selected diagnostic test.
Dependent claims: 2, 3, 4, 5, 6
7. A non-transitory, computer readable storage medium comprising executable instructions to cause at least one processor, of an information handling system, coupled to the computer readable storage medium to:
sense that the information handling system is resuming from a system crash; and
obtain a crash health signature, wherein the crash health signature comprises crash data;
determine a type of crash based, at least in part, on the crash data of the crash health signature, wherein the crash health signature is indicative of at least one of memory corruption, hardware configuration, and software configuration;
track one or more parameters indicative of health of the information handling system;
store, on the non-transitory computer readable storage medium, data related to the plurality of parameters;
generate one or more statistics based, at least in part, on the plurality of parameters;
select, based on at least one of, the crash data and at least one of the plurality of parameters, a diagnostic test to execute;
schedule the selected diagnostic test;
execute the selected diagnostic test; and
obtain a validation code from the selected diagnostic test.
Dependent claims: 8, 9, 10, 11, 12, 13
14. An information handling system, comprising:
at least one processor; and
a non-transitory, computer readable storage medium coupled to the at least one processor, wherein the non-transitory, computer readable storage medium comprises instructions that cause the at least one processor to:
sense that the information handling system is resuming from a system crash; and
obtain a crash health signature, wherein the crash health signature comprises crash data;
determine a type of crash based, at least in part, on the crash data of the crash health signature, wherein the crash health signature is indicative of at least one of memory corruption, hardware configuration, and software configuration;
track one or more parameters indicative of health of the information handling system;
store, on the non-transitory computer readable storage medium, data related to the plurality of parameters;
generate one or more statistics based, at least in part, on the plurality of parameters;
select, based on at least one of, the crash data and at least one of the plurality of parameters, a diagnostic test to execute;
schedule the selected diagnostic test;
execute the selected diagnostic test; and
obtain a validation code from the selected diagnostic test.
Dependent claims: 15, 16, 17, 18, 19, 20
Specification