IDENTIFYING TROUBLESHOOTING OPTIONS FOR RESOLVING NETWORK FAILURES
First Claim
1. A method comprising:
receiving an alarm, the alarm comprising failure conditions that are indicative of a network failure;
responsive to receiving the alarm, accessing historical data, the historical data comprising a failure symptom of the network failure and troubleshooting options previously undertaken to mitigate the network failure;
mapping the failure conditions of the alarm to the failure symptom in the historical data;
responsive to mapping the failure conditions of the alarm to the failure symptom, identifying the troubleshooting options;
assigning respective labels to the troubleshooting options, the labels indicative of respective probabilities that the troubleshooting options will mitigate the failure symptom; and
outputting the plurality of troubleshooting options and their respective labels.
Abstract
Described herein are various technologies pertaining to providing assistance to an operator in a data center with respect to failures in the data center. An alarm is received, and a failing device is identified based upon content of the alarm. Failure conditions of the alarm are mapped to a failure symptom that may be exhibited by the failing device, and troubleshooting options previously employed to mitigate the failure symptom are retrieved from historical data. Labels are respectively assigned to the troubleshooting options, where a label is indicative of a probability that a troubleshooting option to which the label has been assigned will mitigate the failure symptom.
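The labeling step can be illustrated with a minimal Python sketch. The abstract does not say how the probabilities are computed, so the Laplace-smoothed success rate used here is an assumption, as are the `OptionRecord` type, the `label_options` function, and the sample history.

```python
from dataclasses import dataclass


@dataclass
class OptionRecord:
    """Hypothetical historical record for one troubleshooting option."""
    name: str
    attempts: int   # times the option was tried for this failure symptom
    successes: int  # times it mitigated the failure symptom


def label_options(records):
    """Return (option name, estimated probability) pairs, most promising first.

    Assumption: the label is a Laplace-smoothed success rate. The source
    does not state how the probabilities are derived.
    """
    labeled = [
        (r.name, (r.successes + 1) / (r.attempts + 2))
        for r in records
    ]
    return sorted(labeled, key=lambda pair: pair[1], reverse=True)


# Illustrative data only; not taken from the source.
history = [
    OptionRecord("reboot device", attempts=8, successes=6),
    OptionRecord("replace cable", attempts=3, successes=1),
    OptionRecord("reseat line card", attempts=0, successes=0),
]

for name, probability in label_options(history):
    print(f"{name}: {probability:.2f}")
```

Smoothing keeps an untried option from being labeled with a probability of exactly 0 or 1; other estimators would fit the description equally well.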
20 Claims
12. A resolution system that facilitates resolving network failures in a data center, the resolution system comprising:
a processor; and
a memory that comprises a plurality of components that are executed by the processor, the plurality of components comprising:
an alarm receiver component that receives an alarm, the alarm indicative of a network failure in the data center; and
a resolution identifier component that, responsive to the alarm receiver component receiving the alarm, outputs troubleshooting options for resolving the network failure, the troubleshooting options having labels respectively assigned thereto that are indicative of confidences that the troubleshooting options, when performed by an operator of the data center, will resolve the network failure.
20. A computer-readable storage medium comprising instructions that, when executed by a processor, cause the processor to perform acts comprising:
receiving an alarm, the alarm comprises failure conditions that are indicative of a network failure in a data center;
responsive to receiving the alarm, identifying a failing device that causes the network failure, the failing device identified based upon the failure conditions;
responsive to identifying the failing device, accessing a failure history table for the failing device, the failure history table comprises a failure symptom previously exhibited by the failing device and troubleshooting options previously employed to mitigate the troubleshooting symptom;
mapping the failure conditions of the alarm to the failure symptom in the failure history table;
retrieving the troubleshooting options responsive to the mapping of the failure conditions to the failure symptom; and
outputting the troubleshooting options and respective labels for the troubleshooting options, the labels indicative of respective confidences that the troubleshooting options, when employed by an operator, will mitigate the failure symptom.
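The acts of claim 20 can be sketched in Python under stated assumptions. The table layout, the device and condition names, the confidence values, and the subset-based mapping rule are all hypothetical; the claim does not specify how the failing device is identified or how conditions are matched to a symptom.

```python
# Hypothetical failure history table, keyed by device identifier.
# Each entry pairs a failure symptom (a set of conditions) with the
# troubleshooting options previously employed and an assumed confidence.
FAILURE_HISTORY = {
    "switch-17": [
        {
            "symptom": frozenset({"link_down", "high_packet_loss"}),
            "options": [("reboot device", 0.3), ("replace cable", 0.6)],
        },
        {
            "symptom": frozenset({"fan_failure"}),
            "options": [("replace fan tray", 0.9)],
        },
    ],
}


def identify_failing_device(alarm):
    """Assumption: the alarm names its source device directly."""
    return alarm["device"]


def troubleshooting_options(alarm, history=FAILURE_HISTORY):
    """Return labeled options for the first symptom the alarm matches."""
    device = identify_failing_device(alarm)
    conditions = set(alarm["conditions"])
    for entry in history.get(device, []):
        # Assumed mapping rule: every condition in the recorded symptom
        # must be present among the alarm's failure conditions.
        if entry["symptom"] <= conditions:
            return sorted(entry["options"], key=lambda o: o[1], reverse=True)
    return []  # unknown device or no matching symptom


alarm = {
    "device": "switch-17",
    "conditions": ["high_packet_loss", "link_down", "crc_errors"],
}
print(troubleshooting_options(alarm))
```

Keying the table by device mirrors the claim's "failure history table for the failing device"; an alarm from an unknown device, or one whose conditions match no recorded symptom, yields an empty list in this sketch.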
Specification