Management system for outputting information denoting recovery method corresponding to root cause of failure
Abstract
A management server stores, for events that can occur in a plurality of node apparatuses, a meta rule for identifying the event that is the root cause and a failure recovery method that corresponds to the meta rule. The management server displays the cause event identified as the root cause of an event it has detected, together with a method for recovering from this cause event.
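The association between a meta rule and its recovery method can be pictured as a lookup keyed by the rule rather than by any particular node. The following is a minimal sketch under stated assumptions, not the patented implementation: the names `MetaRule`, `RecoveryRegistry`, `register`, and `lookup`, as well as the event type strings and the recovery text, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class MetaRule:
    """Hypothetical meta rule: event types only, no node identifiers."""
    rule_id: str
    potential_event_types: Tuple[str, ...]
    root_cause_event_type: str


class RecoveryRegistry:
    """Hypothetical store mapping a meta rule to its meta recovery method."""

    def __init__(self) -> None:
        self._by_rule_id: Dict[str, str] = {}

    def register(self, rule: MetaRule, recovery_method: str) -> None:
        # Key on the meta rule, so the method applies to any node it covers.
        self._by_rule_id[rule.rule_id] = recovery_method

    def lookup(self, rule: MetaRule) -> Optional[str]:
        # Returns None when no method has been registered for this rule yet.
        return self._by_rule_id.get(rule.rule_id)


# Illustrative data (assumed, not taken from the patent).
registry = RecoveryRegistry()
rule = MetaRule("MetaRule1", ("server_io_error",), "storage_controller_fault")
registry.register(rule, "Replace the failed controller, then rescan paths")
```

Because the key is the meta rule, a method registered after a first failure is found again when a later failure on a different node is resolved through the same rule.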
75 Citations
9 Claims
1. A computer system comprising:
- at least one node apparatus; and
- a management system comprising one or more computers, and configured to detect an event that occurs in the node apparatus,
wherein the management system is configured to store event information and meta rule information,
wherein the event information includes an event entry representing an event identifier of an event that has occurred in a certain node apparatus, and a node identifier of the certain node apparatus in which the event occurred,
wherein the meta rule information includes a meta rule representing, without including an identifier of the node apparatus, a potential event type that could potentially occur in the node apparatus and a root cause event type that can be identified as a root cause in a case where an event corresponding to the potential event type occurs,
wherein the meta rule includes an expanded rule, which is an expanded Root Cause Analysis (RCA) rule,
wherein the management system is configured:
(A) to identify a first cause event, which is the root cause of a first event identified by the event entry based on the meta rule information, and to identify a first meta rule used in the identification of the first cause event;
(B) to receive via an input device, after the identification of the first cause event, a meta recovery method, which is a method for recovering from the first cause event, and to register the meta recovery method to correspond to the first meta rule identified in (A);
(C′) to create the expanded rule based on the meta rule information and topology information of the at least one node apparatus;
(C) to identify a second cause event, which is the root cause of a second event identified by the event entry based on the meta rule information, and to identify a second meta rule used in the identification of the second cause event;
(D) to identify a particular meta recovery method registered in the management system, which corresponds to the second meta rule identified in (C);
(E) to display the particular meta recovery method with information about the second cause event; and
(F) to display, before the reception of the meta recovery method in (B), a meta recovery method registration screen including:
- an element group area showing various types of element icons for describing the meta recovery method,
- a meta recovery method edit area showing element icons which are dragged and dropped from the element group area to represent a meta recovery method being edited, and
- an element detail configuration area showing detail of the element,
wherein the element detail configuration area for a first type element and the element detail configuration area for a second type element different from the first type element are different.
Dependent claims: 2, 3
4. A management system for detecting an event that occurs in a node apparatus, comprising:
- a memory storing event information and meta rule information,
wherein the event information includes an event entry representing an event identifier of an event that has occurred in a certain node apparatus, and a node identifier of the certain node apparatus in which the event occurred,
wherein the meta rule information includes a meta rule representing, without including an identifier of the node apparatus, a potential event type that could potentially occur in the node apparatus and a root cause event type that can be identified as a root cause in a case where an event corresponding to the potential event type occurs,
wherein the meta rule includes an expanded rule, which is an expanded Root Cause Analysis (RCA) rule, and
- a processor configured:
(A) to identify a first cause event, which is the root cause of a first event identified by the event entry based on the meta rule information, and to identify a first meta rule used in the identification of the first cause event;
(B) to receive via an input device, after the identification of the first cause event, a meta recovery method, which is a method for recovering from the first cause event, and to register the meta recovery method to correspond to the first meta rule identified in (A);
(C′) to create the expanded rule based on the meta rule information and topology information of the at least one node apparatus;
(C) to identify a second cause event, which is the root cause of a second event identified by the event entry based on the meta rule information, and to identify a second meta rule used in the identification of the second cause event;
(D) to identify a particular meta recovery method registered in the management system, which corresponds to the second meta rule identified in (C);
(E) to display the particular meta recovery method with information about the second cause event; and
(F) to display, before the reception of the meta recovery method in (B), a meta recovery method registration screen including:
- an element group area showing various types of element icons for describing the meta recovery method,
- a meta recovery method edit area showing element icons which are dragged and dropped from the element group area to represent a meta recovery method being edited, and
- an element detail configuration area showing detail of the element,
wherein the element detail configuration area for a first type element and the element detail configuration area for a second type element different from the first type element are different.
Dependent claims: 5, 6
7. A non-transitory computer-readable storage medium storing a program for detecting an event that occurs in a node apparatus, wherein the program, when executed by a management system, performs a method, the management system comprising a memory storing event information and meta rule information,
wherein the event information includes an event entry representing an event identifier of an event that has occurred in a certain node apparatus, and a node identifier of the certain node apparatus in which the event occurred,
wherein the meta rule information includes a meta rule representing, without including an identifier of the node apparatus, a potential event type that could potentially occur in the node apparatus and a root cause event type that can be identified as a root cause in a case where an event corresponding to the potential event type occurs,
wherein the meta rule includes an expanded rule, which is an expanded Root Cause Analysis (RCA) rule,
the method comprising:
(A) identifying a first cause event, which is the root cause of a first event identified by the event entry based on the meta rule information, and identifying a first meta rule used in the identification of the first cause event;
(B) receiving via an input device, after the identification of the first cause event, a meta recovery method, which is a method for recovering from the first cause event, and registering the meta recovery method to correspond to the first meta rule identified in (A);
(C′) creating the expanded rule based on the meta rule information and topology information of the at least one node apparatus;
(C) identifying a second cause event, which is the root cause of a second event identified by the event entry based on the meta rule information, and identifying a second meta rule used in the identification of the second cause event;
(D) identifying a particular meta recovery method registered in the management system, which corresponds to the second meta rule identified in (C);
(E) displaying the particular meta recovery method with information about the second cause event; and
(F) displaying, before the reception of the meta recovery method in (B), a meta recovery method registration screen including:
- an element group area showing various types of element icons for describing the meta recovery method,
- a meta recovery method edit area showing element icons which are dragged and dropped from the element group area to represent a meta recovery method being edited, and
- an element detail configuration area showing detail of the element,
wherein the element detail configuration area for a first type element and the element detail configuration area for a second type element different from the first type element are different.
Dependent claims: 8, 9
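Steps (C′) and (C) of the independent claims can be illustrated with a small sketch: a meta rule written in terms of node types is combined with topology information to produce expanded rules that name concrete nodes, and received events are then matched against those expanded rules. Everything here is an assumption made for illustration: the data layout, the names `MetaRule`, `ExpandedRule`, `expand`, and `find_root_cause`, the representation of topology as a list of node-type-to-node-identifier mappings, and the simplification that a rule matches only when all of its condition events are present.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List, Optional, Set, Tuple

TypedEvent = Tuple[str, str]  # (node type, event type), no node identifier
NodeEvent = Tuple[str, str]   # (node identifier, event type)


@dataclass(frozen=True)
class MetaRule:
    rule_id: str
    conditions: Tuple[TypedEvent, ...]  # potential event types
    root_cause: TypedEvent              # root cause event type


@dataclass(frozen=True)
class ExpandedRule:
    meta_rule_id: str                   # remembers which meta rule it came from
    conditions: FrozenSet[NodeEvent]
    root_cause: NodeEvent


def expand(rule: MetaRule, topology: List[Dict[str, str]]) -> List[ExpandedRule]:
    """Create one expanded rule per topology entry that covers the rule's node types."""
    needed = {node_type for node_type, _ in rule.conditions} | {rule.root_cause[0]}
    expanded = []
    for path in topology:  # each path maps a node type to a concrete node identifier
        if not needed.issubset(path):
            continue
        expanded.append(ExpandedRule(
            meta_rule_id=rule.rule_id,
            conditions=frozenset((path[t], e) for t, e in rule.conditions),
            root_cause=(path[rule.root_cause[0]], rule.root_cause[1]),
        ))
    return expanded


def find_root_cause(events: Set[NodeEvent],
                    rules: List[ExpandedRule]) -> Optional[ExpandedRule]:
    """Return the first expanded rule whose conditions all appear in the events."""
    for rule in rules:
        if rule.conditions <= events:
            return rule
    return None


# Illustrative data (assumed): two servers attached to one storage apparatus.
meta = MetaRule(
    rule_id="MR1",
    conditions=(("server", "io_error"), ("storage", "controller_fault")),
    root_cause=("storage", "controller_fault"),
)
topology = [
    {"server": "srvA", "storage": "stg1"},
    {"server": "srvB", "storage": "stg1"},
]
expanded_rules = expand(meta, topology)
events = {("srvB", "io_error"), ("stg1", "controller_fault")}
match = find_root_cause(events, expanded_rules)
```

In this sketch the matched expanded rule carries `meta_rule_id`, which is what would allow a recovery method registered against the meta rule to be found for a failure on a different node, as in steps (D) and (E).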
Specification