Determination of related failure events in a multi-node system
First Claim
20. A processor-readable media having stored thereon processor executable instructions for performing acts comprising:
obtaining event data from each of a plurality of nodes in a node group;
identifying a multi-node event burst occurring in the node group based on the event data;
determining relationship strength values for pairs of nodes based on the identified multi-node event burst; and
determining at least one cluster of related nodes based on the determined relationship strength values.
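The last two acts of the claim can be illustrated with a minimal sketch. The claim does not say how a relationship strength value is computed or how clusters are formed, so everything here is an assumption made for illustration: strength is taken to be the number of multi-node event bursts in which both nodes of a pair appear, and a cluster is taken to be a connected component over pairs whose strength meets a hypothetical `threshold`. The function names and sample node names are likewise invented.

```python
from collections import defaultdict
from itertools import combinations


def relationship_strengths(bursts):
    """Count, for each node pair, how many multi-node bursts contain both nodes.

    Assumption: co-occurrence count stands in for "relationship strength".
    """
    strengths = defaultdict(int)
    for nodes in bursts:
        for a, b in combinations(sorted(set(nodes)), 2):
            strengths[(a, b)] += 1
    return dict(strengths)


def cluster_nodes(strengths, threshold):
    """Group nodes into connected components over pairs with strength >= threshold.

    Assumption: thresholded connectivity stands in for "cluster of related nodes".
    Nodes with no pair at or above the threshold are left out of the result.
    """
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for (a, b), strength in strengths.items():
        if strength >= threshold:
            parent[find(a)] = find(b)

    groups = defaultdict(set)
    for node in list(parent):
        groups[find(node)].add(node)
    return sorted((sorted(g) for g in groups.values()), key=lambda g: g[0])


# Each set lists the nodes that took part in one multi-node event burst.
bursts = [{"n1", "n2"}, {"n1", "n2", "n3"}, {"n3", "n4"}, {"n1", "n2"}]
strengths = relationship_strengths(bursts)
clusters = cluster_nodes(strengths, threshold=2)
print(clusters)
```

With this sample data only the pair (n1, n2) co-occurs at least twice, so a threshold of 2 yields a single two-node cluster, while a threshold of 1 links all four nodes.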
Abstract
Systems and methods for determining related node failures in a multi-node system use log data obtained from the nodes. This log data is processed in various ways to indicate clusters of nodes that experience related failures.
54 Claims
23. A processor-readable media as recited in claim 22, wherein identifying a single-node event burst on a node comprises identifying outage events on a node occurring within a specified time frame of one another.
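One possible reading of "outage events occurring within a specified time frame of one another" is sketched below. It assumes timestamps are plain numbers (for example, seconds), that consecutive events no more than a hypothetical `window` apart belong to the same burst, and that an isolated event forms a burst of one; the claim itself fixes none of these details.

```python
def single_node_bursts(timestamps, window):
    """Group one node's outage timestamps into bursts.

    Assumption: an event joins the current burst when it occurs no more
    than `window` time units after the previous event in that burst.
    """
    bursts = []
    for t in sorted(timestamps):
        if bursts and t - bursts[-1][-1] <= window:
            bursts[-1].append(t)
        else:
            bursts.append([t])
    return bursts


# Outage times for one node, in seconds (illustrative values only).
events = [0, 30, 50, 400, 420, 1000]
node_bursts = single_node_bursts(events, window=60)
print(node_bursts)  # [[0, 30, 50], [400, 420], [1000]]
```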
34. A processor-readable media having stored thereon processor executable instructions for performing operations comprising:
identifying a multi-node event burst occurring in a node group based on the event data collected from each of a plurality of nodes in a node group;
determining a cluster of related nodes based on the identified multi-node event burst; and
determining at least one metric for cluster.
40. A computer system comprising:
means for obtaining event data from each of a plurality of nodes of a group of associated nodes;
means for identifying single-node event bursts occurring on the nodes based on the obtained event data;
means for identifying a multi-node event bursts occurring in the group of associated nodes based on the identified single-node bursts.
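As a rough illustration of the last element, single-node bursts might be combined into multi-node bursts by merging bursts that overlap in time. The sketch assumes each single-node burst is a hypothetical `(node, start, end)` tuple and that a multi-node burst is any group of time-overlapping bursts spanning at least two distinct nodes; the claim does not prescribe this representation or rule.

```python
def multi_node_bursts(single_bursts):
    """Merge time-overlapping single-node bursts into multi-node bursts.

    single_bursts: iterable of (node, start, end) tuples (assumed format).
    Returns (start, end, nodes) for each merged group with >= 2 distinct nodes.
    """
    merged = []
    for node, start, end in sorted(single_bursts, key=lambda b: (b[1], b[2])):
        if merged and start <= merged[-1][1]:
            # Overlaps the current group: extend it and record the node.
            merged[-1][1] = max(merged[-1][1], end)
            merged[-1][2].add(node)
        else:
            merged.append([start, end, {node}])
    return [(s, e, sorted(nodes)) for s, e, nodes in merged if len(nodes) >= 2]


# Illustrative single-node bursts for four nodes.
single = [
    ("n1", 0, 50),
    ("n2", 40, 90),
    ("n3", 300, 320),
    ("n1", 310, 330),
    ("n4", 900, 910),
]
result = multi_node_bursts(single)
print(result)  # [(0, 90, ['n1', 'n2']), (300, 330, ['n1', 'n3'])]
```

The burst on n4 overlaps no other node's burst, so it is dropped from the multi-node result.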
Specification