Determination of related failure events in a multi-node system
First Claim
Patent Images
1. A method comprising:
- obtaining event data from each node in a group of associated nodes;
identifying at least one single-node event burst occurring on each of a plurality of the nodes in the group of nodes based on the obtained event data, including identifying at least one node event that does not have a corresponding opposite node event and creating an opposite node event associated with the at least one node event; and
identifying a multi-node event burst occurring in the group of associated nodes based on the identified single-node event bursts.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for determining related node failures in a multi-node system use log data obtained from the nodes. This log data is processed in various ways to indicate clusters of nodes that experience related failures.
-
Citations
53 Claims
-
1. A method comprising:
-
obtaining event data from each node in a group of associated nodes; identifying at least one single-node event burst occurring on each of a plurality of the nodes in the group of nodes based on the obtained event data, including identifying at least one node event that does not have a corresponding opposite node event and creating an opposite node event associated with the at least one node event; and identifying a multi-node event burst occurring in the group of associated nodes based on the identified single-node event bursts. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method comprising:
-
obtaining event data from each node in a group of associated nodes; storing the obtained event data in an event table; identifying at least one single-node event burst occurring on each of a plurality of the nodes in the group of nodes based on the obtained event data; identifying at least one up event that does not have a corresponding down event in the data structure; creating an associated down event for the up event; and identifying a multi-node event burst occurring in the group of associated nodes based on the identified single-node event bursts.
-
-
13. A method comprising:
-
obtaining event data from each node in a group of associated nodes; storing the obtained event data in an event table; identifying at least one down event that does not have a corresponding up event in the data structure; creating an associated up event for the down event; and identifying a multi-node event burst occurring in the group of associated nodes based on the identified single-node event bursts.
-
-
14. A method comprising:
-
obtaining event data from each node in a group of associated nodes; identifying at least one single-node event burst occurring on each of a plurality of the nodes in the group of nodes based on the obtained event data; and identifying a multi-node event burst occurring in the group of associated nodes based on the identified single-node event bursts; and determining a relationship strength value for two nodes having associated single-node event bursts occurring in the identified multi-node event burst.
-
-
15. A method comprising:
-
obtaining event data from each node in a group of associated nodes; identifying at least one single-node event burst occurring on each of a plurality of the nodes in the group of nodes based on the obtained event data; identifying a multi-node event burst occurring in the group of associated nodes based on the identified single-node event bursts; determining relationship strength values for pairs of nodes based on the identified multi-node event burst; and determining at least one cluster of related nodes based on the determined relationship strength values.
-
-
16. A method comprising:
-
obtaining event data from each node in a group of associated nodes; identifying at least one single-node event burst occurring on each of a plurality of the nodes in the group of nodes based on the obtained event data; identifying a multi-node event burst occurring in the group of associated nodes based on the identified single-node event bursts; determining relationship strength values for pairs of nodes based on the identified multi-node event burst; determining at least one cluster of related nodes based on the determined relationship strength values; and determining at least one metric for the at least one cluster.
-
-
17. A method comprising:
-
obtaining event data from each node in a group of associated nodes; identifying at least one single-node event burst occurring on each of a plurality of the nodes in the group of nodes based on the obtained event data; identifying a multi-node event burst occurring in the group of associated nodes based on the identified single-node event bursts; determining relationship strength values for pairs of nodes based on the identified multi-node event burst; determining at least one cluster of related nodes based on the determined relationship strength values; determining at least one metric for the at least one cluster; and presenting the determined metric to a user.
-
-
18. A method comprising:
-
obtaining event data from each node in a group of associated nodes; identifying at least one single-node event burst occurring on each of a plurality of the nodes in the group of nodes based on the obtained event data; identifying a multi-node event burst occurring in the group of associated nodes based on the identified single-node event bursts; determining relationship strength values for pairs of nodes based on the identified multi-node event burst; determining at least one cluster of related nodes based on the determined relationship strength values; determining a plurality of metrics for the at least one cluster; and presenting the determined plurality of metrics to a user.
-
-
19. One or more processor-readable storage media having stored thereon processor executable instructions for performing acts comprising:
-
obtaining event data from each of a plurality of nodes in a node group; identifying a multi-node event burst occurring in the node group based on the event data; determining relationship strength values for pairs of nodes based on the identified multi-node event burst; and determining at least one cluster of related nodes based on the determined relationship strength values. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 53)
-
-
33. One or more processor-readable storage media having stored thereon processor executable instructions for performing operations comprising:
-
identifying a multi-node event burst occurring in a node group based on the event data collected from each of a plurality of nodes in a node group; determining a cluster of related nodes based on the identified multi-node event burst; and determining at least one metric for cluster. - View Dependent Claims (34, 35, 36, 37, 38)
-
-
39. A computer system comprising:
-
means for obtaining event data from each of a plurality of nodes of a group of associated nodes; means for identifying single-node event bursts occurring on the nodes based on the obtained event data, including means for identifying at least one node event that does not have a corresponding opposite node event and means for creating an opposite node event associated with the at least one node event; and means for identifying a multi-node event burst occurring in the group of associated nodes based on the identified single-node bursts. - View Dependent Claims (40, 41, 42, 43, 44, 45, 46, 47, 48, 49)
-
-
50. A computer system comprising:
-
means for obtaining event data from each of a plurality of nodes of a group of associated nodes; means for identifying single-node event bursts occurring on the nodes based on the obtained event data; means for identifying a multi-node event burst occurring in the group of associated nodes based on the identified single-node bursts; and means for storing the obtained event data in an event table and wherein identifying single-node event bursts includes identifying at least one up event that does not have a corresponding down event in the data structure and creating an associated down event for the up event.
-
-
51. A computer system comprising:
-
means for obtaining event data from each of a plurality of nodes of a group of associated nodes; means for identifying single-node event bursts occurring on the nodes based on the obtained event data; means for identifying a multi-node event burst occurring in the group of associated nodes based on the identified single-node bursts; and means for storing the obtained event data in an event table and wherein identifying single-node event bursts includes identifying at least one down event that does not have a corresponding up event in the data structure and creating an associated up event for the down event.
-
-
52. A computer system comprising:
-
means for obtaining event data from each of a plurality of nodes of a group of associated nodes; means for identifying single-node event bursts occurring on the nodes based on the obtained event data; means for identifying a multi-node event burst occurring in the group of associated nodes based on the identified single-node bursts; and means for determining a relationship strength value for two nodes having associated single-node event bursts occurring in an identified multi-node event burst.
-
Specification