System and method for problem determination using dependency graphs and run-time behavior models
Abstract
A problem determination system and method reduce the time and effort required by system administrators to troubleshoot transaction processing difficulties in a large distributed I/T environment by monitoring system component performance and computing operational performance threshold limits based on dependency graphs for the system. Using this data, a prioritized list of suspect components for the I/T system is generated.
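As a rough illustration of the idea in the abstract, the following Python sketch walks a dependency graph for one transaction, flags components whose monitored response time exceeds an operational limit, and returns them worst first. It is not the patented implementation: the `Component` class, the `suspects` function, the ratio-based ranking, and all names and numbers are assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class Component:
    name: str
    threshold_ms: float  # operational limit (hypothetical value)
    observed_ms: float   # monitored response time (hypothetical value)


def suspects(dependency_graph, components, transaction):
    """Return components used by `transaction` that exceed their limit, worst first."""
    # Collect every component reachable from the transaction in the graph.
    seen, stack = set(), [transaction]
    while stack:
        node = stack.pop()
        for child in dependency_graph.get(node, []):
            if child not in seen:
                seen.add(child)
                stack.append(child)
    # Keep only components that violate their operational limit.
    bad = [components[n] for n in seen
           if components[n].observed_ms > components[n].threshold_ms]
    # Rank by how far each component exceeds its limit (an assumed heuristic).
    return sorted(bad, key=lambda c: c.observed_ms / c.threshold_ms, reverse=True)


# Hypothetical three-tier deployment.
graph = {"checkout": ["web"], "web": ["app"], "app": ["db", "cache"]}
components = {
    "web": Component("web", threshold_ms=100, observed_ms=80),
    "app": Component("app", threshold_ms=200, observed_ms=450),
    "db": Component("db", threshold_ms=150, observed_ms=600),
    "cache": Component("cache", threshold_ms=20, observed_ms=5),
}
ranked = [c.name for c in suspects(graph, components, "checkout")]
print(ranked)
```

In this made-up data set, only `app` and `db` exceed their limits, and `db` ranks first because it overshoots its limit by the larger factor.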
8 Claims
1. A problem determination system implemented in instructions recorded on computer readable media, said instructions, when run on a computer, causing said computer to provide:
means for computing operational limits for a set of resources in a distributed transaction system;
means for determining a poor performance status for said resources relative to said operational limits;
means for generating a set of root cause components which are likely to be causing said poor performance status;
means for prioritizing said set of root cause components; and
means for providing said prioritized set of root cause components to a systems administrator;
wherein, said means for computing operational limits includes:
means for capturing monitored information from said distributed transaction system;
means for automatically computing individual component level thresholds from user level SLAs; and
means for generating a dynamic dependency information for said distributed transaction system using static dependency data and said monitored information.
2. A problem determination system implemented in instructions recorded on computer readable media, said instructions, when run on a computer, causing said computer to provide:
means for computing operational limits for a set of resources in a distributed transaction system;
means for determining a poor performance status for said resources relative to said operational limits;
means for generating a set of root cause components which are likely to be causing said poor performance status;
means for prioritizing said set of root cause components; and
means for providing said prioritized set of root cause components to a systems administrator;
wherein, said means for determining poor performance status for said resources relative to said operational limits includes:
means for obtaining monitored information relative to performance of a transaction;
means for identifying a sub-set of resources that are used to perform said transaction from said set of resources in said distributed transaction system;
means for obtaining performance information for each resource of said sub-set of resources; and
means for comparing said monitored information with said performance information to make performance status determination.
4. A problem determination system implemented in instructions recorded on computer readable media, said instructions, when run on a computer, causing said computer to provide:
means for computing operational limits for a set of resources in a distributed transaction system;
means for determining a poor performance status for said resources relative to said operational limits;
means for generating a set of root cause components which are likely to be causing said poor performance status;
means for prioritizing said set of root cause components; and
means for providing said prioritized set of root cause components to a systems administrator;
wherein, means for generating a set of root cause components which are likely to be causing said poor performance status includes:
means for selecting one of several algorithms for calculating a severity value for each of said resources; and
means for creating a priority list of said set of root cause components based on severity values.
5. A set of resources configured as a distributed transactions system in which problem determination is performed on an automatic basis by a distributed transaction system including:
the set of resources configured as a distributed transaction system;
means for computing operational limits for a set of resources in a distributed transaction system;
means for determining a poor performance status for said resources relative to said operational limits;
means for generating a set of root cause components which are likely to be causing said poor performance status;
means for prioritizing said set of root cause components; and
means for providing said prioritized set of root cause components to a systems administrator;
wherein, said means for computing operational limits includes:
means for capturing monitoring information from said distributed transaction system;
means for automatically computing individual component level thresholds from user level SLAs; and
means for generating a dynamic dependency information for said distributed transaction system using static dependency data and said monitoring information.
6. A set of resources configured as a distributed transactions system in which problem determination is performed on an automatic basis by a distributed transaction system including:
the set of resources configured as a distributed transaction system;
means for computing operational limits for a set of resources in a distributed transaction system;
means for determining a poor performance status for said resources relative to said operational limits;
means for generating a set of root cause components which are likely to be causing said poor performance status;
means for prioritizing said set of root cause components; and
means for providing said prioritized set of root cause components to a systems administrator;
wherein, said means for determining poor performance status for said resources relative to said operational limits includes:
means for obtaining monitoring information relative to performance of a transaction;
means for identifying a sub-set of resources that are used to perform said transaction from said set of resources in said distributed transaction system;
means for obtaining performance information for each resource of said sub-set of resources; and
means for comparing said monitoring information with said performance information to make performance status determination.
8. A set of resources configured as a distributed transactions system in which problem determination is performed on an automatic basis by a distributed transaction system including:
the set of resources configured as a distributed transaction system;
means for computing operational limits for a set of resources in a distributed transaction system;
means for determining a poor performance status for said resources relative to said operational limits;
means for generating a set of root cause components which are likely to be causing said poor performance status;
means for prioritizing said set of root cause components; and
means for providing said prioritized set of root cause components to a systems administrator;
wherein, means for generating a set of root cause components which are likely to be causing said poor performance status includes:
means for selecting one of several algorithms for calculating a severity value for each of said resources; and
means for creating a priority list of said set of root cause components based on severity values.
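The "wherein" clauses of the claims name three building blocks: dynamic dependency information derived from static dependency data plus monitored information, component-level thresholds derived from a user-level SLA, and a selectable severity algorithm that drives the priority list. The Python sketch below shows one possible reading of each. The claims do not specify any of these formulas, so the edge-filtering rule, the proportional SLA split, the two severity functions, and every name and number here are assumptions for illustration only.

```python
def dynamic_dependencies(static_deps, observed_calls):
    """Keep only the static edges that monitoring actually saw exercised."""
    return {
        src: [dst for dst in dsts if (src, dst) in observed_calls]
        for src, dsts in static_deps.items()
    }


def component_thresholds(sla_ms, avg_time_ms):
    """Split a user-level SLA across components in proportion to average time.

    The proportional split is an assumed rule, not one stated in the claims.
    """
    total = sum(avg_time_ms.values())
    return {name: sla_ms * t / total for name, t in avg_time_ms.items()}


# Two hypothetical severity algorithms to choose between.
SEVERITY_ALGORITHMS = {
    "ratio": lambda observed, limit: observed / limit,   # relative overshoot
    "excess": lambda observed, limit: observed - limit,  # absolute overshoot
}


def priority_list(observed_ms, thresholds, algorithm="ratio"):
    """Score every component with the chosen algorithm and sort worst first."""
    score = SEVERITY_ALGORITHMS[algorithm]
    severity = {n: score(observed_ms[n], thresholds[n]) for n in thresholds}
    return sorted(severity, key=severity.get, reverse=True)


# Hypothetical inputs.
static_deps = {"web": ["app"], "app": ["db", "cache"]}
observed_calls = {("web", "app"), ("app", "db")}
deps = dynamic_dependencies(static_deps, observed_calls)

limits = component_thresholds(1000, {"web": 50, "app": 150, "db": 300})
observed = {"web": 260, "app": 450, "db": 1200}

by_ratio = priority_list(observed, limits, "ratio")
by_excess = priority_list(observed, limits, "excess")
print(deps, limits, by_ratio, by_excess)
```

With these made-up numbers the two algorithms disagree on the top suspect (`web` by ratio, `db` by absolute excess), which is one reason a system might offer a choice between several severity calculations.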
Specification