System and method for managing faults in a distributed system

  • US 5,664,093 A
  • Filed: 07/25/1996
  • Issued: 09/02/1997
  • Est. Priority Date: 12/27/1994
  • Status: Expired due to Fees
First Claim

1. A fault management system for use in a distributed system, comprising:

  • a configuration manager maintaining configuration information of components used in the distributed system, the configuration information comprising an object-oriented model describing relationships between the components, wherein the object-oriented model maintains a list of the components as objects and an understanding of how the objects are related;

  • a plurality of measurement agents obtaining performance information from the components in the distributed system; and

  • a diagnostic system coupled to the configuration manager and each of the plurality of measurement agents for identifying faults occurring in the distributed system and providing solutions for correcting the faults, the diagnostic system comprising a knowledge base having a plurality of rules for the components and an inference engine for applying the rules to the performance information, the diagnostic system receiving the configuration information from the configuration manager and the performance information from the plurality of measurement agents and using the configuration and performance information to identify faults and provide solutions for the faults, the diagnostic system identifying faults by querying the configuration manager for the object-oriented model of the components and using the model along with the plurality of rules in the knowledge base to identify the causes responsible for the fault and to provide solutions for correcting the faults, the diagnostic system initiating the identification of faults at any location in the object-oriented model.
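
The three claimed elements can be pictured with a minimal sketch. This is an illustration only, not the patentee's implementation: every class name, metric name, threshold, and solution string below is hypothetical, and the inference engine is reduced to a walk over the model that applies each matching rule. The sketch shows a configuration manager holding components as related objects, measurement agents supplying performance readings, and a diagnostic system that can begin at any component in the model.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class Component:
    """One object in the configuration model (hypothetical shape)."""
    name: str
    kind: str
    depends_on: List[str] = field(default_factory=list)  # relationships


class ConfigurationManager:
    """Keeps the list of components and how they relate to each other."""
    def __init__(self) -> None:
        self._components: Dict[str, Component] = {}

    def add(self, component: Component) -> None:
        self._components[component.name] = component

    def model(self) -> Dict[str, Component]:
        return dict(self._components)


class MeasurementAgent:
    """Reports performance readings for one component (values are invented)."""
    def __init__(self, component: str, readings: Dict[str, float]) -> None:
        self.component = component
        self._readings = readings

    def collect(self) -> Dict[str, float]:
        return dict(self._readings)


@dataclass
class Rule:
    """Knowledge-base entry: applies to one kind of component."""
    kind: str
    condition: Callable[[Dict[str, float]], bool]
    solution: str


class DiagnosticSystem:
    def __init__(self, config: ConfigurationManager,
                 agents: List[MeasurementAgent], rules: List[Rule]) -> None:
        self.config = config
        self.agents = {a.component: a for a in agents}
        self.rules = rules

    def diagnose(self, start: str) -> List[Tuple[str, str]]:
        """Walk the model from `start`, applying rules to each component."""
        model = self.config.model()  # query the configuration manager
        findings: List[Tuple[str, str]] = []
        seen, stack = set(), [start]
        while stack:
            name = stack.pop()
            if name in seen or name not in model:
                continue
            seen.add(name)
            comp = model[name]
            agent = self.agents.get(name)
            metrics = agent.collect() if agent else {}
            for rule in self.rules:
                if rule.kind == comp.kind and rule.condition(metrics):
                    findings.append((name, rule.solution))
            stack.extend(reversed(comp.depends_on))  # follow relationships
        return findings


# Hypothetical three-component system: application -> database -> disk.
config = ConfigurationManager()
config.add(Component("billing", "application", ["db1"]))
config.add(Component("db1", "database", ["disk1"]))
config.add(Component("disk1", "disk"))

agents = [MeasurementAgent("db1", {"query_ms": 900.0}),
          MeasurementAgent("disk1", {"used_pct": 97.0})]

rules = [
    Rule("database", lambda m: m.get("query_ms", 0.0) > 500.0,
         "check indexes or the storage beneath the database"),
    Rule("disk", lambda m: m.get("used_pct", 0.0) > 90.0,
         "free space or add capacity"),
]

diagnostics = DiagnosticSystem(config, agents, rules)
```

Calling `diagnostics.diagnose("billing")` starts at the top of the model and reaches the database and the disk through their relationships, while `diagnostics.diagnose("disk1")` starts at a leaf. Either entry point works because the walk depends only on the model, which is one way to read the claim's "at any location in the object-oriented model" language.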
