Distributed device status in a clustered system environment
First Claim
1. An apparatus for communicating error and status information, said apparatus comprising:
- first, second, and third computer systems, said first, second, and third computer systems being interconnected via a shared bus;
first, second, and third system managers, said first system manager residing on said first computer system, said second system manager residing on said second computer system, and said third system manager residing on said third computer system;
a primary IOP manager and first and second secondary IOP managers, said primary IOP manager residing on said first computer system, said first secondary IOP manager residing on said second computer system, and said second secondary IOP manager residing on said third computer system;
a plurality of IOPs, each of said plurality of IOPs being capable of communicating said error and status information to said first, second, and third computer systems;
means for communicating said error and status information from a first IOP to said primary IOP manager, said first IOP being one of said plurality IOPs;
means for communicating said error and status information from said primary IOP manager to said first system manager;
means for communicating said error and status information from said primary IOP manager to said first secondary IOP manager and said second secondary IOP manager, said error and status information being communicated via said first IOP; and
means for communicating said error and status information from said first secondary IOP manager to said second system manager and means for communicating said error and status information from said second secondary IOP manager to said third system manager.
0 Assignments
0 Petitions
Accused Products
Abstract
Problem ownership and status propagation in a clustered system environment must be coordinated in a way that avoids redundant and/or conflicting recovery efforts. IOP managers which overcome these problems are disclosed. A primary IOP manager is one which has problem ownership for a particular IOP. There is only one primary IOP manager for each IOP. A secondary IOP manager is one which resides upon a computer system that shares the resources of a particular IOP or IOPs, but nevertheless does not have problem ownership for that particular IOP(s). There are one or more secondary IOP managers for a given IOP. When a primary IOP manager is informed of a problem in a device attached to a shared IOP, it updates resident system management information to indicate the new status of the subject device, informs its local system management of the problem so that corrective actions may be taken, and sends the status information to the secondary IOP managers via the IOP itself. When a secondary IOP manager is informed of a problem with a device attached to a shared IOP, it updates system management information to indicate the new status of the subject device and sends the status information to the primary IOP manager and other secondary IOP managers via the IOP itself. The secondary IOP manager does not, however, inform its local system management that corrective actions are required.
-
Citations
18 Claims
-
1. An apparatus for communicating error and status information, said apparatus comprising:
-
first, second, and third computer systems, said first, second, and third computer systems being interconnected via a shared bus; first, second, and third system managers, said first system manager residing on said first computer system, said second system manager residing on said second computer system, and said third system manager residing on said third computer system; a primary IOP manager and first and second secondary IOP managers, said primary IOP manager residing on said first computer system, said first secondary IOP manager residing on said second computer system, and said second secondary IOP manager residing on said third computer system; a plurality of IOPs, each of said plurality of IOPs being capable of communicating said error and status information to said first, second, and third computer systems; means for communicating said error and status information from a first IOP to said primary IOP manager, said first IOP being one of said plurality IOPs; means for communicating said error and status information from said primary IOP manager to said first system manager; means for communicating said error and status information from said primary IOP manager to said first secondary IOP manager and said second secondary IOP manager, said error and status information being communicated via said first IOP; and means for communicating said error and status information from said first secondary IOP manager to said second system manager and means for communicating said error and status information from said second secondary IOP manager to said third system manager. - View Dependent Claims (2, 3)
-
-
4. An apparatus for communicating error and status information, said apparatus comprising:
-
first, second, and third computer systems; first, second, and third system managers, said first system manager residing on said first computer system, said second system manager residing on said second computer system, and said third system manager residing on said third computer system; a primary IOP manager and first and second secondary IOP managers, said primary IOP manager residing on said first computer system, said first secondary IOP manager residing on said second computer system, and said second secondary IOP manager residing on said third computer system; a plurality of IOPs, each of said plurality of IOPs being capable of communicating status and error information to said first, second, and third computer systems; means for communicating said error and status information from a first IOP to said first secondary IOP manager, said first IOP being one of said plurality of IOPs; means for communicating said error and status information from said first secondary IOP manager to said second system manager; means for communicating said error and status information from said first secondary IOP manager to said primary IOP manager and said second secondary IOP manager, said error and status information being communicated via said first IOP; and means for communicating said error and status information from said primary IOP manager to said first system manager and means for communicating said error and status information from said second secondary IOP manager to said third system manager. - View Dependent Claims (5, 6, 7, 8)
-
-
9. A method for communicating error and status information, said method comprising the steps of:
-
communicating said error and status information from a first IOP to a primary IOP manager, said first IOP being one of a plurality of IOPs, said primary IOP manager residing on a first computer system; communicating said error and status information from said primary IOP manager to a first system manager, said first system manager residing on said first computer system; communicating said error and status information from said primary IOP manager to a first secondary IOP manager and a second secondary IOP manager, said first secondary IOP manager residing on a second computer system and said second secondary IOP manager residing on a third computer system, said error and status information being communicated via said first IOP; and communicating said error and status information from said first secondary IOP manager to a second system manager and communicating said error and status information from said second secondary IOP manager to a third system manager, said second system manager residing on said second computer system and said third system manager residing on said third computer system. - View Dependent Claims (10, 11)
-
-
12. A method for communicating error and status information amongst first, second, and third computer systems, said method comprising the steps of:
-
communicating said error and status information from an IOP to a first secondary IOP manager, said IOP being one of a plurality of IOPs, said first secondary IOP manager residing on said second computer system; communicating said error and status information from said first secondary IOP manager to a second system manager, said second system manager residing on said second computer system; communicating said error and status information from said first secondary IOP manager to a primary IOP manager and a second secondary IOP manager, said primary IOP manager residing on said first computer system and said second secondary IOP manager residing on said third computer system, said error and status information being communicated via said IOP; and communicating said error and status information from said primary IOP manager to a first system manager and communicating said error and status information from said second secondary IOP manager to a third system manager, said first system manager residing on said first computer system and said third system manager residing on said third computer system. - View Dependent Claims (13, 14, 15, 16)
-
-
17. An apparatus for communicating error and status information amongst the nodes of a network of nodes, said apparatus comprising:
-
first, second, and third nodes, said first, second, and third nodes being interconnected via a network; first, second, and third system managers, said first system manager residing on said first node, said second system manager residing on said second node, and said third system manager residing on said third node; a primary communications manager and first and second secondary communications managers, said primary communications manager residing on said first node, said first secondary communications manager residing on said second node, and said second secondary communications manager residing on said third node; a plurality of shared devices, said shared devices being capable of communicating error and status information to said first, second, and third nodes; means for communicating said error and status information from a first shared device to said primary communications manager, said first shared device being one of said plurality of shared devices; means for communicating said error and status information from said primary communications manager to said first system manager; means for communicating said error and status information from said primary communications manager to said first secondary communications manager and said second secondary communications manager, said error and status information being communicated via said first shared device; and means for communicating said error and status information from said first secondary communications manager to said second system manager and means for communicating said error and status information from said second secondary communications manager to said third system manager.
-
-
18. An apparatus for communicating error and status information, said apparatus comprising:
-
first, second, and third nodes; first, second, and third system managers, said first system manager residing on said first node, said second system manager residing on said second node, and said third system manager residing on said third node; a primary communications manager and first and second secondary communications managers, said primary communications manager residing on said first node, said first secondary communications manager residing on said second node, and said second secondary communications manager residing on said third node; a plurality of shared devices, each of said plurality of shared devices being capable of communicating error and status information to said first, second, and third nodes; means for communicating said error and status information from a first shared device to said first secondary communications manager, said first shared device being one of said plurality of shared devices; means for communicating said error and status information from said first secondary communications manager to said second system manager; means for communicating said error and status information from said first secondary communications manager to said primary communications manager and said second secondary communications manager, said error and status information being communicated via said first shared device; and means for communicating said error and status information from said primary communications manager to said first system manager and means for communicating said error and status information from said second secondary communications manager to said third system manager.
-
Specification