Monitoring and controlling system and method for data processing system
Abstract
A fault monitoring and controlling apparatus and method for a data processing system comprising one or more computer systems, monitor and control apparatuses connected to them for monitoring and controlling the associated computer systems, and a single supervision/control system provided at a site remote from the computer systems. When an abnormality occurring in a computer system is judged to be a fault, the fault occurrence is automatically reported to the remotely located supervision/control system, which responds to the fault report by issuing, if necessary, a command requesting additional detailed fault information from the computer system suffering the fault. The additional fault information is collated with the accumulated fault information (precedents). When the collation finds a precedent that coincides with the current fault, a recovery procedure is generated on the basis of the precedent data and transmitted to the computer system concerned. A plurality of computer systems can thus be monitored in a consolidated manner, and faults occurring in them can be removed rapidly through diagnosis and execution of an appropriate procedure.
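The flow summarized in the abstract can be illustrated with a short sketch. It is not the patented implementation: the class names, the substring comparison, the fixed-size history and the dictionary of precedents are all assumptions chosen only to make the detect, report, collate and recover steps concrete.

```python
from collections import deque
from typing import Dict, List, Optional


class MonitorApparatus:
    """Hypothetical local monitor attached to one computer system."""

    def __init__(self, reference_messages: List[str], history_size: int = 100):
        # Reference messages used to recognize abnormal states (assumed format).
        self.reference_messages = reference_messages
        # Bounded log holding only the most recent messages.
        self.history = deque(maxlen=history_size)

    def receive(self, message: str) -> Optional[dict]:
        """Store a message; return a fault report if it matches a reference."""
        self.history.append(message)
        for reference in self.reference_messages:
            if reference in message:  # assumption: plain substring comparison
                return {"fault_key": reference, "message": message}
        return None

    def detailed_info(self) -> List[str]:
        """Return the stored messages when the remote site asks for details."""
        return list(self.history)


class SupervisionSystem:
    """Hypothetical remote supervision/control system holding precedents."""

    def __init__(self, precedents: Dict[str, List[str]]):
        # Maps a fault signature to the recovery steps used previously.
        self.precedents = precedents

    def handle_report(self, report: dict, monitor: MonitorApparatus) -> Optional[List[str]]:
        """Collate the report plus requested details against the precedents."""
        evidence = [report["message"], *monitor.detailed_info()]
        for signature, steps in self.precedents.items():
            if any(signature in line for line in evidence):
                return list(steps)  # recovery procedure to send back
        return None  # no coinciding precedent found
```

In use, each message from the monitored system is passed to `receive`; a non-`None` result is forwarded to `handle_report`, and any returned list of steps stands in for the recovery procedure that is transmitted back to the affected system.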
57 Claims
1. A fault monitoring and controlling system for a data processing system, comprising:
a computer system including a main storage, a central processing unit and a manipulating means equipped with functions for operation and maintenance of said central processing unit and an operation system,
a monitor and control apparatus connected to said computer system for performing a fault monitoring and controlling operation for said computer system;
a remotely located supervision and control system for providing information to said monitor and control apparatus for supervising and controlling said computer system;
wherein said monitor and control apparatus comprises;
first storage means for storing message data of said operation system of said computer system or command data inputted from said manipulating means;
second storage means for storing reference message data for detecting abnormal states;
comparison means for comparing message data of said operation system with contents of said second storage means;
fault decision means for deciding whether a fault has occurred on the basis of said comparison; and
report control means for automatically reporting the occurrence of the fault to said remotely located supervision and control system.
Dependent claims: 17, 21, 25
2. A fault monitoring and controlling system for a data processing system, comprising:
a computer system including a main storage, a central processing unit and a manipulating means equipped with functions for operation and maintenance of said central processing unit and an operation system,
a monitor and control apparatus connected to said computer system for performing a fault monitoring and controlling operation for said computer system;
a remotely located supervision and control system for providing information to said monitor and control apparatus for supervising and controlling said computer system;
wherein said monitor and control apparatus comprises;
means for displaying a message data received from said operation system and a service processor on a console unit;
means for sending a command data inputted from said console unit to said operation system and said service processor;
means for recording a history of said message and command data;
means for detecting an occurrence of a fault in said computer system by comparing said message data with a fault message previously stored;
means for storing a predetermined number of message data of the operation system and said service processor of said computer system; and
means for sending the contents of said storing means to said remotely located supervision and control system in response to a command issued by said supervision and control system.
Dependent claims: 18, 22, 26
3. A fault monitoring and controlling system for a data processing system, comprising:
a computer system including a main storage, a central processing unit and a manipulating means equipped with functions for operation and maintenance of said central processing unit and an operation system,
a monitor and control apparatus connected to said computer system for performing a fault monitoring and controlling operation for said computer system;
a remotely located supervision and control system for providing information to said monitor and control apparatus for supervising and controlling said computer system;
wherein said monitor and control apparatus comprises;
means for receiving hardware fault information indicating occurrence of a fault in said central processing unit of the computer system from a service processor and processing units in said computer system;
means for accumulating the hardware fault information to produce an accumulated history of hardware fault information;
means for detecting the occurrence of the hardware fault in said computer system; and
means for reporting the detected hardware fault to said remotely located supervision and control system.
Dependent claims: 8, 9, 19, 23, 27
4. A fault monitoring and controlling system for a data processing system, comprising:
a computer system including a main storage, a central processing unit and a manipulating means equipped with functions for operation and maintenance of said central processing unit and an operation system,
a monitor and control apparatus connected to said computer system for performing a fault monitoring and controlling operation for said computer system;
a remotely located supervision and control system for providing information to said monitor and control apparatus for supervising and controlling said computer system;
wherein said monitor and control apparatus comprises;
means for receiving hardware fault information indicating an occurrence of a fault in said central processing unit of said computer system;
means for storing said hardware fault information;
means for reading out, from a memorizing means, a hardware fault state of said central processing unit on the basis of a command issued from said remotely located supervision and control system; and
means for sending information read out to said remotely located supervision and control system.
Dependent claims: 10, 11, 12, 13, 14, 15, 16, 20, 24, 28
5. A fault monitoring and controlling system for a data processing system, comprising:
a computer system including a main storage, a central processing unit and a manipulating means equipped with functions for operation and maintenance of said central processing unit and an operation system,
a monitor and control apparatus connected to said computer system for performing a fault monitoring and controlling operation for said computer system; and
a remotely located supervision and control system for providing information to said monitor and control apparatus for supervising and controlling said computer system;
wherein said remotely located supervision and control system comprises;
fault precedent storage means for storing precedent fault information to serve as a precedent when said remotely located supervision and control system receives fault occurrence information from said monitor and control apparatus;
collation means for collating said fault occurrence information with the contents of said fault precedent storage means;
recovery processing procedure generating means for generating a recovery processing procedure for remedying a fault when said collation performed by said collation means shows the presence of a precedent which coincides with the fault; and
sending means for sending the recovery processing procedure generated by said recovery processing procedure generating means to said monitor and control apparatus.
Dependent claims: 6, 7, 29, 30, 31, 32, 33
34. A fault monitoring and controlling system for a data processing system, comprising:
a computer system including a main storage, a central processing unit and a manipulating means equipped with functions for operation and maintenance of said central processing unit and an operation system,
a monitor and control apparatus connected to said computer system for performing a fault monitoring and controlling operation for said computer system; and
a remotely located supervision and control system for providing information to said monitor and control apparatus for supervising and controlling from a remote site said computer system;
wherein when a software or hardware fault takes place in said computer system and when fault information of the occurrence of the software or hardware fault is received by said remotely located supervision and control system, said fault information is collated with a content of a fault precedent storage means incorporated in said remotely located supervision and control system,
and when a precedent coinciding with said fault is present, recovery processing procedure for removing the fault from said computer system is generated and transferred to said monitor and control apparatus provided in association with said computer system suffering from the fault,
said monitor and control apparatus receiving said recovery processing procedure, restarting said computer system in accordance with said recovery processing procedure, and displaying said recovery processing procedure on a console unit of said monitor and control apparatus.
35. A fault monitoring and controlling method for a data processing system which comprises:
a computer system including a main storage, a central processing unit and a manipulating means equipped with functions for operation and maintenance of said central processing unit and an operation system;
a monitor and control apparatus connected to said computer system for performing a fault monitoring and controlling operation for said computer systems; and
a single remotely located supervision and control system for providing information to said monitor and control apparatus for supervising and controlling said computer system;
wherein said monitor and control apparatus performs;
a first storage step for storing a predetermined number of message data of the operation system of the computer system or command data inputted through said manipulating means;
a second storage step for storing message data for detecting abnormal states;
a comparison step for comparing said message data of said operation system with the contents of a second storage means;
a fault decision step for deciding occurrence of a fault based upon the result of said comparison; and
a report control step for reporting automatically the occurrence of the fault to said remotely located supervision and control system.
36. A fault monitoring and controlling method for a data processing system which comprises:
a computer system including a main storage, a central processing unit and a manipulating means equipped with functions for operation and maintenance of said central processing unit and an operation system,
a monitor and control apparatus connected to said computer system for performing a fault monitoring and controlling operation for said computer systems; and
a single remotely located supervision and control system for providing information to said monitor and control apparatus for supervision and controlling said computer system;
wherein said monitor and control apparatus performs;
a first storage step for storing a predetermined number of message data of the operation system of the computer system or command data inputted for said manipulating means; and
a send control step for sending contents of a first storage means to said remotely located supervision and control system in response to a command issued by said supervision and control system.
37. A fault monitoring and controlling method for a data processing system which comprises:
a computer system including a main storage, a central processing unit and a manipulating means equipped with functions for operation and maintenance of said central processing unit and an operation system,
a monitor and control apparatus connected to said computer system for performing a fault monitoring and controlling operation for said computer system; and
a single remotely located supervision and control system for providing information to said monitor and control apparatus for supervising and controlling said computer system;
wherein said monitor and control apparatus performs;
a control step for receiving a signal indicating occurrence of a fault in hardware of said central processing unit of the computer system; and
a report control step for reporting automatically the occurrence of the fault to said remotely located supervision and control system.
Dependent claims: 42, 43
38. A fault monitoring and controlling method for a data processing system which comprises:
a computer system including a main storage, a central processing unit and a manipulating means equipped with functions for operation and maintenance of said central processing unit and an operation system,
a monitor and control apparatus connected to said computer system for performing a fault monitoring and controlling operation for said computer system; and
a single remotely located supervision and control system for providing information to said monitor and control apparatus for supervising and controlling said computer system;
wherein each of said monitor and control apparatus performs;
a control step for reading out fault state of hardware in said central processing unit based upon a command issued from said remotely located supervision and control system; and
a send control step for sending out information read out to said remotely located supervision and control system.
Dependent claims: 44, 45, 46, 47, 48, 49
39. A fault monitoring and controlling method for a data processing system, comprising:
a computer system including a main storage, a central processing unit and a manipulating means equipped with functions for operation and maintenance of said central processing unit and an operation system,
a monitor and control apparatus connected to said computer system for performing a fault monitoring and controlling operation for said computer system; and
a single remotely located supervision and control system for providing information to said monitor and control apparatus for supervising and controlling said computer system;
wherein said single remotely located supervision and control system performs;
a fault precedent storing step for storing fault information as a record to serve as a precedent when said remotely located supervision and control system receives fault occurrence information from said monitor and control apparatus;
a collating step for collating said fault occurrence information with the contents of a fault precedent storage means; and
a recovery processing procedure generating step for generating a recovery processing procedure for remedying the fault when said collation performed by a collation means shows the presence of a precedent which coincides with the fault as informed.
Dependent claims: 40, 41, 50, 51, 52, 53, 54
55. A fault monitoring and controlling method for a data processing system which comprises:
a computer system including a main storage, a central processing unit and a manipulating means equipped with functions for operation and maintenance of said central processing unit and an operation system,
a monitor and control apparatus connected to said computer system for performing a fault monitoring and controlling operation for said computer systems; and
a single remotely located supervision and control system for providing information to said monitor and control apparatus for supervising and controlling from a remote site said computer system;
wherein when a fault takes place in said computer system and when fault information of the occurrence of the fault is received by said remotely located supervision and control system, said fault information is collated with contents of a fault precedent storage means incorporated in said remotely located supervision and control system,
and when a precedent coinciding with said fault is present, recovery processing procedure for removing the fault from said computer system is generated and transferred to said monitor and control apparatus provided in association with said computer system suffering from the fault,
said monitor and control apparatus receiving said recovery processing procedure for restarting said computer system in accordance with said recovery processing procedure.
56. A fault monitoring and controlling system for a data processing system, comprising:
a computer system including a main storage, a central processing unit and a manipulating means equipped with functions for operation and maintenance of said central processing unit and an operation system,
a monitor and control apparatus connected to said computer system for performing a fault monitoring and controlling operation for said computer systems; and
a single remotely located supervision and control system for providing information to said monitor and control apparatus for supervising and controlling said computer system;
wherein each of said monitor and control apparatus comprises;
first storage means for storing a predetermined number of message data of the operation system of the computer system or command data inputted from said manipulating means;
second storage means for storing message data for detecting abnormal states;
comparison means for comparing message data of said operation system with the contents of said second storage means;
fault decision means for deciding whether a fault has occurred on the basis of said comparison; and
report control means for automatically reporting the occurrence of the fault to said remotely located supervision and control system; and
wherein said remotely located supervision and control system comprises;
fault precedent storage means for storing precedent fault information to serve as a precedent when said remotely located supervision and control system receives fault occurrence information from said monitor and control apparatus;
collation means for collating said fault occurrence information with the contents of said fault precedent storage means; and
recovery processing procedure generating means for generating a recovery processing procedure for remedying a fault when said collation performed by said collation means shows the presence of a precedent which coincides with the fault.
57. A monitoring and controlling system comprising:
a first computer system including a service processor;
a monitor and control apparatus for monitoring and controlling the operation of said first computer system;
wherein said monitor and control apparatus includes;
first means for receiving a message data from an operating system and said service processor and displaying the message data on a console unit;
second means for sending a command data inputted from said console unit to said operating system and said service processor;
memory means for storing a plurality of said message data received by said first means and a plurality of command data inputted from said console unit;
third means for detecting an occurrence of a software fault in said first computer system by comparing said received message data with predetermined fault message data;
fourth means for receiving hardware fault signals from said first computer system and gathering hardware fault information related to the received hardware fault signals from said first computer system;
fifth means responsive to detection of the software fault by said third means, for informing a second computer system at a remotely located monitoring center of the generation of the software fault; and
sixth means for sending said plurality of message data and said plurality of command data from said memory means to said second computer system and responsive to receipt of the hardware fault signal by said fourth means for informing said computer system of the occurrence of a hardware fault within the first computer system and for sending the hardware fault information gathered by said fourth means to said second computer system.
Specification