Cluster controlling system operating on a plurality of computers in a cluster system
Abstract
A cluster controlling system transfers packages which have been operating on one computer to another computer when a fault or failure has occurred by monitoring and controlling the packages in the entire system. When the respective packages are started-up, cluster daemons on the respective computers monitor and control resources on the operating computers. The monitored and controlled data are stored in the respective computers as local data. A manager communicates with cluster daemons on the respective computers, and stores data in a global data memory to monitor and control the entire system. The manager is actually one of the packages operating in the cluster system. If a fault or failure occurs in the manager or in the computer running the manager, the manager is re-started on another computer by a cluster daemon.
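The failover behaviour described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The class names, method names, and the "first surviving computer" selection policy are assumptions made for illustration only, and the global data memory is simplified to an in-process dictionary rather than a memory hosted on one computer.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set

MANAGER = "manager"  # the manager is treated as just another package


@dataclass
class ClusterDaemon:
    """Hypothetical per-computer daemon: monitors resources, starts packages."""
    host: str
    alive: bool = True
    local_data: Dict[str, str] = field(default_factory=dict)  # monitoring results
    packages: Set[str] = field(default_factory=set)

    def monitor(self, resource: str, state: str) -> None:
        # Store the monitoring result as local data on this computer.
        self.local_data[resource] = state

    def start_package(self, name: str) -> None:
        self.packages.add(name)


class Cluster:
    """Hypothetical cluster with one manager package and a global data store."""

    def __init__(self, hosts: List[str]) -> None:
        self.daemons = {h: ClusterDaemon(h) for h in hosts}
        self.global_data: Dict[str, Dict[str, str]] = {}
        # Assumption: the manager initially runs on the first computer.
        self.manager_host = hosts[0]
        self.daemons[hosts[0]].start_package(MANAGER)

    def collect(self) -> None:
        # Manager step: gather local data from every live daemon.
        for host, daemon in self.daemons.items():
            if daemon.alive:
                self.global_data[host] = dict(daemon.local_data)

    def fail(self, host: str) -> None:
        # Mark the computer as failed and move its packages elsewhere.
        failed = self.daemons[host]
        failed.alive = False
        orphaned, failed.packages = failed.packages, set()
        # Assumption: the first surviving computer takes over.
        survivor = next((d for d in self.daemons.values() if d.alive), None)
        if survivor is None:
            raise RuntimeError("no surviving computer to take over")
        for name in orphaned:
            survivor.start_package(name)  # re-started by that computer's daemon
            if name == MANAGER:
                self.manager_host = survivor.host
```

In this sketch, calling `fail()` on the computer that runs the manager restarts the manager on a surviving computer through that computer's daemon, in the same way as any other package is transferred.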
138 Citations
20 Claims
1. A cluster controlling system, comprising:
cluster daemons for communicating with a plurality of computers, and for monitoring and controlling resources;
local data memories for storing the monitoring result as local data; and
one of said plurality of computers in the cluster system including;
global data memory for storing data collected from the local data memories of the respective computers, and for being referred to from any of the computers; and
a manager for communicating with the global data memory and the cluster daemons of the respective computers, and for monitoring and controlling the overall cluster system, wherein if a fault or failure occurs in the manager or the one computer in said plurality of computers, the same manager is re-started on another computer in said plurality of computers by its corresponding cluster daemon.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16

17. A cluster controlling system, comprising:
cluster daemons for communicating with packages and a plurality of computers;
agents for communicating with a corresponding cluster daemon on each of the respective computers, a global data memory and other agents;
local data memories for storing results which are monitored and controlled by the cluster daemons as local data; and
global data memory for storing data collected from the local data memories of the respective computers, and for being referred to from any of the computers via the agents, and
one computer in said plurality of computers including a manager which communicates with each of the plurality of cluster daemons and with the global data memory to control the cluster system, wherein if a fault or failure occurs in the manager or the one computer in said plurality of computers, the same manager is re-started on another computer in said plurality of computers by its corresponding cluster daemon.
Dependent claims: 18

19. A cluster system comprising:
a plurality of computers corresponding to and communicating with a plurality of daemons;
local data memories for storing results which are monitored and controlled by the plurality of daemons;
wherein one computer in said plurality of computers further includes;
a manager which communicates with each of the plurality of daemons and with a global data memory, to control the cluster system; and
wherein if a fault or failure occurs in the manager or the one computer in said plurality of computers, the manager is re-started on another computer in said plurality of computers by its corresponding daemon.

20. A manager comprising:
a demand processing device for receiving demands from a user;
a resource controlling device which receives a demand from the demand processing device, refers to a database in a global memory, and transmits a resource state corresponding to said demand to a first daemon of a first computer which controls resources, wherein if a fault or failure occurs in the manager or in the first computer operating the manager, the manager is re-started on a second computer by the second computer's corresponding daemon.

Specification