Mobile agents for fault diagnosis and correction in a distributed computer environment
First Claim
1. A method of diagnosing a fault in a large, distributed computer network having a management server servicing a set of gateway machines, each of the gateway machines servicing a set of endpoint machines, the method comprising the steps of:
- deploying a management infrastructure throughout the computer network, the management infrastructure including a runtime environment at each of the endpoint machines;
in response to occurrence of the fault, selecting a software agent, the software agent being executable by the runtime environment at an endpoint machine; and
deploying the selected software agent into the computer network to diagnose the fault.
1 Assignment
0 Petitions
Accused Products
Abstract
A large distributed enterprise includes computing resources that are organized into one or more managed regions, each region being managed by a management server servicing one or more gateway machines, with each gateway machine servicing a plurality of endpoint machines. A method of diagnosing a fault in such an environment begins by deploying a management infrastructure throughout the computer network, the management infrastructure including a runtime environment at each of the endpoint machines. In response to occurrence of the fault, a software agent is selected, the software agent being executable by the runtime environment at an endpoint machine. The selected software agent is then deployed into the computer network to diagnosis the fault. If the location of the fault is indeterminate, the software agent migrates to the location by gathering information about the fault as it traverses the network.
228 Citations
30 Claims
-
1. A method of diagnosing a fault in a large, distributed computer network having a management server servicing a set of gateway machines, each of the gateway machines servicing a set of endpoint machines, the method comprising the steps of:
-
deploying a management infrastructure throughout the computer network, the management infrastructure including a runtime environment at each of the endpoint machines;
in response to occurrence of the fault, selecting a software agent, the software agent being executable by the runtime environment at an endpoint machine; and
deploying the selected software agent into the computer network to diagnose the fault. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
11. A method of diagnosing a fault in a large, distributed computer network having a management server servicing a set of gateway machines, each of the gateway machines servicing a set of endpoint machines, the method comprising the steps of:
-
deploying a management infrastructure throughout the computer network, the management infrastructure including a runtime environment at each of the endpoint machines;
in response to occurrence of the fault, selecting a software agent, the software agent being executable by the runtime environment at an endpoint machine; and
deploying the selected software agent into the computer network to diagnosis the fault;
wherein the step of deploying the selected software agent comprises the steps of;
migrating the selected software agent across a given set of nodes in the computer network;
at each node at which the selected software agent is received;
(a) determining whether the fault originated from the node;
(b) if the fault did not originate from the node, identifying a given subset of nodes associated with the node that remain candidates for locating the fault;
(c) deploying the software agent to the given subset; and
(d) repeating steps (a)-(c) until a location of the fault is determined.
-
-
12. A method of diagnosing and correcting a fault in a large, distributed computer network in which a management infrastructure is supported, comprising the steps of:
-
in response to occurrence of the fault, selecting at least one software agent from a set of software agents useful in diagnosing network events;
migrating the selected software agent across a given set of nodes in the computer network until the software agent arrives at the location of the fault; and
diagnosing and correcting the fault. - View Dependent Claims (10, 13, 14, 15, 16)
-
-
17. A method of diagnosing and correcting a fault in a large, distributed computer network in which a management infrastructure is supported, comprising the steps of:
-
in response to occurrence of the fault, selecting at least one software agent from a set of software agents useful in diagnosing network events;
migrating the selected software agent across a given set of nodes in the computer network until the software agent arrives at the location of the fault; and
diagnosing and correcting the fault;
wherein the selecting step includes identifying a given characteristic of the fault and selecting the software agent based on the given characteristic. - View Dependent Claims (18)
-
-
19. In a large distributed enterprise having a management server servicing a set of gateway machines, each of which services a set of endpoint machines, a diagnostic method comprising the steps of:
-
deploying a management infrastructure throughout the computer network, the management infrastructure including a runtime environment at given endpoint machines;
in response to occurrence of a given event, selecting a software agent, the software agent being executable by the runtime environment at an endpoint machine;
deploying the selected software agent into the computer network to diagnosis the event. - View Dependent Claims (20)
-
-
21. In a large distributed enterprise having a management server servicing a set of gateway machines, each of which services a set of endpoint machines, a diagnostic method comprising the steps of:
-
deploying a management infrastructure throughout the computer network, the management infrastructure including a runtime environment at given endpoint machines;
in response to occurrence of a given event selecting a software agent, the software agent being executable by the runtime environment at an endpoint machine;
deploying the selected software agent into the computer network to diagnosis the event;
wherein the software agent is deployed by;
migrating the selected software agent across a given set of nodes in the computer network; and
at each node at which the selected software agent is received;
(a) determining whether the fault originated from the node;
(b) if the fault did not originate from the node, identifying a given subset of nodes associated with the node that remain candidates for locating the fault;
(c) deploying the software agent to the given subset; and
(d) repeating steps (a)-(c) until a location of the fault is identified.
-
-
22. Apparatus connectable into a large distributed enterprise having a management server servicing a set of gateway machines, each of which services a set of endpoint machines, comprising:
-
means for deploying a management infrastructure throughout the computer network, the management infrastructure including a runtime environment for use at each of the endpoint machines;
means, responsive to occurrence of a given event, for selecting a software agent to diagnose the event, the software agent being executable by the runtime environment at an endpoint machine; and
means for deploying the selected software agent into the computer network to diagnosis the event. - View Dependent Claims (23)
-
-
24. Apparatus connectable into a large distributed enterprise having a management server servicing a set of gateway machines, each of which services a set of endpoint machines, comprising:
-
means for deploying a management infrastructure throughout the computer network, the management infrastructure including a runtime environment for use at each of the endpoint machines;
means, responsive to occurrence of a given event, for selecting a software agent to diagnose the event, the software agent being executable by the runtime environment at an endpoint machine; and
means for deploying the computer network to diagnosis the event;
wherein the selecting means includes;
means for parsing information about the event to identify one or more characteristics of the event; and
means responsive to the parsing means for correlating the one or more event characteristics with information about prior events to identify the software agent.
-
-
25. A computer program product in computer-readable media for use in a computer having a processor, a memory, and means for connecting the computer into a large distributed enterprise having a management server servicing a set of gateway machines, the computer connectable to one of the gateway machines, the computer program product comprising:
-
a runtime environment downloaded to the computer during a deployment operation; and
a software agent deployed to the computer during a diagnostic operation and being executable by the runtime environment to diagnose a given condition associated with the computer. - View Dependent Claims (26)
-
-
27. A computer program product in computer-readable media for use in a computer having a processor, a memory, and means for connecting the computer into a large distributed enterprise having a management server servicing a set of gateway machines, the computer connectable to one of the gateway machines, the computer program product comprising:
-
a runtime environment downloaded to the computer during a deployment operation; and
a software agent deployed to the computer during a diagnostic operation and being executable by the runtime environment to diagnose a given condition associated with the computer;
wherein the runtime environment is associated with a browser of the computer and the software agent is a Java applet.
-
-
28. A computer program product in computer-readable media for use in a computer having a processor, a memory, and means for connecting the computer into a large distributed computer network, the computer network having a management server servicing a set of gateway machines, each of which services a set of endpoint machines, the computer program product comprising:
-
means for deploying a management infrastructure throughout the computer network, the management infrastructure including a runtime environment for use at each of the endpoint machines;
a set of software agents, each of the software agents comprising a set of one or more tasks;
means, responsive to occurrence of a given event, for selecting a software agent to diagnose the event, the software agent being executable by the runtime environment at an endpoint machine; and
means for deploying the selected software agent into the computer network to diagnosis the event. - View Dependent Claims (29)
-
-
30. A computer program product in computer-readable media for use in a computer having a processor, a memory, and means for connecting the computer into a large distributed computer network, the computer network having a management server servicing a set of gateway machines, each of which services a set of endpoint machines, the computer program product comprising:
-
means for deploying a management infrastructure throughout the computer network, the management infrastructure including a runtime environment for use at each of the endpoint machines;
a set of software agents, each of the software agents comprising a set of one or more tasks;
means, responsive to occurrence of a given event, for selecting a software agent to diagnose the event, the software agent being executable by the runtime environment at an endpoint machine, and means for deploying the selected software agent into the computer network to diagnosis the event;
wherein the selecting means includes;
means for parsing information about the event to identify one or more characteristics of the event; and
means responsive to the parsing means for correlating the one or more event characteristics with information about prior events to identify the software agent.
-
Specification