Monitoring Operating Parameters In A Distributed Computing System With Active Messages
First Claim
1. A method of monitoring operating parameters in a distributed computing system with active messages, the distributed computing system comprising a plurality of nodes organized for collective operations, the method comprising:
- initiating, by a root node through an active message to all other nodes, a collective operation, the active message comprising an instruction to each node to store operating parameter data in each node'"'"'s send buffer; and
responsive to the active message;
storing, by each node, the node'"'"'s operating parameter data in the node'"'"'s send buffer and returning, by the node, the operating parameter data as a result of the collective operation.
1 Assignment
0 Petitions
Accused Products
Abstract
In a distributed computing system including a nodes organized for collective operations: initiating, by a root node through an active message to all other nodes, a collective operation, the active message including an instruction to each node to store operating parameter data in each node'"'"'s send buffer; and, responsive to the active message: storing, by each node, the node'"'"'s operating parameter data in the node'"'"'s send buffer and returning, by the node, the operating parameter data as a result of the collective operation.
-
Citations
24 Claims
-
1. A method of monitoring operating parameters in a distributed computing system with active messages, the distributed computing system comprising a plurality of nodes organized for collective operations, the method comprising:
-
initiating, by a root node through an active message to all other nodes, a collective operation, the active message comprising an instruction to each node to store operating parameter data in each node'"'"'s send buffer; and responsive to the active message;
storing, by each node, the node'"'"'s operating parameter data in the node'"'"'s send buffer and returning, by the node, the operating parameter data as a result of the collective operation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. An apparatus for monitoring operating parameters in a distributed computing system with active messages, the distributed computing system comprising a plurality of nodes organized for collective operations, the apparatus comprising a computer processor and a computer memory operatively coupled to the computer processor, the computer memory having disposed within it computer program instructions capable of:
-
initiating, by a root node through an active message to all other nodes, a collective operation, the active message comprising an instruction to each node to store operating parameter data in each node'"'"'s send buffer; and responsive to the active message;
storing, by each node, the node'"'"'s operating parameter data in the node'"'"'s send buffer and returning, by the node, the operating parameter data as a result of the collective operation. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
-
17. A computer program product for monitoring operating parameters in a distributed computing system with active messages, the distributed computing system comprising a plurality of nodes organized for collective operations, the computer program product disposed in a computer readable storage medium, the computer program product comprising computer program instructions capable of:
-
initiating, by a root node through an active message to all other nodes, a collective operation, the active message comprising an instruction to each node to store operating parameter data in each node'"'"'s send buffer; and responsive to the active message;
storing, by each node, the node'"'"'s operating parameter data in the node'"'"'s send buffer and returning, by the node, the operating parameter data as a result of the collective operation. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
Specification