Parallel computer system and method for controlling parallel computer system
Abstract
A parallel computer system includes computing nodes that execute a parallel program to generate computing processes, which perform computations, and monitoring processes, and that form a monitoring hierarchical structure. Each monitoring process monitors a monitoring process and a computing process arranged immediately below it. Each computing node operates as a computing process when a computing process is allocated to it and as a monitoring process when a monitoring process is allocated to it. Each computing node to which a monitoring process is allocated performs processing that changes the hierarchical structure based on a first target value, which serves as a target for the total number of subordinate computing processes of the allocated monitoring process, and a second target value, which serves as a target for the number of monitoring processes and computing processes arranged immediately below the allocated monitoring process.
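The two quantities that the abstract compares against target values can be illustrated with a small tree model. The following is a minimal sketch only: the `Process` class, its field names, and the helper functions are hypothetical names introduced for illustration and do not come from the patent, which does not prescribe any particular data structure.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Process:
    """One process in the monitoring hierarchy (hypothetical model)."""
    name: str
    is_monitor: bool
    children: List["Process"] = field(default_factory=list)


def direct_subordinates(proc: Process) -> int:
    """Count processes immediately below proc.

    This is the quantity the second target value applies to.
    """
    return len(proc.children)


def subordinate_computing_processes(proc: Process) -> int:
    """Count computing processes below proc, directly or indirectly.

    This is the quantity the first target value applies to.
    """
    total = 0
    for child in proc.children:
        if child.is_monitor:
            # Descend through monitoring processes to reach computing processes.
            total += subordinate_computing_processes(child)
        else:
            total += 1
    return total


def example_hierarchy() -> Process:
    """Build an assumed three-layer example: one root, two monitors, four leaves."""
    leaves = [Process(f"c{i}", is_monitor=False) for i in range(4)]
    m1 = Process("m1", is_monitor=True, children=leaves[:2])
    m2 = Process("m2", is_monitor=True, children=leaves[2:])
    return Process("root", is_monitor=True, children=[m1, m2])
```

In this assumed example the root monitoring process has two direct subordinates and four subordinate computing processes, while each intermediate monitoring process has two of each.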
12 Citations
10 Claims
1. A parallel computer system, comprising:
a plurality of computing nodes capable of executing a parallel program for generating a plurality of computing processes that perform respective predetermined computations and a plurality of monitoring processes,
the plurality of computing processes and the plurality of monitoring processes are allocated to the plurality of computing nodes,
the plurality of computing nodes form a monitoring hierarchical structure having two or more layers, each of the plurality of monitoring processes being capable of monitoring predetermined number of subordinate processes that are in a layer immediately lower than a layer that each of the plurality of monitoring processes exists, the subordinate processes including at least one of a monitoring process among the plurality of monitoring processes and a computing process among the plurality of computing processes,
each node among the plurality of computing nodes operating as each of the plurality of monitoring processes performs processing that changes the monitoring hierarchical structure based on a first target value and a second target value, the first target value serving as a target value for total number of subordinate computing processes of each of the plurality of monitoring processes, the subordinate computing processes being in layers lower than a layer that a corresponding monitoring process among the plurality of monitoring processes exists and the subordinate computing processes connecting to the corresponding processes directly or indirectly, the second value serving as a target value for number of the subordinate processes of each of the plurality of monitoring processes, the second value being equal to the predetermined number, and
the first target value is calculated using a formula of “total number of computing processes in the monitoring hierarchical structure / (the second target value)^n”, and the exponent “n” in the formula indicates a value of a layer that each of the plurality of monitoring processes exists in the monitoring hierarchical structure and the second target value is constant.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A non-transitory computer-readable recording medium having stored therein a program for causing a plurality of computing nodes to execute a process comprising:
generating a plurality of computing processes that perform respective predetermined computations and a plurality of monitoring processes, the plurality of computing processes and the plurality of monitoring processes are allocated to the plurality of computing nodes; and
forming a monitoring hierarchical structure having two or more layers, each of the plurality of monitoring processes being capable of monitoring predetermined number of subordinate processes that are in a layer immediately lower than a layer that each of the plurality of monitoring processes exists, the subordinate processes including at least one of a monitoring process among the plurality of monitoring processes and a computing process among the plurality of computing processes,
the program causing each node among the plurality of computing nodes operating as each of the plurality of monitoring processes to execute processing including performing processing that changes the monitoring hierarchical structure based on a first target value and a second target value, the first target value serving as a target value for total number of subordinate computing processes corresponding to each of the plurality of monitoring processes, the subordinate computing processes being in layers lower than a layer that a corresponding monitoring process among the plurality of monitoring processes exists and connecting to the corresponding monitoring process directly or indirectly, the second value serving as a target value for number of the subordinate processes of each of the plurality of monitoring processes, the second value being equal to the predetermined number,
the first target value is calculated using a formula of “total number of computing processes in the monitoring hierarchical structure / (the second target value)^n”, and the exponent “n” in the formula indicates a value of the layer that each of the plurality of monitoring processes exists in the monitoring hierarchical structure and the second target value is constant.
10. A method of controlling a parallel computer system including a plurality of computing nodes, the method comprising:
the plurality of computing nodes generating a plurality of computing processes that perform respective predetermined computations and a plurality of monitoring processes by executing a parallel program, the plurality of computing processes and the plurality of monitoring processes are allocated to the plurality of computing nodes;
the plurality of computing nodes forming a monitoring hierarchical structure having two or more layers, each of the plurality of monitoring processes being capable of monitoring predetermined number of subordinate processes that are in a layer immediately lower than a layer that each of the plurality of monitoring processes exists, the subordinate processes including at least one of a monitoring process among the plurality of monitoring processes and a computing process among the plurality of computing processes; and
each node among the plurality of computing nodes operating as each of the plurality of monitoring processes performing processing that changes the monitoring hierarchical structure based on a first target value and a second target value, the first target value serving as a target value for total number of subordinate computing processes corresponding to each of the plurality of monitoring processes, the subordinate computing processes being in layers lower than the layer that a corresponding monitoring process among the plurality of monitoring processes exists and the subordinate computing processes connecting to the corresponding monitoring process directly or indirectly, the second value serving as a target value for the number of subordinate processes of each of the plurality of monitoring processes, the second value being equal to the predetermined number,
the first target value is calculated using a formula of “total number of computing processes in the monitoring hierarchical structure / (the second target value)^n”, and the exponent “n” in the formula indicates a value of a layer that each of the plurality of monitoring processes exists in the monitoring hierarchical structure and the second target value is constant.
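The formula recited in the independent claims can be worked through numerically. The sketch below is illustrative only and rests on one assumption not stated in the claims: that layers are numbered from the top, with the root monitoring process at layer 0, so that the root's first target value equals the total number of computing processes. The function and parameter names are hypothetical.

```python
from typing import List


def first_target_value(total_computing: int, second_target: int, layer: int) -> float:
    """Return total_computing / second_target**layer.

    total_computing: number of computing processes in the whole hierarchy.
    second_target:   constant target for the number of direct subordinates.
    layer:           layer value n of the monitoring process
                     (ASSUMPTION: the root is layer 0).
    """
    if total_computing < 0 or second_target < 1 or layer < 0:
        raise ValueError("expected total_computing >= 0, second_target >= 1, layer >= 0")
    return total_computing / (second_target ** layer)


def targets_by_layer(total_computing: int, second_target: int, layers: int) -> List[float]:
    """List the first target value for layers 0 .. layers-1."""
    return [first_target_value(total_computing, second_target, n) for n in range(layers)]
```

Under that numbering assumption, with 64 computing processes and a second target value of 4, `targets_by_layer(64, 4, 3)` returns `[64.0, 16.0, 4.0]`: each step down the hierarchy divides the target number of subordinate computing processes by the constant second target value.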
Specification