Failure detection in high-performance clusters and computers using chaotic map computations
First Claim
1. A failure detection system comprising:
- an exascale machine comprising a supercomputer or a cluster capable of executing at least one exaFLOPS;
a plurality of processors that comprise part of the exascale machine programmed to execute a programming thread that generates one or more chaotic map trajectories;
where at least one of the plurality of processors is further programmed to detect a component failure through an automated comparison of signal trajectories generated by a plurality of chaotic maps; and
where processing threads are inserted into application code that traces the execution paths of at least one tracer code.
2 Assignments
0 Petitions
Accused Products
Abstract
A programmable media includes a processing unit capable of independent operation in a machine that is capable of executing 1018 floating point operations per second. The processing unit is in communication with a memory element and an interconnect that couples computing nodes. The programmable media includes a logical unit configured to execute arithmetic functions, comparative functions, and/or logical functions. The processing unit is configured to detect computing component failures, memory element failures and/or interconnect failures by executing programming threads that generate one or more chaotic map trajectories. The central processing unit or graphical processing unit is configured to detect a computing component failure, memory element failure and/or an interconnect failure through an automated comparison of signal trajectories generated by the chaotic maps.
-
Citations
19 Claims
-
1. A failure detection system comprising:
-
an exascale machine comprising a supercomputer or a cluster capable of executing at least one exaFLOPS; a plurality of processors that comprise part of the exascale machine programmed to execute a programming thread that generates one or more chaotic map trajectories; where at least one of the plurality of processors is further programmed to detect a component failure through an automated comparison of signal trajectories generated by a plurality of chaotic maps; and where processing threads are inserted into application code that traces the execution paths of at least one tracer code. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A failure detection system comprising:
-
an exascale machine comprising a supercomputer or a cluster capable of executing at least one exaFLOPS; a plurality of processors that comprise part of the exascale machine programmed to execute a programming thread that generates one or more chaotic map trajectories;
where at least one of the plurality of processors is further programmed to detect a component failure through an automated comparison of signal trajectories generated by a plurality of chaotic maps;
,where at least one of the plurality of processors is programmed to detect component failures through the execution of inverse-pair operations.
-
-
9. A failure detection system comprising:
-
an exascale machine comprising a supercomputer or a cluster capable of executing at least one exaFLOPS; a plurality of processors that comprise part of the exascale machine programmed to execute a programming thread that generates one or more chaotic map trajectories; where at least one of the plurality of processors is further programmed to detect a component failure through an automated con comparison of signal trajectories generated by a plurality of chaotic maps; where the plurality of processors render a plurality of pipeline chains.
-
-
10. A failure detection system comprising:
-
an exascale machine comprising a supercomputer or a cluster capable of executing at least one exaFLOPS; a plurality of processors that comprise art of the exascale machine program to execute a programming thread that generates one or more chaotic map trajectories; where at least one of the plurality of processors is further to programmed to detect a component failure through an automated comparison of signal trajectories generated by a plurality of chaotic maps; where trajectories are generated by identity maps.
-
-
11. A failure detection system comprising:
-
an exascale machine comprising a supercomputer or a cluster capable of executing at least one exaFLOPS; a plurality of processors that comprise part of the exascale machine programmed to execute a programming thread that generates one or more chaotic map trajectories; where at least one of the plurality of processors is further programmed to detect a component failure through an automated comparison of signal trajectories generated by a plurality of chaotic maps; and where the trajectories are generated by a chaotic Poincare map and an identity map.
-
-
12. A failure detection system comprising:
-
an exascale machine comprising supercomputer or a cluster capable of executing at least one exaFLOPS; a plurality of processors that comprise part of the exascale machine programmed to execute a programming thread that generates one or more chaotic map trajectories; where at least one of the plurality of processors is further programmed to detect a component failure through an automated comparison of signal trajectories generated by a plurality of chaotic maps; and where the chaotic map comprises a Poincare logistic map.
-
-
13. A failure detection system comprising:
-
an exascale machine comprising a supercomputer or a cluster capable of executing at least one exaFLOPS; a plurality of processors that comprise part of the exascale machine programmed to execute a programming thread that generates one or more chaotic map trajectories; where at least one of the plurality of processors is further rammed to detect a component failure through an automated comparison of signal trajectories generated by a plurality of chaotic maps; and where the chaotic map comprises a Poincare tent map.
-
-
14. A failure detection system comprising:
-
an exascale machine comprising a supercomputer or a cluster capable of executing at least one exaFLOPS; a plurality of processors that comprise part of the exascale machine programmed to execute a programming thread that generates one or more chaotic map trajectories; where at least one of the plurality of processors is further programmed to detect a component failure through an automated comparison of signal trajectories generated by a plurality of chaotic maps; and where the chaotic map comprises a Poincare Henon map.
-
-
15. A programmable media comprising:
-
a single or multi-core central processing unit or a graphical processing unit, capable of independent operation in an exascale machine in communication with a memory element and an interconnect that couples computing elements; at least one logical unit configured to execute arithmetic, comparative functions, and logical functions; the central processing unit or graphical processing unit configured to detect one or more computing node or interconnect failures through programming threads that generate one or more chaotic map trajectories and being configured to detect a computing node failure or an interconnect failure through an automated comparison of signal trajectories generated by the one or more chaotic maps; where the single or multi-core central processing unit or graphical processing unit is programmed to render a plurality of pipeline chains. - View Dependent Claims (16, 17)
-
-
18. A programmable media comprising:
-
a single or multi-core central processing unit or a graphical processing unit, capable of independent operation in an exascale machine in communication with a memory element and an interconnect that couples computing elements; at least one logical unit configured to execute arithmetic, comparative functions, and logical functions; the central processing unit or graphical .processing unit configured to detect one or more computing node or interconnect failures through programming threads that generate one or more chaotic map trajectories and being configured to detect a computing node failure or an interconnect failure through an automated comparison of signal trajectories generated by the one or more chaotic maps; and where the single or multi-core central processing unit or graphical processing unit is programmed to detect the computing node failure and the interconnect failure through the execution of inverse-pair operations.
-
-
19. A. programmable media comprising:
-
a single or multi-core central processing unit or a graphical processing unit, capable of independent operation in an exascale machine in communication with a memory element and an interconnect that couples computing elements; at least one logical unit configured to execute arithmetic, comparative functions, and logical functions; the central processing unit or graphical processing unit configured to detect one or more computing node or interconnect failures through programming threads that generate one or more chaotic map trajectories and being configured to detect a computing node failure or an interconnect failure through an automated comparison of signal trajectories generated by the one or more chaotic map; where the central processing unit or graphical processing unit is configured to execute tracer threads inserted into application code that generates the execution paths of at least one tracer code.
-
Specification