Method and apparatus for identifying a failure mechanism for a component in a computer system
First Claim
1. A method for identifying a failure mechanism for a component, comprising:
- constructing a library of failure signatures from a set of components, wherein each failure signature in the library characterizes a known failure mechanism associated with a component in the set of components, and the failure signature is stored at a high-resolution;
receiving a telemetry signal which is measured from a component under surveillance, wherein the telemetry signal is measured at a low-resolution;
determining if there is an anomaly in the telemetry signal, and if so;
performing pattern-matching operations between the telemetry signal and failure signatures in the library; and
if a match is found, identifying the failure mechanism for the component under surveillance based on the matching failure signature,wherein the pattern-matching operations recall the high-resolution failure signature based on the low-resolution telemetry signal.
2 Assignments
0 Petitions
Accused Products
Abstract
One embodiment of the present invention provides a system that identifies a failure mechanism for a component. The system first constructs a library of failure signatures from a set of components, wherein each failure signature in the library characterizes a known failure mechanism associated with a component in the set of components. Next, the system receives a telemetry signal which is measured from a component under surveillance. The system then determines if there is an anomaly in the telemetry signal. If so, the system performs pattern-matching operations between the telemetry signal and failure signatures in the library. Next, if a match is found, the system identifies the failure mechanism for the component under surveillance based on the matching failure signature.
21 Citations
15 Claims
-
1. A method for identifying a failure mechanism for a component, comprising:
-
constructing a library of failure signatures from a set of components, wherein each failure signature in the library characterizes a known failure mechanism associated with a component in the set of components, and the failure signature is stored at a high-resolution; receiving a telemetry signal which is measured from a component under surveillance, wherein the telemetry signal is measured at a low-resolution; determining if there is an anomaly in the telemetry signal, and if so; performing pattern-matching operations between the telemetry signal and failure signatures in the library; and if a match is found, identifying the failure mechanism for the component under surveillance based on the matching failure signature, wherein the pattern-matching operations recall the high-resolution failure signature based on the low-resolution telemetry signal. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer-readable storage medium storing instructions that when executed by a computer cause the computer to perform a method for identifying a failure mechanism for a component, the method comprising:
-
constructing a library of failure signatures from a set of components, wherein each failure signature in the library characterizes a known failure mechanism associated with a component in the set of components, and the failure signature is stored at a high-resolution; receiving a telemetry signal which is measured from a component under surveillance, wherein the telemetry signal is measured at a low-resolution; determining if there is an anomaly in the telemetry signal, and if so; performing pattern-matching operations between the telemetry signal and failure signatures in the library; and if a match is found, identifying the failure mechanism for the component under surveillance based on the matching failure signature, wherein the pattern-matching operations recall the high-resolution failure signature based on the low-resolution telemetry signal. - View Dependent Claims (7, 8, 9, 10)
-
-
11. An apparatus that identifies a failure mechanism for a component, comprising:
-
a construction mechanism configured to construct a library of failure signatures from a set of components, wherein each failure signature in the library characterizes a known failure mechanism associated with a component in the set of components, and the failure signature is stored at a high-resolution; a receiving mechanism configured to receive a telemetry signal which is measured from a component under surveillance, wherein the telemetry signal is measured at a low-resolution; a determination mechanism configured to determine if there is an anomaly in the telemetry signal, wherein the determination mechanism is further configured to; perform pattern-matching operations between the telemetry signal and failure signatures in the library; and if a match is found, to identify a failure mechanism for the component under surveillance based on the matching failure signature, wherein the pattern-matching operations recall the high-resolution failure signature based on the low-resolution telemetry signal. - View Dependent Claims (12, 13, 14, 15)
-
Specification