COMPUTER SYSTEM, METHOD OF DETECTING SYMPTOM OF FAILURE IN COMPUTER SYSTEM, AND PROGRAM
First Claim
1. A computer system, comprising:
a computer comprising:
a processor for carrying out an arithmetic operation; and
a memory for storing an application and an OS which are executed by the processor;
a plurality of sensors each provided to a component of hardware of the computer, for measuring a status quantity of the component; and
a failure symptom detection unit for detecting a symptom of a failure in the hardware based on a measurement of each of the plurality of sensors,
wherein the failure symptom detection unit comprises:
an operation information acquisition unit for acquiring, from the OS, load information on the processor used for the application;
a sensor information processing unit for acquiring the measurement from the each of the plurality of sensors for each component;
a characteristic data storage unit for associating, in advance, each load information on the processor when the application is executed and the measurement of the each of the plurality of sensors for the each component when the application is executed with each other, and storing the associated load information and the associated measurement as characteristic information on the application;
a failure symptom determination processing unit for obtaining, from current load information acquired by the operation information acquisition unit and the characteristic information corresponding to the application, an estimation of the status quantity of the each component, which corresponds to the current load information, obtaining, from the sensor information processing unit, a current status quantity as a current value for the each component, and comparing, for the each component, an absolute value of a difference between the estimation and the current value with a permissible error set in advance, to thereby determine, when the absolute value of the difference is equal to or more than the permissible error, that the symptom of the failure is present; and
a failed location determination processing unit for identifying the component having the absolute value of the difference equal to or more than the permissible error as a component in which the symptom of the failure is present.
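The determination and failed-location steps recited above reduce to a per-component threshold test. The following is a minimal sketch of that test, not the patented implementation: the class name `ComponentCheck`, the function `failed_locations`, the component names, and all numeric values are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ComponentCheck:
    """One component's estimate, current sensor value, and permissible error.

    All field names are assumptions made for this sketch.
    """
    component: str
    estimate: float           # status quantity expected for the current load
    current: float            # status quantity currently measured by the sensor
    permissible_error: float  # threshold set in advance

    @property
    def deviation(self) -> float:
        # Absolute value of the difference between estimate and current value.
        return abs(self.estimate - self.current)

    @property
    def has_symptom(self) -> bool:
        # "Equal to or more than": a deviation exactly at the threshold counts.
        return self.deviation >= self.permissible_error


def failed_locations(checks):
    """Return the names of components whose deviation reaches the threshold."""
    return [c.component for c in checks if c.has_symptom]


# Hypothetical readings for three components.
checks = [
    ComponentCheck("cpu_temp_c", estimate=60.0, current=72.0, permissible_error=10.0),
    ComponentCheck("fan_rpm", estimate=3000.0, current=2950.0, permissible_error=200.0),
    ComponentCheck("psu_volt", estimate=12.0, current=11.5, permissible_error=0.5),
]
print(failed_locations(checks))  # ['cpu_temp_c', 'psu_volt']
```

Note that `psu_volt` is flagged because its deviation equals the permissible error exactly, which matches the "equal to or more than" wording of the claim.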
Abstract
Provided is a computer system comprising: a failure symptom detection unit for detecting a symptom of a failure in hardware of a computer based on a measurement of a sensor; and a plurality of the sensors, each provided to a component of the hardware, for measuring a status quantity of the component. The failure symptom detection unit comprises a failure symptom determination processing unit for obtaining, from characteristic information for each application, an estimation of the status quantity of each component, which corresponds to current load information, obtaining a current status quantity as a current value for each component, and determining, when an absolute value of a difference between the estimation and the current value is equal to or more than a permissible error, that the symptom of the failure is present.
15 Claims
1. A computer system, comprising:
a computer comprising:
a processor for carrying out an arithmetic operation; and
a memory for storing an application and an OS which are executed by the processor;
a plurality of sensors each provided to a component of hardware of the computer, for measuring a status quantity of the component; and
a failure symptom detection unit for detecting a symptom of a failure in the hardware based on a measurement of each of the plurality of sensors,
wherein the failure symptom detection unit comprises:
an operation information acquisition unit for acquiring, from the OS, load information on the processor used for the application;
a sensor information processing unit for acquiring the measurement from the each of the plurality of sensors for each component;
a characteristic data storage unit for associating, in advance, each load information on the processor when the application is executed and the measurement of the each of the plurality of sensors for the each component when the application is executed with each other, and storing the associated load information and the associated measurement as characteristic information on the application;
a failure symptom determination processing unit for obtaining, from current load information acquired by the operation information acquisition unit and the characteristic information corresponding to the application, an estimation of the status quantity of the each component, which corresponds to the current load information, obtaining, from the sensor information processing unit, a current status quantity as a current value for the each component, and comparing, for the each component, an absolute value of a difference between the estimation and the current value with a permissible error set in advance, to thereby determine, when the absolute value of the difference is equal to or more than the permissible error, that the symptom of the failure is present; and
a failed location determination processing unit for identifying the component having the absolute value of the difference equal to or more than the permissible error as a component in which the symptom of the failure is present.
Dependent claims: 2, 3, 4, 5
6. A method of detecting a symptom of a failure in a computer system comprising:
a computer comprising:
a processor for carrying out an arithmetic operation; and
a memory for storing an application and an OS which are executed by the processor; and
a plurality of sensors each provided to a component of hardware of the computer, for measuring a status quantity of the component,
the symptom of the failure in the hardware being detected based on a measurement of each of the plurality of sensors,
the method comprising:
acquiring, by the processor from the OS, load information on the processor used for the application when the processor executes the application;
acquiring, by the processor, the measurement of the each of the plurality of sensors for the each component when the processor executes the application;
associating, by the processor in advance, each load information when the application is executed and the measurement of the each of the plurality of sensors for the each component when the application is executed with each other, and storing, in a storage system, the associated load information and the associated measurement as characteristic information on the application;
acquiring, by the processor from the OS, current load information on the processor used for the application, and obtaining, from the characteristic information corresponding to the application, an estimation of the status quantity of the each component, which corresponds to the current load information;
acquiring, by the processor from the each of the plurality of sensors, a current status quantity as a current value for the each component;
comparing, by the processor for the each component, an absolute value of a difference between the estimation and the current value with a permissible error set in advance, to thereby determine, when the absolute value of the difference is equal to or more than the permissible error, that the symptom of the failure is present; and
identifying, by the processor, the component having the absolute value of the difference equal to or more than the permissible error as a component in which the symptom of the failure is present.
Dependent claims: 7, 8, 9, 10
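The method above first associates load information with sensor measurements in advance, then derives an estimation from the current load. The claim does not say how the estimation is computed from the stored pairs, so the sketch below assumes linear interpolation between recorded samples, clamped at the ends of the recorded load range. The class `CharacteristicData`, its methods, and the application and component names are hypothetical.

```python
from bisect import bisect_left


class CharacteristicData:
    """Per-application (load, measurement) samples for each component.

    A hypothetical stand-in for the stored "characteristic information".
    """

    def __init__(self):
        # application -> component -> list of (load, measurement), kept sorted
        self._samples = {}

    def record(self, app, load, measurements):
        """Associate one load value with the measurements taken at that load."""
        per_app = self._samples.setdefault(app, {})
        for component, value in measurements.items():
            samples = per_app.setdefault(component, [])
            samples.append((load, value))
            samples.sort()

    def estimate(self, app, component, current_load):
        """Estimate the status quantity for current_load.

        Assumption: linear interpolation between the two nearest recorded
        loads, clamped to the first or last sample outside the recorded range.
        """
        samples = self._samples.get(app, {}).get(component)
        if not samples:
            raise KeyError(f"no characteristic information for {app!r}/{component!r}")
        loads = [load for load, _ in samples]
        i = bisect_left(loads, current_load)
        if i == 0:
            return samples[0][1]
        if i == len(samples):
            return samples[-1][1]
        (x0, y0), (x1, y1) = samples[i - 1], samples[i]
        # bisect_left guarantees x0 < current_load <= x1, so x1 - x0 > 0.
        return y0 + (y1 - y0) * (current_load - x0) / (x1 - x0)


# Hypothetical learning phase: two samples for one application.
data = CharacteristicData()
data.record("db_server", 20.0, {"cpu_temp_c": 45.0})
data.record("db_server", 80.0, {"cpu_temp_c": 75.0})
print(data.estimate("db_server", "cpu_temp_c", 50.0))  # 60.0
```

Other estimators, such as a nearest-sample lookup or a fitted curve, would fit the claim language equally well; interpolation is used here only because it is short and easy to verify by hand.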
11. A machine-readable medium for storing a program for detecting a symptom of a failure in a computer system comprising:
a computer comprising:
a processor for carrying out an arithmetic operation; and
a memory for storing an application and an OS which are executed by the processor; and
a plurality of sensors each provided to a component of hardware of the computer, for measuring a status quantity of the component,
the symptom of the failure in the hardware being detected based on a measurement of each of the plurality of sensors,
the program controlling the computer to execute the procedures of:
acquiring, from the OS, load information on the processor used for the application when the application is executed;
acquiring the measurement of the each of the plurality of sensors for the each component when the application is executed;
associating, in advance, each load information when the application is executed and the measurement of the each of the plurality of sensors for the each component when the application is executed with each other, and storing, in a storage system, the associated load information and the associated measurement as characteristic information on the application;
acquiring, from the OS, current load information on the processor used for the application, and obtaining, from the characteristic information corresponding to the application, an estimation of the status quantity of the each component, which corresponds to the current load information;
acquiring, from the each of the plurality of sensors, a current status quantity as a current value for the each component;
comparing, for the each component, an absolute value of a difference between the estimation and the current value with a permissible error set in advance, to thereby determine, when the absolute value of the difference is equal to or more than the permissible error, that the symptom of the failure is present; and
identifying the component having the absolute value of the difference equal to or more than the permissible error as a component in which the symptom of the failure is present.
Dependent claims: 12, 13, 14, 15
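The program claim stores the characteristic information "in a storage system" without naming a format. As one possible illustration, the sketch below persists a per-application table to a JSON file and reads it back. The file name, the table layout, and the function names `save_characteristics` and `load_characteristics` are assumptions made for this example only.

```python
import json
import os
import tempfile


def save_characteristics(path, table):
    """Write the table to disk as JSON.

    Assumed layout: {application: {component: [[load, measurement], ...]}}.
    """
    with open(path, "w", encoding="utf-8") as f:
        json.dump(table, f, indent=2, sort_keys=True)


def load_characteristics(path):
    """Read a table previously written by save_characteristics."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


# Hypothetical table recorded in advance for one application.
table = {"db_server": {"cpu_temp_c": [[20.0, 45.0], [80.0, 75.0]]}}

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "characteristics.json")
    save_characteristics(path, table)
    restored = load_characteristics(path)

print(restored == table)  # True
```

Samples are stored as two-element lists rather than tuples because JSON has no tuple type; a round trip therefore returns the same structure that was saved.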
Specification