Multi-node computer system component proactive monitoring and proactive repair
First Claim
Patent Images
1. A data storage system comprising:
- a plurality of metadata server machines each configured to store metadata for a plurality of files that are stored in the system;
a plurality of slice server machines configured to store slices of said files at locations, in the slice server machines, indicated by the metadata;
a packet switching interconnect configured to communicatively couple the metadata and slice servers;
a distributed file system configured to execute in the metadata and slice server machines, the file system to hide complexity of the data storage system from clients; and
a storage device storing sequences of instructions which when executed by a processor cause monitoring a wear level of a hardware component of the data storage system and sending a notification in response to determining that the wear level has reached a predetermined level;
wherein the wear level is based, at least in part, on an amount of usage of the hardware component and wherein the wear level is independent of detected failure of the hardware component;
wherein the predetermined level is based at least in part on a predicted end of life for the hardware component;
wherein the wear level corresponds to a number of power up to power down cycles of the hardware component.
6 Assignments
0 Petitions
Accused Products
Abstract
A highly available multi-node computer system is operated by monitoring the aging and usage of a plurality of hardware components that are part of the system'"'"'s networked nodes. While monitoring the components, a determination is made that one of the components has aged, worn, or both, to a level that is selected as being close enough to the component'"'"'s predicted end of life in the system so as to prevent failure of the component in the system. A notification is sent to replace the component, in response to the determination. Other embodiments are also described and claimed.
53 Citations
20 Claims
-
1. A data storage system comprising:
-
a plurality of metadata server machines each configured to store metadata for a plurality of files that are stored in the system; a plurality of slice server machines configured to store slices of said files at locations, in the slice server machines, indicated by the metadata; a packet switching interconnect configured to communicatively couple the metadata and slice servers; a distributed file system configured to execute in the metadata and slice server machines, the file system to hide complexity of the data storage system from clients; and a storage device storing sequences of instructions which when executed by a processor cause monitoring a wear level of a hardware component of the data storage system and sending a notification in response to determining that the wear level has reached a predetermined level; wherein the wear level is based, at least in part, on an amount of usage of the hardware component and wherein the wear level is independent of detected failure of the hardware component; wherein the predetermined level is based at least in part on a predicted end of life for the hardware component; wherein the wear level corresponds to a number of power up to power down cycles of the hardware component. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A computer-implemented method comprising:
-
monitoring a wear level of a hardware component of a data storage system; and sending a notification in response to the wear level reaching a predetermined level; wherein the wear level is based, at least in part, on an amount of usage of the hardware component and wherein the wear level is independent of detected failure of the hardware component; wherein the predetermined level is based at least in part on a predicted end of life for the hardware component; wherein the wear level corresponds to a number of power up to power down cycles of the hardware component. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification