Temperature management in a data storage system
Abstract
Techniques are described for managing temperatures within a data storage system by selectively interrupting power to one or more components of the data storage system. Temperature sensors measure the temperature of various components included in racks of a data center, such as data storage modules, backplanes of data storage modules, or mass storage devices coupled to backplanes. A control device may determine that a thermal event, such as a higher than threshold temperature, is occurring in one or more components. The control device may emit signal(s) to instruct power distribution unit(s) to selectively interrupt or reduce the power sent to those component(s) exhibiting the thermal event. The components may also be instructed to reduce a number of operations being performed. In some cases, fan speeds may be selectively adjusted to cool the component(s) and thus mitigate the thermal event. Power consumption may be employed to infer the temperature of component(s).
17 Claims
1. A system, comprising:
a rack;
a data storage module coupled to the rack, the data storage module comprising:
a first backplane;
at least one first mass storage device coupled to the first backplane;
at least one second backplane;
at least one second mass storage device coupled to the at least one second backplane; and
a control device configured to:
access status information indicating:
a first current temperature of at least one of the first backplane or the at least one first mass storage device;
a second current temperature of at least one of the at least one second backplane or the at least one second mass storage device;
determine that the first current temperature exceeds a first threshold temperature and that the second current temperature does not exceed the first threshold temperature;
responsive to the first current temperature exceeding the first threshold temperature, emit at least one signal to cause a shutdown of the at least one first mass storage device coupled to the first backplane while the at least one second mass storage device remains operative; and
subsequent to the shutdown of the at least one first mass storage device:
responsive to a determination that the first current temperature is greater than a second threshold temperature and less than the first threshold temperature, wherein the second threshold temperature is less than the first threshold temperature, maintain the shutdown of the at least one first mass storage device; and
responsive to a determination that the first current temperature is less than the second threshold temperature, emit at least another signal to cause a restart of the at least one first mass storage device.
Dependent claims: 2, 3
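The two-threshold behavior recited in claim 1 (shut down above the first threshold, stay shut down between the thresholds, restart below the second threshold) is a form of hysteresis. The following is a minimal sketch of that logic for a single backplane, not the patented implementation. The names `BackplaneState` and `step`, the signal strings, and the 60/45 degree thresholds are illustrative assumptions that do not appear in the claims. The claim is silent on readings exactly equal to a threshold; the sketch treats those as "no change".

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class BackplaneState:
    """Hypothetical per-backplane record; not a structure named in the patent."""
    name: str
    shut_down: bool = False


def step(state: BackplaneState, temp_c: float,
         first_threshold_c: float, second_threshold_c: float) -> Optional[str]:
    """Return the signal to emit for one temperature reading, or None.

    Assumed signal names: "shutdown" and "restart".
    """
    if second_threshold_c >= first_threshold_c:
        raise ValueError("second threshold must be below the first threshold")

    if not state.shut_down:
        # Operating normally: only a reading above the first threshold acts.
        if temp_c > first_threshold_c:
            state.shut_down = True
            return "shutdown"
        return None

    # Already shut down: restart only once the reading falls below the
    # second (lower) threshold; otherwise maintain the shutdown.
    if temp_c < second_threshold_c:
        state.shut_down = False
        return "restart"
    return None


if __name__ == "__main__":
    first = BackplaneState("first backplane")
    second = BackplaneState("second backplane")
    for t_first, t_second in [(50, 50), (62, 50), (55, 50), (44, 50)]:
        print(t_first, step(first, t_first, 60.0, 45.0),
              t_second, step(second, t_second, 60.0, 45.0))
```

Because each backplane carries its own state, the second backplane's devices keep running while the first is shut down, which mirrors the "remains operative" limitation.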
4. A system, comprising:
one or more data storage modules configured to couple to a rack, the one or more data storage modules comprising one or more backplanes configured to couple to one or more mass storage devices; and
a control device configured to:
determine a temperature of one or more of the rack, the one or more data storage modules, the one or more backplanes, or the one or more mass storage devices;
responsive to the temperature exceeding a first threshold temperature, determine a component group comprising one or more of the rack, the one or more data storage modules, the one or more backplanes, or the one or more mass storage devices;
emit at least one signal to cause at least one action for lowering the temperature, the at least one action comprising one or more of:
a shutdown of the component group, wherein at least one of the one or more mass storage devices remains active after the shutdown of the component group;
a reduction in a number of operations performed in the component group;
a speed adjustment for at least one cooling component arranged to move air in proximity to the component group; or
a reduction in power supplied to the component group; and
subsequent to performance of the at least one action:
responsive to a determination that the temperature is greater than a second threshold temperature and less than the first threshold temperature, wherein the second threshold temperature is less than the first threshold temperature, maintain the at least one action; and
responsive to a determination that the temperature is less than the second threshold temperature, emit at least another signal to cause at least a partial reversal of the at least one action.
Dependent claims: 5, 6, 7, 8, 9, 10, 11, 12
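Claim 4 allows graded actions (fan speed, power reduction) and "at least a partial reversal" once the temperature falls below the second threshold. One way to read that is sketched below, purely as an illustration: the `GroupSettings` fields, the baseline and mitigated values, and the "move halfway back to baseline" rule are all assumptions chosen for the example, not details taken from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GroupSettings:
    """Hypothetical settings for one component group."""
    fan_speed_pct: int   # cooling fan speed, percent of maximum
    power_cap_pct: int   # power supplied, percent of nominal


# Assumed values for illustration only.
BASELINE = GroupSettings(fan_speed_pct=40, power_cap_pct=100)
MITIGATED = GroupSettings(fan_speed_pct=100, power_cap_pct=60)


def partially_reverse(current: GroupSettings,
                      fraction: float = 0.5) -> GroupSettings:
    """Move each setting a fraction of the way back toward BASELINE."""
    def toward(cur: int, base: int) -> int:
        return round(cur + (base - cur) * fraction)

    return GroupSettings(
        fan_speed_pct=toward(current.fan_speed_pct, BASELINE.fan_speed_pct),
        power_cap_pct=toward(current.power_cap_pct, BASELINE.power_cap_pct),
    )


def next_settings(current: GroupSettings, temp_c: float,
                  first_threshold_c: float,
                  second_threshold_c: float) -> GroupSettings:
    """Choose the settings to apply after one temperature reading."""
    if temp_c > first_threshold_c:
        # Thermal event: apply the mitigating actions.
        return MITIGATED
    if temp_c < second_threshold_c and current != BASELINE:
        # Cooled below the lower threshold: partially undo the actions.
        return partially_reverse(current)
    # Between the thresholds (or already at baseline): maintain as-is.
    return current
```

Stepping back gradually rather than restoring the baseline in one move is a design choice of this sketch; the claim language would equally cover a full reversal.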
13. A method, comprising:
accessing status information indicating a temperature of one or more components of a data storage system, the one or more components including one or more of: a mass storage device, a backplane coupled to the mass storage device, or a data storage module that includes the backplane;
based on the status information, detecting a thermal event in the data storage system, wherein the thermal event is associated with the temperature exceeding a first threshold temperature;
determining a component group comprising the at least one of the one or more components associated with the thermal event; and
emitting at least one signal to cause at least one action in response to the thermal event, the at least one action comprising one or more of:
a shutdown of the component group, wherein at least one of the one or more mass storage devices remains active after the shutdown of the component group;
a reduction in power supplied to the component group;
a reduction of activity in the component group; or
a speed adjustment to increase a speed of at least one cooling component arranged to move air in proximity to the component group;
subsequent to causing the at least one action:
responsive to a determination that the temperature is between the first threshold temperature and a second threshold temperature, maintaining the at least one action, wherein the first threshold temperature is greater than the second threshold temperature; and
responsive to a determination that the temperature is less than the second threshold temperature, emitting at least another signal to cause at least partial reversal of the at least one action.
Dependent claims: 14, 15, 16, 17
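The first steps of claim 13 (access status information, detect a thermal event, determine the component group, pick an action) can be sketched as follows. This is an illustrative reading only: the nested-dictionary status format, the function names, the action strings, and the rule of falling back to a fan speed increase when a shutdown would leave no mass storage device active are assumptions made for the example.

```python
from typing import Dict, List, Optional

# Assumed status format: backplane id -> {device id -> temperature in deg C}.
Status = Dict[str, Dict[str, float]]


def detect_thermal_event(status: Status,
                         first_threshold_c: float) -> List[str]:
    """Return the backplane ids whose hottest device exceeds the threshold."""
    return sorted(
        bp for bp, temps in status.items()
        if temps and max(temps.values()) > first_threshold_c
    )


def plan_action(status: Status,
                first_threshold_c: float) -> Optional[dict]:
    """Determine the component group and an action for it, or None."""
    hot_backplanes = detect_thermal_event(status, first_threshold_c)
    if not hot_backplanes:
        return None  # no thermal event detected

    # Component group: every device on a backplane showing the event.
    group = sorted(dev for bp in hot_backplanes for dev in status[bp])
    total_devices = sum(len(temps) for temps in status.values())

    # A shutdown must leave at least one mass storage device active;
    # otherwise this sketch falls back to a cooling action instead.
    action = "shutdown" if len(group) < total_devices else "increase_fan_speed"
    return {"group": group, "action": action}


if __name__ == "__main__":
    readings = {"bp0": {"d0": 63.0, "d1": 58.0}, "bp1": {"d2": 50.0}}
    print(plan_action(readings, 60.0))
```

Grouping by backplane is one possible granularity; the claim also permits groups at the level of a single device or a whole data storage module.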
Specification