Storage system and control method of storage system
First Claim
Patent Images
1. A storage system for providing data storage service to a host apparatus, comprising:
- a host interface unit configured to be connectable to the host apparatus;
a disk interface unit configured to be connectable to a disk array unit having a volume storing data formed therein;
a cache memory unit connected to the host interface unit and the disk interface unit, the cache memory unit configured to cache user data;
a switch LSI connected respectively to the host interface unit, the disk interface unit, and the cache memory unit;
a controller having a plurality of processors and connected to the switch LSI,wherein the switch LSI cuts off an I/O path with the controller based on an error signal output when an error occurs in one component in the controller;
one or more signal lines configured to report an error, each of the signal lines providing an additional path to report the error, other than the I/O path;
a non-volatile memory configured to store information indicating a failed component, wherein the stored information is referred to when determining whether the controller is operable, wherein the controller is operable if a component includes a plurality of elements and one or more of the elements are operable or if a component includes one element and the one element is operable; and
wherein the controller, after performing error handling under control of one of the plurality of processors, commands the switch LSI to cancel the cutoff.
1 Assignment
0 Petitions
Accused Products
Abstract
A fault-tolerant storage system is provided. The storage system is composed of a controller having a plurality of processors and other units. When an error occurs in any one of the components in the controller, the storage system cuts off an I/O path of the controller, specifies the failed component in the cutoff status, and invalidates the failed component. After invalidating the failed component, the storage system determines whether it is operable only with the normal components, cancels (releases) the cutoff of the I/O path when it determines that it is operable, and resumes operation by rebooting.
16 Citations
20 Claims
-
1. A storage system for providing data storage service to a host apparatus, comprising:
-
a host interface unit configured to be connectable to the host apparatus; a disk interface unit configured to be connectable to a disk array unit having a volume storing data formed therein; a cache memory unit connected to the host interface unit and the disk interface unit, the cache memory unit configured to cache user data; a switch LSI connected respectively to the host interface unit, the disk interface unit, and the cache memory unit; a controller having a plurality of processors and connected to the switch LSI, wherein the switch LSI cuts off an I/O path with the controller based on an error signal output when an error occurs in one component in the controller; one or more signal lines configured to report an error, each of the signal lines providing an additional path to report the error, other than the I/O path; a non-volatile memory configured to store information indicating a failed component, wherein the stored information is referred to when determining whether the controller is operable, wherein the controller is operable if a component includes a plurality of elements and one or more of the elements are operable or if a component includes one element and the one element is operable; and wherein the controller, after performing error handling under control of one of the plurality of processors, commands the switch LSI to cancel the cutoff. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method of controlling a storage system for providing data storage service to a host apparatus, comprising:
-
performing the data storage service under control of one of a plurality of processors of at least one controller; detecting an error signal output when an error occurs in one component in the controller, the determining performed under control of a switch LSI; providing one or more signal lines configured to report an error, each of the signal lines providing an additional path to report the error, other than the I/O path; cutting off, an I/O path with the controller based on the detected error, the cutting off performed under control of the switch LSI; performing error handling while the I/O path is being cut off, the error handling performed under control of the controller; commanding the switch LSI to cancel the cutoff, the commanding performed under control of the controller; storing information indicating a failed component in a non-volatile memory; and referring to the stored information to determine whether the controller is operable, wherein the controller is operable if a component includes a plurality of elements and one or more of the elements are operable or if a component includes one element and the one element is operable. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A unit having a plurality of components composing a storage system for providing data storage service to a host apparatus, comprising:
-
an error detection unit configured to detect an occurrence of an error in any one of the plurality of components; a cutoff processing unit configured to cut off an I/O path with an external unit based on the detected occurrence of an error; an error specification unit configured to specify the failed component while the I/O path is being cut off; one or more signal lines configured to report an error, each of the signal lines providing an additional path to report the error, other than the I/O path; a non-volatile memory configured to store information indicating a failed component, wherein the stored information is referred to when determining whether the controller is operable, wherein the controller is operable if a component includes a plurality of elements and one or more of the elements are operable or if a component includes one element and the one element is operable; an inactive status setting unit configured to set the specified failed component to an inactive status; and wherein the cutoff processing unit cancels the cutoff after the inactive status is set.
-
Specification