System and method for distributed management of shared computers
First Claim
1. A method comprising:
- monitoring, at a co-location facility, hardware operations of a cluster of computers located at the co-location facility;
detecting a hardware failure in one of the computers in the cluster; and
performing an act, in response to detecting the hardware failure, to correct the hardware failure.
3 Assignments
0 Petitions
Accused Products
Abstract
A multi-tiered server management architecture is employed including an application development tier, an application operations tier, and a cluster operations tier. In the application development tier, applications are developed for execution on one or more server computers. In the application operations tier, execution of the applications is managed and sub-boundaries within a cluster of servers can be established. In the cluster operations tier, operation of the server computers is managed without concern for what applications are executing on the one or more server computers and boundaries between clusters of servers can be established. The multi-tiered server management architecture can also be employed in co-location facilities where clusters of servers are leased to tenants, with the tenants implementing the application operations tier and the facility owner (or operator) implementing the cluster operations tier.
114 Citations
22 Claims
-
1. A method comprising:
-
monitoring, at a co-location facility, hardware operations of a cluster of computers located at the co-location facility;
detecting a hardware failure in one of the computers in the cluster; and
performing an act, in response to detecting the hardware failure, to correct the hardware failure. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method comprising:
-
monitoring, from a location remote from a co-location facility, software operations of a cluster of computers located at the co-location facility;
detecting a software failure in one of the computers in the cluster; and
performing an act, in response to detecting the software failure, to correct the hardware failure. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. One or more computer-readable media having stored thereon a computer program that, when executed by one or more processors, causes the one or more processors to perform acts including:
-
monitoring, from a location remote from a co-location facility, software operations of a cluster of computers located at the co-location facility; and
taking corrective action in response to a failure in operation of software executing on one of the computers in the cluster. - View Dependent Claims (19, 20, 21, 22)
-
Specification