On-demand instantiation in a high-performance computing (HPC) system
1 Assignment
0 Petitions
Abstract
In one embodiment, a method for on-demand instantiation in a high-performance computing (HPC) system includes receiving a connection request from a client specifying a first port number and a first host name advertised externally with respect to an HPC server including a cluster of nodes, identifying a service at the HPC server corresponding to the first port number and the first host name, determining whether the identified service is available, and, if the identified service is available, instantiating a host providing the identified service at one or more nodes in the cluster.
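The flow in the abstract can be sketched as a lookup keyed by the externally advertised host name and port, followed by an availability check and an on-demand instantiation. This is a minimal illustration, not the patented implementation: the catalog contents, host names, node names, and the `handle_connection` function are all hypothetical, and "instantiating a host" is reduced to taking a node from a free list.

```python
from dataclasses import dataclass, field


@dataclass
class Service:
    name: str
    available: bool = True                      # stand-in for the availability decision
    nodes: list = field(default_factory=list)   # nodes currently hosting the service


# Hypothetical catalog keyed by the advertised (host name, port number).
CATALOG = {
    ("sim.example.org", 8080): Service("simulation"),
    ("db.example.org", 5432): Service("database", available=False),
}

# Hypothetical pool of cluster nodes not yet providing any service.
FREE_NODES = ["node-0", "node-1", "node-2"]


def handle_connection(host_name, port):
    """Return the node that serves the request, or None if it is refused."""
    service = CATALOG.get((host_name, port))
    if service is None or not service.available:
        return None
    if not service.nodes:
        if not FREE_NODES:
            return None
        # On-demand step: no host exists yet, so one is instantiated now.
        service.nodes.append(FREE_NODES.pop(0))
    return service.nodes[0]
```

In this sketch a second request for the same host name and port reuses the node chosen for the first one, so a node is only consumed when a service is first requested.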
130 Citations
29 Claims
1. A computer-readable medium comprising software that, when executed by a processor, operates to:
- receive a connection request from a client specifying a first port number and a first host name, the first port number and the first host name advertised externally with respect to a plurality of nodes, each node comprising a switching fabric integrated to a card and at least two processors integrated to the card;
- identify a service at the plurality of nodes, the identified service corresponding to the first port number and the first host name;
- determine whether the identified service is available based on one or more aspects of the connection request and one or more rules indicating whether the service is available;
- if the identified service is determined to be available based on one or more aspects of the connection request and one or more rules indicating whether the service is available, instantiate a host providing the identified service at one or more nodes in the plurality of nodes, wherein instantiating the host comprises using a boot image, a file system, and an operating system (OS) configuration file corresponding to the identified service to boot at least one node in the plurality of nodes in response to the received connection request;
- cause a first node of the plurality of nodes to stop providing a second service in response to not detecting one or more requests related to the second service;
- cause the first node of the plurality of nodes to provide a third service, the third service being distinct from the second service, after causing the first node to stop providing the second service; and
- in response to receiving a request relating to the second service, causing a second node of the plurality of nodes to provide the second service.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
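The last three elements of claim 1 describe a node being repurposed: a service that receives no requests is stopped, the freed node takes on a different service, and a later request for the stopped service is satisfied on another node. The sketch below illustrates that sequence under stated assumptions: the `Cluster` class, the idle timeout, and the "first free node" placement policy are hypothetical and are not taken from the patent.

```python
class Cluster:
    def __init__(self, node_names):
        # node name -> service currently provided (None means the node is free)
        self.assignment = {name: None for name in node_names}
        # service name -> time of the most recent request for it
        self.last_request = {}

    def reclaim_idle(self, now, idle_after):
        """Stop every service that saw no request for idle_after time units."""
        freed = []
        for node, service in self.assignment.items():
            if service is not None and now - self.last_request[service] >= idle_after:
                self.assignment[node] = None
                freed.append(node)
        return freed

    def request(self, service, now):
        """Route a request, starting the service on a free node if needed."""
        self.last_request[service] = now
        for node, current in self.assignment.items():
            if current == service:
                return node
        for node, current in self.assignment.items():
            if current is None:
                self.assignment[node] = service
                return node
        return None  # no free node; a real system would need a policy here


cluster = Cluster(["node-1", "node-2"])
first = cluster.request("second-service", now=0)      # runs on the first node
cluster.reclaim_idle(now=100, idle_after=60)          # no requests seen: stopped
reused = cluster.request("third-service", now=100)    # first node is repurposed
moved = cluster.request("second-service", now=110)    # restarts on another node
```

Here the second service ends up on a different node than the one it started on, because its original node is already providing the third service when the new request arrives.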
12. A method comprising:
- receiving a connection request from a client specifying a first port number and a first host name, the first port number and the first host name advertised externally with respect to a plurality of nodes, each node comprising a switching fabric integrated to a card and at least two processors integrated to the card;
- identifying a service at the plurality of nodes, the identified service corresponding to the first port number and the first host name;
- determining whether the identified service is available based on one or more aspects of the connection request and one or more rules indicating whether the service is available;
- if the identified service is determined to be available based on one or more aspects of the connection request and one or more rules indicating whether the service is available, instantiating a host providing the identified service at one or more nodes in the plurality of nodes, wherein instantiating the host comprises using a boot image, a file system, and an operating system (OS) configuration file corresponding to the identified service to boot at least one node in the plurality of nodes in response to the received connection request;
- causing a first node of the plurality of nodes to stop providing a second service in response to not detecting one or more requests related to the second service;
- causing the first node of the plurality of nodes to provide a third service, the third service being distinct from the second service, after causing the first node to stop providing the second service; and
- in response to receiving a request relating to the second service, causing a second node of the plurality of nodes to provide the second service.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22
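The determining step combines "aspects of the connection request" with "rules indicating whether the service is available". The claim does not say which aspects or rules are used, so the following is only one possible reading: the client address and arrival hour are assumed aspects, and `from_network`, `during_hours`, and `is_available` are hypothetical helpers. A service is treated as available only when every rule accepts the request.

```python
from dataclasses import dataclass
from ipaddress import ip_address, ip_network


@dataclass(frozen=True)
class ConnectionRequest:
    host_name: str
    port: int
    client_ip: str   # assumed aspect: where the request came from
    hour: int        # assumed aspect: hour of day it arrived, 0-23


def from_network(cidr):
    """Rule: accept only clients inside the given network."""
    network = ip_network(cidr)
    return lambda req: ip_address(req.client_ip) in network


def during_hours(start, end):
    """Rule: accept only requests arriving in the half-open range [start, end)."""
    return lambda req: start <= req.hour < end


def is_available(request, rules):
    """The service is available only if every rule accepts the request."""
    return all(rule(request) for rule in rules)


rules = [from_network("10.0.0.0/8"), during_hours(8, 18)]
inside = ConnectionRequest("sim.example.org", 8080, "10.1.2.3", hour=9)
outside = ConnectionRequest("sim.example.org", 8080, "192.168.1.5", hour=9)
late = ConnectionRequest("sim.example.org", 8080, "10.1.2.3", hour=22)
```

Expressing each rule as a small predicate keeps the availability decision separate from the instantiation step, so rules can be added or removed without touching the boot logic.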
23. A system, comprising:
- a memory that stores a list of services at a plurality of nodes, each node comprising a switching fabric integrated to a card and at least two processors integrated to the card;
- at least one processor that:
  - receives a connection request from a client specifying a first port number and a first host name, the first port number and the first host name advertised externally with respect to the plurality of nodes;
  - identifies a service at the plurality of nodes, the identified service corresponding to the first port number and the first host name;
  - determines whether the identified service is available based on one or more aspects of the connection request and one or more rules indicating whether the service is available; and
  - if the identified service is determined to be available based on one or more aspects of the connection request and one or more rules indicating whether the service is available, instantiates a host providing the identified service at one or more nodes in the plurality of nodes, wherein instantiating the host comprises using a boot image, a file system, and an operating system (OS) configuration file corresponding to the identified service to boot at least one node in the plurality of nodes in response to the received connection request;
  - causes a first node of the plurality of nodes to stop providing a second service in response to not detecting one or more requests related to the second service;
  - causes the first node of the plurality of nodes to provide a third service, the third service being distinct from the second service, after causing the first node to stop providing the second service; and
  - in response to receiving a request relating to the second service, causing a second node of the plurality of nodes to provide the second service.
Dependent claims: 24, 25, 26, 27, 28, 29
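Claim 23 pairs a stored list of services with a boot step that uses a boot image, a file system, and an OS configuration file corresponding to the identified service. One way to picture that stored list is a record per service naming those three artifacts. The sketch below is illustrative only: the `BootSpec` type, the file paths, and the `boot_plan` function are hypothetical, and the function merely describes the boot steps rather than booting real hardware.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BootSpec:
    boot_image: str    # image the node boots from
    file_system: str   # file system the booted node uses
    os_config: str     # OS configuration file applied at boot


# Hypothetical in-memory service list; every path here is made up.
SERVICE_LIST = {
    "simulation": BootSpec("images/sim.img", "fs/sim.squashfs", "conf/sim.conf"),
    "database": BootSpec("images/db.img", "fs/db.squashfs", "conf/db.conf"),
}


def boot_plan(service, node):
    """Describe the steps for booting `node` so that it provides `service`."""
    spec = SERVICE_LIST[service]
    return [
        f"{node}: load boot image {spec.boot_image}",
        f"{node}: mount file system {spec.file_system}",
        f"{node}: apply OS configuration {spec.os_config}",
    ]


plan = boot_plan("simulation", "node-3")
```

Keeping the three artifacts together per service means the same node can be rebooted into a different service simply by selecting a different record.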
Specification