Capacity planning for server resources
First Claim
Patent Images
1. A method, comprising:
- deriving a load table that contains empirically-derived load table values, each load table value representing a maximum load handled by one or more servers having a known amount of memory and a processor having a known speed;
receiving server parameter values indicating operating parameters for one or more servers in a server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the load table with the server parameter values and the specified load value to derive server resource utilization estimates for the server resources to determine how handling the specified load will affect the utilization of server resources; and
displaying the server resource utilization estimates, wherein;
the server cluster handles a total number of multiple document types, each document type having a document type value assigned to it, the document type value for each document type indicating the amount of each document type in relation to the total amount of document types;
the deriving a load table further comprises deriving a load table for each document type, the load values in the load table being empirically-derived from servers having a known amount of memory and a processor having a known speed, and the load comprising only the document type for which the load table is derived; and
the utilizing the load table with the server parameter values and the specified load value further comprises;
for each document type, finding the closest match in the corresponding load table between the server parameter values and the entries in the load table; and
deriving server resource utilization estimates by using the load value of the closest match as the maximum load that can be handled by the server for each document type.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and systems for capacity planning of server resources are described wherein fixed resources of a server cluster are used in comparison to similar server cluster benchmarks to determine the maximum load—requests per second—that can be handled by the server cluster. The maximum load is used to determine utilization of server resources and to provide estimates of server resource utilization for hypothetical loads. A recommendation as to changes to server resources to handle the hypothetical loads is displayed to the user.
-
Citations
48 Claims
-
1. A method, comprising:
-
deriving a load table that contains empirically-derived load table values, each load table value representing a maximum load handled by one or more servers having a known amount of memory and a processor having a known speed;
receiving server parameter values indicating operating parameters for one or more servers in a server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the load table with the server parameter values and the specified load value to derive server resource utilization estimates for the server resources to determine how handling the specified load will affect the utilization of server resources; and
displaying the server resource utilization estimates, wherein;
the server cluster handles a total number of multiple document types, each document type having a document type value assigned to it, the document type value for each document type indicating the amount of each document type in relation to the total amount of document types;
the deriving a load table further comprises deriving a load table for each document type, the load values in the load table being empirically-derived from servers having a known amount of memory and a processor having a known speed, and the load comprising only the document type for which the load table is derived; and
the utilizing the load table with the server parameter values and the specified load value further comprises;
for each document type, finding the closest match in the corresponding load table between the server parameter values and the entries in the load table; and
deriving server resource utilization estimates by using the load value of the closest match as the maximum load that can be handled by the server for each document type. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. An apparatus, comprising:
-
means for deriving a load table that contains empirically-derived load table values, each load table value representing a maximum load handled by one or more servers having a known amount of memory and a processor having a known speed;
first means for receiving configured to receive server parameter values indicating operating parameters for one or more servers in a server cluster;
second means for receiving configured to receive a specified load value that indicates a load desired to be handled by the server cluster;
means for utilizing the load table with the server parameter values and the specified load value to derive server resource utilization estimates for the server resources to determine how handling the specified load will affect the utilization of server resources; and
means for displaying the server resource utilization estimates, wherein;
the server cluster handles a total number of multiple document types, each document type having a document type value assigned to it, the document type value for each document type indicating the amount of each document type in relation to the total amount of document types;
the deriving a load table further comprises deriving a load table for each document type, the load values in the load table being empirically-derived from servers having a known amount of memory and a processor having a known speed, and the load comprising only the document type for which the load table is derived; and
the utilizing the load table with the server parameter values and the specified load value further comprises;
for each document type, finding the closest match in the corresponding load table between the server parameter values and the entries in the load table; and
deriving server resource utilization estimates by using the load value of the closest match as the maximum load that can be handled by the server for each document type.
-
-
10. A system for deriving server resource utilization estimates for a server cluster that handles multiple document types, the system comprising:
-
means for assigning a document type value to each document type, each document type value indicating a percentage that each document type makes up of a total amount of document types;
means for deriving a load table for each document type, each load table containing load table values empirically derived from a server cluster that has a known amount of memory and a processor having a known type and speed, each load table value representing a maximum load that can be handled by the server cluster when the load comprises only one of the multiple document types;
first means for receiving configured to receive one or more server cluster parameter values that indicate operating parameters for the server cluster;
second means for receiving configured to receive a specified load value that indicates a load desired to be handled by the server cluster;
means for utilizing the load tables to derive server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources;
means for displaying the server resource utilization estimates; and
means for recommending a plan to optimize handling of the specified load by increasing resources of the server cluster. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A method comprising:
-
assigning a document type value to each document type, each document type value indicating a percentage that each document type makes up of a total amount of document types;
deriving a load table for each document type, each load table containing load table values empirically derived from a server cluster that has a known amount of memory and a processor having a known type and speed, each load table value representing a maximum load that can be handled by the server cluster when the load comprises only one of the multiple document types;
receiving one or more server cluster parameter values that indicate operating parameters for the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the load tables to derive server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster.
-
-
20. One or more computer-readable media having computer-readable instructions thereon which, when executed by one or more computers, cause the computers to derive server resource utilization estimates for a server cluster by implementing acts of:
-
collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
extrapolating the collected values to determine the maximum load value that can be handled by the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy to use in deriving server resource utilization estimates;
deriving server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources if the collected values have been determined to provide sufficient accuracy;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster, if necessary, wherein the utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy further comprises;
determining the first derivative of CPU utilization as a function of the specified load;
determining the value of the function at the point of the largest collected incoming server load value to obtain a slope value;
comparing the slope value with a pre-determined value and, if the slope value is larger than the pre-determined value, then assuming the collected values are of sufficient accuracy to proceed; and
determining the specified load value at which the CPU utilization equals one hundred percent, and using that load value as the maximum load value that can be handled by the server cluster for further calculations. - View Dependent Claims (21, 22, 23, 24)
-
-
25. A method for deriving server resource utilization estimates for a server cluster, the method comprising:
-
collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
extrapolating the collected values to determine the maximum load value that can be handled by the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy to use in deriving server resource utilization estimates;
deriving server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources if the collected values have been determined to provide sufficient accuracy;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster, if necessary, wherein the server resource utilization is processor utilization, the method further comprising;
finding a functional dependency approximation between processor utilization and load;
transforming functional dependency into linear form by using logarithmic transformation;
deriving first and second processor regression constants using linear regression methodology;
dividing the first processor regression constant by e to the power of the product of the second processor regression constant and the specified load to obtain the processor utilization estimate.
-
-
26. One or more computer-readable media having computer-readable instructions thereon which, when executed by one or more computers, cause the computers to derive server resource utilization estimates for a server cluster by implementing acts of:
-
collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
extrapolating the collected values to determine the maximum load value that can be handled by the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy to use in deriving server resource utilization estimates;
deriving server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources if the collected values have been determined to provide sufficient accuracy;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster, if necessary, wherein the server resource utilization is communication bandwidth utilization, the method further comprising;
finding a functional dependency approximation between communication bandwidth utilization;
transforming functional dependency into linear form by using logarithmic transformation;
deriving first and second bandwidth regression constants using linear regression methodology;
deriving a transmission overhead factor that, when applied to a certain size web page, results in the actual capacity necessary to transmit the web page;
deriving a weighted communication overhead factor from the transmission overhead factor and the available communication bandwidth;
deriving an adjusted communication load from the specified load and the first and second bandwidth regression constants; and
determining the communication bandwidth utilization estimate utilizing the weighted communication overhead factor and the adjusted communication load.
-
-
27. One or more computer-readable media having computer-readable instructions thereon which, when executed by one or more computers, cause the computers to derive server resource utilization estimates for a server cluster by implementing acts of:
-
collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
extrapolating the collected values to determine the maximum load value that can be handled by the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy to use in deriving server resource utilization estimates;
deriving server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources if the collected values have been determined to provide sufficient accuracy;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster, if necessary, wherein the server resource utilization is communication bandwidth utilization, the method further comprising;
finding a functional dependency approximation between communication bandwidth utilization;
transforming functional dependency into linear form by using logarithmic transformation;
deriving first and second bandwidth regression constants using linear regression methodology;
deriving a transmission overhead factor that, when applied to a certain size web page, results in the actual capacity necessary to transmit the web page;
deriving a weighted communication overhead factor by dividing the transmission overhead factor by the available communication bandwidth;
deriving an adjusted communication load by adding the first bandwidth regression constant to the product of the specified load and the second bandwidth regression constant; and
determining the communication bandwidth utilization estimate by multiplying the weighted communication overhead factor by the adjusted communication load.
-
-
28. One or more computer-readable media having computer-readable instructions thereon which, when executed by one or more computers, cause the computers to derive server resource utilization estimates for a server cluster by implementing acts of:
-
collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
extrapolating the collected values to determine the maximum load value that can be handled by the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy to use in deriving server resource utilization estimates;
deriving server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources if the collected values have been determined to provide sufficient accuracy;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster, if necessary, wherein the server resource utilization is memory utilization, the method further comprising;
deriving a connection memory factor that is the adjusted average of the incoming connections at different speeds;
deriving a weighted connection memory factor by multiplying the connection memory factor by the specified load;
deriving a page load ratio by dividing the specified load by the difference of the maximum load value and the specified load;
deriving a total number of concurrent connections from the weighted connection memory factor and the page load ratio; and
deriving a gross memory utilization using the total number of concurrent connections, the amount of memory necessary to support each connection for communications, the amount of memory necessary to support data structures associated with each connection, the amount of memory required by a server operating system, and the amount of memory required by the server communication program; and
deriving the memory utilization estimate from the gross memory utilization and total memory available.
-
-
29. A server-client system, comprising:
-
a server cluster having one or more servers, one of which is a primary server that controls the operation of the server cluster;
means for controlling the cluster resident in memory on the primary server of the server cluster, the cluster controller means controlling communications between the primary server and the secondary servers, if any, and between clients and the server cluster;
operating system means configured to be operably stored in the memory of the primary server;
a communications program within the cluster controller to provide communications capability for the server-client system;
filter means configured to collect one or more server parameter values indicating certain operating parameters for the server cluster;
individual monitor means each coupled to one server in the server cluster to collect one or more server parameter values indicating certain operating parameters for the server cluster; and
capacity planner means configured to operate within the cluster controller to utilize the collected server parameter values to derive one or more server resource utilization estimates for server resources to determine how handling a specified load will affect the utilization of the server resources, and to produce a plan recommending changes to be made to the server cluster to adequately handle the specified load, wherein the capacity planner is further configured to determine the maximum load that can be handled by the server cluster, based on the collected server parameter values, and wherein;
the one or more server resource utilization estimates is general server utilization; and
capacity controller means is further configured to derive general server utilization by solving;
wherein U is general server utilization;
X is the maximum load that can be handled by the server cluster; and
L is the specified load.
-
-
30. One or more computer-readable media having computer-readable instructions thereon which, when executed by one or more computers, cause the computers to derive server resource utilization estimates for a server cluster by implementing acts of:
-
collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
extrapolating the collected values that are configured to determine the maximum load value that can be handled by the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the extrapolated maximum load value configured to determine if the collected values are of sufficient accuracy to use in deriving server resource utilization estimates;
deriving server resource utilization estimates configured to determine how handling the specified load will affect the utilization of server resources if the collected values have been determined to provide sufficient accuracy;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster, if necessary, wherein the server resource utilization is memory utilization, the system further comprising means configured to perform acts of;
deriving a connection memory factor that is the adjusted average of the incoming connections at different speeds;
deriving a weighted connection memory factor by multiplying the connection memory factor by the specified load;
deriving a page load ratio by dividing the specified load by the difference of the maximum load value and the specified load;
deriving a total number of concurrent connections by adding the weighted connection memory factor and the page load ratio; and
deriving a gross memory utilization by multiplying the total number of concurrent connections by the sum of the amount of memory necessary to support each connection for communications and the amount of memory necessary to support data structures associated with each connection, and adding the amount of memory required by a server operating system and the amount of memory required by the server communication program; and
deriving the memory utilization estimate by dividing the gross memory utilization by total memory available.
-
-
31. A method comprising:
- collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
extrapolating the collected values to determine the maximum load value that can be handled by the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy to use in deriving server resource utilization estimates;
deriving server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources if the collected values have been determined to provide sufficient accuracy;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster, if necessary, wherein the server resource utilization is general server utilization, the method further comprising, deriving the general server utilization estimate as a function of the specified load and the maximum load.
- collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
-
32. One or more computer-readable media having computer-readable instructions thereon which, when executed by one or more computers, cause the computers to derive server resource utilization estimates for a server cluster by implementing acts of:
-
collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
extrapolating the collected values to determine the maximum load value that can be handled by the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy to use in deriving server resource utilization estimates;
deriving server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources if the collected values have been determined to provide sufficient accuracy;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster, if necessary, wherein the server resource utilization is general server utilization, the method further comprising;
dividing the specified load by the maximum load to derive the general server utilization estimate.
-
-
33. A method comprising:
-
collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
extrapolating the collected values to determine the maximum load value that can be handled by the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy to use in deriving server resource utilization estimates;
deriving server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources if the collected values have been determined to provide sufficient accuracy;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster, if necessary, wherein the utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy further comprises;
determining the first derivative of CPU utilization as a function of the specified load;
determining the value of the function at the point of the largest collected incoming server load value to obtain a slope value;
comparing the slope value with a pre-determined value and, if the slope value is larger than the pre-determined value, then assuming the collected values are of sufficient accuracy to proceed; and
determining the specified load value at which the CPU utilization equals one hundred percent, and using that load value as the maximum load value that can be handled by the server cluster for further calculations.
-
-
34. A server-client system, comprising:
-
a server cluster having one or more servers, one of which is a primary server that controls the operation of the server cluster;
cluster controller means configured to be executed via memory on the primary server of the server cluster, the cluster controller means controlling communications between the primary server and the secondary servers, if any, and between clients and the server cluster;
operating system means resident in the memory of the primary server, the operating means for providing an operating system for the primary server;
communications program means within the cluster controller for providing communications capability for the server-client system;
filter means configured to collect one or more server parameter values indicating certain operating parameters for the server cluster;
individual monitor means associated with each respective server in the server cluster for collecting one or more server parameter values indicating certain operating parameters for the server cluster; and
capacity planner means operably coupled with the cluster controller means and configured to utilize the collected server parameter values for deriving one or more server resource utilization estimates for server resources to determine how handling a specified load will affect the utilization of the server resources, and for producing a plan recommending changes to be made to the server cluster to adequately handle the specified load, wherein the capacity planner means is further configured for determining a maximum load that can be handled by the server cluster, based on the collected server parameter values, and wherein the capacity planner means is further configured to derive memory utilization by solving;
wherein N is a total number of concurrent connections derived by the capacity controller by solving;
wherein UM is memory utilization;
MTCP is an amount of memory necessary to support each connection for communications;
MIISStruct is an amount of memory necessary to support data structures associated with each connection;
MOS is an amount of memory required by the operating system means;
MIIS is an amount of memory required by the server communication program means;
M is a total amount of memory available;
L is the specified load;
X is the maximum load that can be handled by the server cluster; and
S1 is a connection memory factor that is the adjusted average of the incoming connections at different speeds.
-
-
35. One or more computer-readable media having computer-readable instructions thereon which, when executed by one or more computers, cause the computers to derive server resource utilization estimates for a server cluster by implementing acts of:
-
collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
extrapolating the collected values to determine the maximum load value that can be handled by the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy to use in deriving server resource utilization estimates;
deriving server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources if the collected values have been determined to provide sufficient accuracy;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster, if necessary, wherein the server resource utilization is general server utilization, the method further comprising, deriving the general server utilization estimate as a function of the specified load and the maximum load, wherein the extrapolating the collected values to determine the maximum load value that can be handled by the server cluster further comprises;
deriving processor regression constant a and processor regression constant b for each document type by solving the linear equation;
y=a+b·
xfor multiple pairs of (x, y) values;
giving;
to derive the processor regression constants;
wherein x is an independent variable corresponding to an incoming server load value collected at time i, and y is the dependent variable corresponding to CPU utilization collected at time i; and
determining the maximum load value that can be handled by the server cluster by determining a first derivative of UCPU as a function of L, by solving;
U′
CPU=ea′
·
b′
·
eb′
·
Lwherein L is a value at which U′
CPU is one hundred percent, or the maximum load that can be handled by the server cluster. - View Dependent Claims (36, 37)
-
-
38. One or more computer-readable media having computer-readable instructions thereon which, when executed by one or more computers, cause the computers to derive server resource utilization estimates for a server cluster by implementing acts of:
-
collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
extrapolating the collected values to determine the maximum load value that can be handled by the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy to use in deriving server resource utilization estimates;
deriving server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources if the collected values have been determined to provide sufficient accuracy;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster, if necessary, wherein the server resource utilization is general server utilization, the method further comprising, deriving the general server utilization estimate as a function of the specified load and the maximum load, wherein the server resource utilization is communication bandwidth utilization, the method further comprising;
deriving bandwidth regression constant c and bandwidth regression constant d by solving;
y=c+d·
xfor multiple pairs of (x, y), giving;
and d=ddcwherein solving for c and d gives;
and d=ddcwherein x is an independent variable corresponding to an incoming server load value collected at time i, and y is the dependent variable corresponding to communication bandwidth utilization collected at time i; and
deriving communication bandwidth utilization by solving;
wherein UB is communication bandwidth utilization;
FTCP is a transmission overhead factor that, when applied to a certain size page, results in the actual bandwidth necessary to transmit the page;
L is the specified load; and
B is the total communication bandwidth available.
-
-
39. One or more computer-readable media having computer-readable instructions thereon which, when executed by one or more computers, cause the computers to derive server resource utilization estimates for a server cluster by implementing acts of:
-
collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
extrapolating the collected values to determine the maximum load value that can be handled by the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy to use in deriving server resource utilization estimates;
deriving server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources if the collected values have been determined to provide sufficient accuracy;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster, if necessary, wherein the server resource utilization is general server utilization, the method further comprising, deriving the general server utilization estimate as a function of the specified load and the maximum load, wherein the server resource utilization is memory utilization, the method further comprising;
deriving memory utilization by solving;
wherein N is a total number of concurrent connections derived by solving;
wherein UM is memory utilization;
MTCP is a an amount of memory necessary to support each connection for communications;
MIISStruct is the amount of memory necessary to support data structures associated with each connection;
MOS is the amount of memory required by a server operating system;
MIIS is the amount of memory required by a server communication program;
M is the total amount of memory available;
L is the specified load;
X is the maximum load that can be handled by the server cluster; and
S1 is a connection memory factor that is the adjusted average of the incoming connections at different speeds.
-
-
40. One or more computer-readable media having computer-readable instructions thereon which, when executed by one or more computers, cause the computers to derive server resource utilization estimates for a server cluster by implementing acts of:
-
collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
extrapolating the collected values to determine the maximum load value that can be handled by the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy to use in deriving server resource utilization estimates;
deriving server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources if the collected values have been determined to provide sufficient accuracy;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster, if necessary, wherein the server resource utilization is general server utilization, the method further comprising, deriving the general server utilization estimate as a function of the specified load and the maximum load, wherein the server resource utilization is CPU utilization, the method further comprising;
deriving the CPU utilization estimate by solving;
wherein L is the specified load.
-
-
41. A method comprising:
-
collecting one or more server cluster parameter values at different times during operation of the server cluster, the server cluster parameters indicating utilization of server resources;
extrapolating the collected values to determine the maximum load value that can be handled by the server cluster;
receiving a specified load value that indicates a load desired to be handled by the server cluster;
utilizing the extrapolated maximum load value to determine if the collected values are of sufficient accuracy to use in deriving server resource utilization estimates;
deriving server resource utilization estimates to determine how handling the specified load will affect the utilization of server resources if the collected values have been determined to provide sufficient accuracy;
displaying the server resource utilization estimates; and
recommending a plan to optimize handling of the specified load by increasing resources of the server cluster, if necessary, wherein the server resource utilization is general server utilization, the method further comprising, deriving the general server utilization estimate as a function of the specified load and the maximum load.
-
-
42. A server-client system, comprising:
-
a server cluster having one or more servers, one of which is a primary server that controls the operation of the server cluster;
cluster controller means comprising computer readable instructions configured to be resident in memory on the primary server of the server cluster, the cluster controller means for controlling communications between the primary server and the secondary servers, if any, and between clients and the server cluster;
operating system means resident in the memory of the primary server;
communications program means within the cluster controller for providing communications capability for the server-client system;
one or more load tables containing empirically-derived load table values, each load table value representing a maximum load handled by a server cluster having a known amount of memory and a processor having a known type and speed;
one or more server parameter values indicating certain operating parameters for the server cluster; and
capacity planner means within the cluster controller configured to utilize the load table and the server parameter values for deriving one or more server resource utilization estimates for server resources to determine how handling a specified load will affect utilization of the server resources, and for producing a plan recommending changes to be made to the server cluster to adequately handle the specified load, wherein;
the server cluster is configured to handle a total number of document types, each document type having a document type value assigned thereto, the document type value for each document type indicating the percentage that each document type makes up of the total amount of documents types;
the one or more load tables comprise one load table for each document type;
the load table values for each load table being derived when a test load used to derive the load table values comprises only one document type; and
the capacity planner is further configured to find, for each document type, the closest match in the corresponding load table between the memory and processor of the server cluster and the memory and processor entries in the load table. - View Dependent Claims (43, 44, 45, 46)
-
-
47. A server-client system, comprising:
-
a server cluster having one or more servers, one of which is a primary server that controls the operation of the server cluster;
cluster controller means resident in memory on the primary server of the server cluster, the cluster controller means for controlling communications between the primary server and the secondary servers, if any, and between clients and the server cluster;
operating system means resident in the memory of the primary server;
communications program means within the cluster controller means for provide communications capability for the server-client system;
filter means for collecting one or more server parameter values indicating certain operating parameters for the server cluster;
respective monitor means on each server in the server cluster for collecting one or more server parameter values indicating certain operating parameters for the server cluster; and
capacity planner means within the cluster controller configured for utilizing the collected server parameter values to derive one or more server resource utilization estimates for server resources to determine how handling a specified load will affect the utilization of the server resources, and for producing a plan recommending changes to be made to the server cluster to adequately handle the specified load, wherein the capacity planner means is further configured for determining the maximum load that can be handled by the server cluster, based on the collected server parameter values, and wherein;
the one or more server resource utilization estimates is processor utilization; and
the capacity planner means is further configured for deriving processor utilization by solving;
wherein L is the specified load, and a and b are processor regression constants found by applying linear regression methodology to a linear equation stating a functional dependency between load and processor utilization.
-
-
48. A server-client system, comprising:
-
server cluster means having one or more servers, one of which is a primary server means for controlling operation of the server cluster means;
cluster controller means resident in memory on the primary server means, the cluster controller means for controlling communications between the primary server means and any secondary server means, if any, and between clients and the server cluster means;
operating system means resident in the memory of the primary server means;
communications program means within the cluster controller means for providing communications capability for the server-client system;
filter means for collecting one or more server parameter values indicating certain operating parameters for the server cluster means;
monitor means on each server means in the server cluster means for collecting one or more server parameter values indicating certain operating parameters for the server cluster means; and
a capacity planner within the cluster controller configured to utilize the collected server parameter values to derive one or more server resource utilization estimates for server resources to determine how handling a specified load will affect the utilization of the server resources, and to produce a plan recommending changes to be made to the server cluster means to adequately handle the specified load, wherein the capacity planner is further configured to determine the maximum load that can be handled by the server cluster means, based on the collected server parameter values, and wherein;
the one or more server resource utilization estimates is communications bandwidth utilization; and
the capacity controller means is further configured for deriving communication bandwidth utilization by solving;
wherein UB is communication bandwidth utilization;
FTCP is a transmission overhead factor that, when applied to a certain size page, results in the actual bandwidth necessary to transmit the page;
L is the specified load;
B is the total communication bandwidth available; and
c and d are bandwidth regression constants derived by the capacity controller means by applying linear regression methodology to a linear equation stating a functional dependency between load and communications bandwidth utilization.
-
Specification