Analyzing database cluster behavior by transforming discrete time series measurements
First Claim
1. A computer implemented method for analyzing performance of components of a database cluster by transforming discrete events into a time series to identify dominant signals, the method comprising:
- sampling, using a measurement instrument, a plurality of discrete event measurements of the database cluster to produce a set of timestamped events, the database cluster comprising a plurality of nodes that share a shared resource, and the plurality of discrete event measurements obtained from multiple ones of the plurality of nodes;
tagging at least one of the timestamped events sampled by one or more hardware components or one or more software components of the database cluster with a semantic tag, wherein tagging the at least one of the timestamped events with the semantic tag comprises assigning a state corresponding to the shared resource of the database cluster, the semantic tag includes at least a first state of waiting for the shared resource and a second state of obtaining access to the shared resource;
generating a single magnitude value for a related set of the timestamped events for multiple sessions with respect to the semantic tag for the first state of waiting for the shared resource or the second state of obtaining access to the shared resource by applying different weighting factors to some or all of the sessions of the timestamped events corresponding to the first and second state to derive the single magnitude value;
formatting the set of timestamped events into a time series, wherein a time series entry comprises a time indication and a plurality of values, at least one of the plurality of values corresponding to the single magnitude value generated for the related set of the semantic tag for the multiple sessions in the database cluster; and
processing the time series to identify at least one state signal.
1 Assignment
0 Petitions
Accused Products
Abstract
A method, system, and computer program product for analyzing performance of a database cluster. Disclosed are techniques for analyzing performance of components of a database cluster by transforming many discrete event measurements into a time series to identify dominant signals. The method embodiment commences by sampling the database cluster to produce a set of timestamped events, then pre-processing the timestamped events by tagging at least some of the timestamped events with a semantic tag drawn from a semantic dictionary and formatting the set of timestamped events into a time series where a time series entry comprises a time indication and a plurality of values corresponding to signal state values. Further techniques are disclosed for identifying certain signals from the time series to which is applied various statistical measurement criteria in order to isolate a set of candidate signals which are then used to identify indicative causes of database cluster behavior.
19 Citations
20 Claims
-
1. A computer implemented method for analyzing performance of components of a database cluster by transforming discrete events into a time series to identify dominant signals, the method comprising:
-
sampling, using a measurement instrument, a plurality of discrete event measurements of the database cluster to produce a set of timestamped events, the database cluster comprising a plurality of nodes that share a shared resource, and the plurality of discrete event measurements obtained from multiple ones of the plurality of nodes; tagging at least one of the timestamped events sampled by one or more hardware components or one or more software components of the database cluster with a semantic tag, wherein tagging the at least one of the timestamped events with the semantic tag comprises assigning a state corresponding to the shared resource of the database cluster, the semantic tag includes at least a first state of waiting for the shared resource and a second state of obtaining access to the shared resource; generating a single magnitude value for a related set of the timestamped events for multiple sessions with respect to the semantic tag for the first state of waiting for the shared resource or the second state of obtaining access to the shared resource by applying different weighting factors to some or all of the sessions of the timestamped events corresponding to the first and second state to derive the single magnitude value; formatting the set of timestamped events into a time series, wherein a time series entry comprises a time indication and a plurality of values, at least one of the plurality of values corresponding to the single magnitude value generated for the related set of the semantic tag for the multiple sessions in the database cluster; and processing the time series to identify at least one state signal. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer system for analyzing performance of components of a database cluster by transforming discrete events into a time series to identify dominant signals, the computer system comprising:
-
a computer processor to execute a set of program code instructions; and a memory to hold the program code instructions, in which the program code instructions comprises program code to perform; sampling a plurality of discrete event measurements of the database cluster to produce a set of timestamped events, the database cluster comprising a plurality of nodes that share a shared resource, and the plurality of discrete event measurements obtained from multiple ones of the plurality of nodes; tagging at least one of the timestamped events sampled by one or more hardware components or one or more software components of the database cluster with a semantic tag, wherein tagging the at least one of the timestamped events with the semantic tag comprises assigning a state corresponding to the shared resource of the database cluster, the semantic tag includes at least a first state of waiting for the shared resource and a second state of obtaining access to the shared resource; generating a single magnitude value for a related set of the timestamped events for multiple sessions with respect to the semantic tag for the first state of waiting for the shared resource or the second state of obtaining access to the shared resource by applying different weighting factors to some or all of the sessions of the timestamped events corresponding to the first and second state to derive the single magnitude value; formatting the set of timestamped events into a time series, wherein a time series entry comprises a time indication and a plurality of values, at least one of the plurality of values corresponding to the single magnitude value generated for the related set of the semantic tag for the multiple sessions in the database cluster; and processing the time series to identify at least one state signal. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer program product embodied in a non-transitory computer readable medium, the computer readable medium having stored thereon a sequence of instructions which, when executed by a processor causes the processor to execute a process to analyze performance of components of a database cluster by transforming discrete events into a time series to identify dominant signals, the process comprising:
-
sampling a plurality of discrete event measurements of the database cluster to produce a set of timestamped events the database cluster comprising a plurality of nodes that share a shared resource, and the plurality of discrete event measurements obtained from multiple ones of the plurality of nodes; tagging at least one of the timestamped events sampled by one or more hardware components or one or more software components of the database cluster with a semantic tag, wherein tagging the at least one of the timestamped events with the semantic tag comprises assigning a state corresponding to the shared resource of the database cluster, the semantic tag includes at least a first state of waiting for the shared resource and a second state of obtaining access to the shared resource; generating a single magnitude value for a related set of the timestamped events for multiple sessions with respect to the semantic tag for the first state of waiting for the shared resource or the second state of obtaining access to the shared resource by applying a different weighting factors to some or all of the sessions of the timestamped events corresponding to the first and second state to derive the single magnitude value; formatting the set of timestamped events into a time series, wherein a time series entry comprises a time indication and a plurality of values, at least one of the plurality of values corresponding to the single magnitude value generated for the related set of the semantic tag for the multiple sessions in the database cluster; and processing the time series to identify at least one state signal. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification