Compressing database workloads
Abstract
Relational database applications such as index selection, histogram tuning, approximate query processing, and statistics selection have recognized the importance of leveraging workloads. Often these applications are presented with large workloads, i.e., a set of SQL DML statements, as input. A key factor affecting the scalability of such applications is the size of the workload. The invention concerns workload compression which helps improve the scalability of such applications. The exemplary embodiment is broadly applicable to a variety of workload-driven applications, while allowing for incorporation of application specific knowledge. The process is described in detail in the context of two workload-driven applications: index selection and approximate query processing.
51 Claims
1. A method for compressing a workload used by an application comprising:
- providing a distance function for a pair of statements within an initial workload W containing an initial plurality of statements; and
- finding a compressed workload W′ containing a compressed plurality of statements.
Dependent claims: 2, 3, 4, 5
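The two steps of claim 1 (a pairwise distance function, then a search for a smaller workload W′) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the claims: `token_distance` is a hypothetical Jaccard distance over statement tokens, and `compress_workload` is a simple greedy pass with a hypothetical `threshold` parameter, not the search procedure of the patent.

```python
import re
from typing import Callable, List, Sequence


def token_distance(a: str, b: str) -> float:
    """Hypothetical distance: Jaccard distance between the token sets of two statements."""
    tokens_a = set(re.findall(r"\w+", a.lower()))
    tokens_b = set(re.findall(r"\w+", b.lower()))
    if not tokens_a and not tokens_b:
        return 0.0
    return 1.0 - len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def compress_workload(
    workload: Sequence[str],
    distance: Callable[[str, str], float],
    threshold: float,
) -> List[str]:
    """Greedy sketch: keep a statement only if every statement kept so far
    is farther away than `threshold` (an assumed tuning parameter)."""
    compressed: List[str] = []
    for stmt in workload:
        if all(distance(stmt, kept) > threshold for kept in compressed):
            compressed.append(stmt)
    return compressed


# Initial workload W (illustrative statements only).
W = [
    "SELECT name FROM emp WHERE id = 1",
    "SELECT name FROM emp WHERE id = 2",
    "SELECT total FROM orders WHERE region = 'EU'",
]

# Compressed workload W': the second statement is close to the first and is dropped.
W_prime = compress_workload(W, token_distance, threshold=0.5)
```

In this sketch the first two statements differ only in a constant, so their distance (0.25) falls under the threshold and one of them stands in for both. An application-specific distance function could replace `token_distance` without changing the search loop.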
6. A method for compressing a workload used by an application and including a plurality of statements comprising:
- providing a distance function for a pair of statements within an initial workload W;
- finding a compressed workload W′;
- establishing an initial running time which is a running time of the application on the initial workload W;
- establishing a total running time which is a sum of a running time of the application on the compressed workload W′ plus a time taken to find the compressed workload W′; and
- establishing a limit on the total running time.
Dependent claims: 7, 8, 9, 10, 11, 12, 13, 14, 15, 16
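The running-time accounting in claim 6 can be sketched as follows: the total running time is the time taken to find W′ plus the time the application spends on W′, and that total is compared against a limit. The helper names (`timed`, `run_with_budget`) and the stand-in compression and application functions are hypothetical and exist only for this illustration; the claims do not say how the limit is chosen.

```python
import time
from typing import Any, Callable, List, Sequence, Tuple


def timed(fn: Callable[..., Any], *args: Any) -> Tuple[Any, float]:
    """Run fn(*args) and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - start


def run_with_budget(
    workload: Sequence[str],
    compress: Callable[[Sequence[str]], List[str]],
    application: Callable[[Sequence[str]], Any],
    limit_seconds: float,
) -> Tuple[float, bool]:
    """Return (total running time, whether it stayed within the limit).

    total running time = time to find W' + application time on W'
    """
    compressed, compress_time = timed(compress, workload)
    _, app_time = timed(application, compressed)
    total = compress_time + app_time
    return total, total <= limit_seconds


# Hypothetical stand-ins for a compression step and a workload-driven application.
def keep_every_other(workload: Sequence[str]) -> List[str]:
    return list(workload[::2])


def count_statements(workload: Sequence[str]) -> int:
    return len(workload)


total, within_limit = run_with_budget(
    ["q1", "q2", "q3", "q4"], keep_every_other, count_statements, limit_seconds=1.0
)
```

Measuring the application on the initial workload W with the same `timed` helper would give the initial running time, which could then serve as a reference point when picking `limit_seconds`.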
17. A computer system for evaluating a workload comprising:
- a database management system including an initial workload W including an initial plurality of statements;
- an evaluation component that determines a distance function for a pair of statements within a workload; and
- a search component which evaluates the distance function and finds a compressed workload W′ including a compressed plurality of statements.
Dependent claims: 18, 19, 20, 21
22. A computer system for evaluating a workload comprising:
- a database management system including an initial workload W including an initial plurality of statements;
- an evaluation component that determines a distance function for a pair of statements within a workload;
- a search component which evaluates the distance function and finds a compressed workload W′ including a compressed plurality of statements;
- an initial running time which is a running time of an application on the initial workload W;
- a total running time which is a sum of a running time of the application on the compressed workload W′ plus a time taken to find the compressed workload W′; and
- a limit on the total running time.
Dependent claims: 23, 24, 25, 26, 27, 28, 29, 30, 31, 32
33. A computer readable medium having instructions stored on the medium for:
- providing a distance function for a pair of statements within an initial workload W containing an initial plurality of statements; and
- finding a compressed workload W′ containing a compressed plurality of statements.
Dependent claims: 34, 35, 36, 37, 38, 39
40. A computer readable medium having instructions stored on the medium for:
- providing a distance function for a pair of statements within an initial workload W;
- finding a compressed workload;
- establishing an initial running time which is a running time of the application on the initial workload W;
- establishing a total running time which is a sum of a running time of the application on the compressed workload W′ plus a time taken to find the compressed workload W′; and
- establishing a limit on the total running time.
Dependent claims: 41, 42, 43, 44, 45, 46, 47, 48, 49, 50
51. A computer system for evaluating a workload comprising:
- a database management means for managing a database;
- a distance function means for determining a distance function; and
- an evaluation means for evaluating the distance function.
Specification