System and method for adaptive query identification and acceleration
First Claim
1. A computer-implemented data management system, comprising:
- an application server, including computer readable storage media comprising;
software, when executed by a computer performs an operation of;
collecting a plurality of realtime queries requiring execution;
software, when executed by a computer performs an operation of;
providing an ordered vector space of data attributes;
providing, from available data, a query data attribute extent vector space;
executing a plurality of predefined queries and providing a query signature for each query, each query signature including a data attribute component;
executing each query of the plurality of predefined queries without using cache and recording an uncached execution time;
executing each query of the plurality of predefined queries using cache and recording a cached execution time;
creating, for each query, a time estimate coefficient and estimated time for query coefficient;
calculating, for each query, a similarity score with respect to each other query; and
providing metrics including the similarity score for each one of the plurality of predefined queries; and
software, when executed by a computer performs an operation of;
ordering realtime queries based on said metrics of corresponding similar predefined queries to improve performance of said system.
1 Assignment
0 Petitions
Accused Products
Abstract
A data management system and method. The system is adapted to analyze a plurality of data queries and provide a metric with respect thereto. The system then orders the queries based on the metric to improve a performance parameter of the system or estimate an execution time with respect to the query. In a more specific implementation, the system includes code for analyzing a signature of each of the queries, comparing the signature to a calibrated signature and providing a signature score in response thereto. The system further includes code for estimating an execution time of at least one of the queries. The signature may be a cluster of statements in an appropriate language such as SQL (standard query language). As an alternative, the signature may be a set of features extracted from a plurality of statements.
-
Citations
14 Claims
-
1. A computer-implemented data management system, comprising:
-
an application server, including computer readable storage media comprising; software, when executed by a computer performs an operation of; collecting a plurality of realtime queries requiring execution; software, when executed by a computer performs an operation of; providing an ordered vector space of data attributes; providing, from available data, a query data attribute extent vector space; executing a plurality of predefined queries and providing a query signature for each query, each query signature including a data attribute component; executing each query of the plurality of predefined queries without using cache and recording an uncached execution time; executing each query of the plurality of predefined queries using cache and recording a cached execution time; creating, for each query, a time estimate coefficient and estimated time for query coefficient; calculating, for each query, a similarity score with respect to each other query; and providing metrics including the similarity score for each one of the plurality of predefined queries; and software, when executed by a computer performs an operation of; ordering realtime queries based on said metrics of corresponding similar predefined queries to improve performance of said system. - View Dependent Claims (2, 3, 4)
-
-
5. A computer readable storage media comprising:
software, when executed by a computer performs an operation of; providing an ordered vector space of data attributes; providing, from available data, a query data attribute extent vector space; executing a plurality of predefined queries and providing a query signature for each query, each query signature including a data attribute component; executing each query of the plurality of predefined queries without using cache and recording an uncached execution time; executing each query of the plurality of predefined queries using cache and recording a cached execution time; creating, for each query, a time estimate coefficient and estimated time for query coefficient; and calculating, for each query, a similarity score with respect to each other query. - View Dependent Claims (6, 7, 8)
-
9. A computer-implemented data management system, comprising:
computer-readable storage media comprising a database; a processor coupled to said database; a communications interface for supplying a plurality of queries to said database via said processor; and computer readable storage media including computer-readable instructions for execution by said processor for analyzing a plurality of said queries and organizing said queries to optimize a performance parameter with respect to a retrieval thereof and output results for the queries to a user;
said computer-readable instructions when executed by a computer performs an operation of;providing an ordered vector space of data attributes;
providing, from available data, a query data attribute extent vector space;executing a plurality of predefined queries and providing a query signature for each query, each query signature including a data attribute component; executing each query of the plurality of predefined queries without using cache and recording an uncached execution time; executing each query of the plurality of predefined queries using cache and recording a cached execution time; creating, for each query, a time estimate coefficient and estimated time for query coefficient; calculating, for each query, a similarity score with respect to each other query; and providing metrics including the similarity score for each one of the plurality of predefined queries. - View Dependent Claims (10, 11, 12, 13)
-
14. A data management method for implementing by a computer including computer-implemented steps of:
-
collecting a plurality of realtime queries requiring execution; providing an ordered vector space of data attributes; providing, from available data, a query data attribute extent vector space; executing a plurality of predefined queries and providing a query signature for each query, each query signature including a data attribute component; executing each query of the plurality of predefined queries without using cache and recording an uncached execution time; executing each query of the plurality of predefined queries using cache and recording a cached execution time; creating, for each query, a time estimate coefficient and estimated time for query coefficient; and calculating, for each query, a similarity score with respect to each other query; providing metrics including the similarity score for each one of the plurality of predefined queries; and ordering the realtime queries based on said metrics associated with similar corresponding predefined queries; and outputting results of the realtime queries.
-
Specification