Dynamic optimization of multi-feature queries
First Claim
1. A method for optimally performing a similarity search for a query object using at least one data stream, each of the at least one data stream for a feature attribute and being a list in distance order, comprising the steps of:
- determining a query plan using a cost-aware model; and
executing the query plan to obtain at least one object using at least one of the at least one data stream;
wherein information related to the similarity search is returned once the distance of the first of the at least one obtained object is at most equal to a threshold value based on an aggregate distance of highest distances of objects obtained from each of the at least one data stream.
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention provides an elegant solution for processing multi-feature queries, which considers the differing access costs associated with each feature. Access cost is a critical factor in determining how individual features should be processed in terms of retrieving through sorted or random access, and, hence, in minimizing the overall query response time. The present invention operates dynamically during query processing and seeks to minimize the total query cost in terms of number of features retrieved and cost for access. It works by evaluating different combinations of feature access plans (sorted and random access) according to the number of retrieved features and forward access costs, and it selects the lowest cost plan. Experimental results on practical data show a significant speed-up in multi-features queries using the proposed solution.
91 Citations
14 Claims
-
1. A method for optimally performing a similarity search for a query object using at least one data stream, each of the at least one data stream for a feature attribute and being a list in distance order, comprising the steps of:
-
determining a query plan using a cost-aware model; and
executing the query plan to obtain at least one object using at least one of the at least one data stream;
wherein information related to the similarity search is returned once the distance of the first of the at least one obtained object is at most equal to a threshold value based on an aggregate distance of highest distances of objects obtained from each of the at least one data stream. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A program storage device readable by a machine, tangibly embodying a program of instructions executable on the machine to perform method steps for optimally performing a similarity search for a query object using at least one data stream, each of the at least one data stream for a feature attribute and being a list in distance order, the method steps comprising:
-
determining a query plan using a cost-aware model; and
executing the query plan to obtain at least one object using at least one of the at least one data stream;
wherein information related to the similarity search is returned once the distance of the first of the at least one obtained object is at most equal to a threshold value based on an aggregate distance of highest distances of objects obtained from each of the at least one data stream. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
Specification