System and method for optimizing pattern query searches on a graph database
First Claim
1. A method for optimizing pattern query searches on a graph database, the method being implemented by a computer including at least one processor and comprising:
- providing a pattern search engine operative to generate a search plan on the graph database from a first pattern query, the pattern search engine being executed by the processor;
identifying, by the at least one processor, in the first pattern query a first subpattern query and a second subpattern query that is structurally equivalent to the first subpattern query;
wherein the first subpattern query and the second subpattern query each comprise a single path;
wherein the structural equivalence meets criteria of;
the paths have a same number of nodes;
nodes in a same position on the paths are of a same type;
nodes in a same position on the paths have same qualifications;
only start and end nodes are shared;
none of non-shared nodes between the paths are exported;
any shared node is shared with both paths in a same position;
having one shared node between both paths; and
non-shared nodes in the paths are not used in pattern query language (PQL) value or constraint expressions; and
reducing the number of search expressions in the search plan based on the structural equivalence between the first subpattern query and the second subpattern query.
3 Assignments
0 Petitions
Accused Products
Abstract
An embodiment of the system and method for optimizing pattern query searches on a graph database uses a pattern query optimizer to optimize execution of the search plan for any sequence of SQL expressions by separating or breaking a pattern query into multiple subpattern queries before converting the subpattern queries into SQL expressions. An embodiment of the pattern query optimizer algorithmically, without intervention by an analyst, decomposes any pattern query into a set of subpattern queries by first identifying branches and cycles within a pattern query and then decomposing each identified branch and cycle into equivalent straight line paths, i.e., straight line nodes joined by edges. Cardinality may be used to improve the performance of pattern searches.
-
Citations
16 Claims
-
1. A method for optimizing pattern query searches on a graph database, the method being implemented by a computer including at least one processor and comprising:
-
providing a pattern search engine operative to generate a search plan on the graph database from a first pattern query, the pattern search engine being executed by the processor; identifying, by the at least one processor, in the first pattern query a first subpattern query and a second subpattern query that is structurally equivalent to the first subpattern query; wherein the first subpattern query and the second subpattern query each comprise a single path; wherein the structural equivalence meets criteria of; the paths have a same number of nodes; nodes in a same position on the paths are of a same type; nodes in a same position on the paths have same qualifications; only start and end nodes are shared; none of non-shared nodes between the paths are exported; any shared node is shared with both paths in a same position; having one shared node between both paths; and non-shared nodes in the paths are not used in pattern query language (PQL) value or constraint expressions; and reducing the number of search expressions in the search plan based on the structural equivalence between the first subpattern query and the second subpattern query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
Specification