Multistage intelligent string comparison method
First Claim
Patent Images
1. A method of searching a database for a query comprising the steps of:
- (a) providing a database of strings of characters;
(b) providing a query string;
(c) identifying polygraphs that occur in said query string and also in said database strings;
(d) providing a cost to each said identified polygraph;
(e) positioning the query string relative to each database string;
(f) matching polygraph occurrences in the query string with those in each database string, the cost of matching providing a numerical indication of the similarity between said query string and each said database string;
(g) realigning said query string to reduce the cost by examining edges present in the matching solution;
(h) repeating said matching and said realigning a predetermined number of times or until the cost of matching fails to improve; and
(i) repeating the steps (c) to (h) above for each database string for the purpose of identifying those database strings most similar to said query string.
2 Assignments
0 Petitions
Accused Products
Abstract
A multistage database search method includes a first stage that only considers textual polygraphic frequencies and a second stage that considers the location of the polygraphs and coarsely approximates the proper alignment between the query and the database strings and a third stage that provides a better approximation of the alignment. The final output is a listing of suggested database records ranked by their similarity to the query.
61 Citations
15 Claims
-
1. A method of searching a database for a query comprising the steps of:
-
(a) providing a database of strings of characters; (b) providing a query string; (c) identifying polygraphs that occur in said query string and also in said database strings; (d) providing a cost to each said identified polygraph; (e) positioning the query string relative to each database string; (f) matching polygraph occurrences in the query string with those in each database string, the cost of matching providing a numerical indication of the similarity between said query string and each said database string; (g) realigning said query string to reduce the cost by examining edges present in the matching solution; (h) repeating said matching and said realigning a predetermined number of times or until the cost of matching fails to improve; and (i) repeating the steps (c) to (h) above for each database string for the purpose of identifying those database strings most similar to said query string. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A multistage method of searching a database for a query comprising the steps of:
-
providing a database of strings of characters; providing a query string; processing each database string using a first method selected on the basis of computational time efficiency for identifying database strings that are similar to said query string; processing said similar database strings using a second method selected to be more accurate but less time efficient than said first method for identifying smaller numbers of database strings that are similar to said query string, and providing final identification of most similar database strings based on the result of processing said similar database strings. - View Dependent Claims (13, 14, 15)
-
Specification