Similarity search engine for use with relational databases
Abstract
The invention provides a system and method for defining a schema and sending a query to a Similarity Search Engine to determine a quantitative assessment of the similarity of attributes between an anchor record and one or more target records. The Similarity Search Engine makes a similarity assessment in a single pass through the target records having multiple relationship characteristics. The Similarity Search Engine is a server configuration that comprises a Gateway for command and response routing, a Virtual Document Manager for document generation, a Search Manager for document scoring, and a Relational Database Management System for providing data persistence, data retrieval and access to User Defined Functions. The Similarity Search Engine uses a unique command syntax based on the Extensible Markup Language to implement functions necessary for similarity searching and scoring.
47 Claims
1. A method for performing similarity searching, comprising the steps of:
receiving a request instruction from a client for initiating a similarity search;
generating one or more query commands from the request instruction, each query command designating an anchor document and at least one search document;
executing each query command, including:
computing a normalized document similarity score having a value of between 0.00 and 1.00 for each search document in each query command for indicating a degree of similarity between the anchor document and each search document;
creating a result dataset containing the computed normalized document similarity scores for each search document; and
sending a response including the result dataset to the client.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19
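The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `QueryCommand`, `document_similarity`, and `handle_request`, the dictionary-shaped request, and the use of `difflib.SequenceMatcher` as the per-attribute measure are all assumptions made for illustration. The only property taken from the claim is that each search document receives a normalized score between 0.00 and 1.00 relative to the anchor document.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass
class QueryCommand:
    """Hypothetical query command: one anchor and its search documents."""
    anchor: dict[str, str]
    search_documents: dict[str, dict[str, str]]  # document id -> attributes


def attribute_similarity(a: str, b: str) -> float:
    # Assumed measure: SequenceMatcher.ratio() already lies in [0.0, 1.0].
    return SequenceMatcher(None, a, b).ratio()


def document_similarity(anchor: dict[str, str], candidate: dict[str, str]) -> float:
    """Average the per-attribute scores over the anchor's attributes."""
    if not anchor:
        return 0.0
    total = sum(
        attribute_similarity(value, candidate.get(name, ""))
        for name, value in anchor.items()
    )
    return round(total / len(anchor), 2)


def execute(command: QueryCommand) -> dict[str, float]:
    """Score every search document in one pass and build the result dataset."""
    return {
        doc_id: document_similarity(command.anchor, doc)
        for doc_id, doc in command.search_documents.items()
    }


def handle_request(request: dict) -> dict:
    """Receive a request, generate query commands, execute them, respond."""
    commands = [
        QueryCommand(anchor=request["anchor"], search_documents=request["documents"])
    ]
    results: dict[str, float] = {}
    for command in commands:
        results.update(execute(command))
    return {"result": results}
```

Averaging over the anchor's attributes keeps the document score in the same 0.00 to 1.00 range as the attribute scores; a weighted average would preserve that property as long as the weights sum to 1.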
20. A system for performing similarity searching, comprising:
a gateway for receiving a request instruction from a client for initiating a similarity search;
the gateway for generating one or more query commands from the request instruction, each query command designating an anchor document and at least one search document;
a search manager for executing each query command, including:
means for computing a normalized document similarity score having a value of between 0.00 and 1.00 for each search document in each query command for indicating a degree of similarity between the anchor document and each search document;
means for creating a result dataset containing the computed normalized document similarity scores for each search document; and
the gateway for sending a response including the result dataset to the client.
Dependent claims: 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33
34. A system for performing similarity searching, comprising:
a gateway for handling all communication between a client, a virtual document manager and a search manager;
the virtual document manager connected between the gateway and a relational database management system for providing document management;
the search manager connected between the gateway and the relational database management system for searching and scoring documents; and
the relational database management system for providing relational data management, document and measure persistence, and similarity measure execution.
Dependent claims: 35, 36, 37, 38
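The component arrangement in claim 34 can be sketched as follows. This is an illustrative assumption, not the patent's design: an in-memory SQLite database stands in for the relational database management system, a Python function registered through `sqlite3`'s `create_function` stands in for a similarity measure executed inside the database, and the class names, the `documents` table, and the `store`/`search` command names are hypothetical.

```python
import sqlite3
from difflib import SequenceMatcher


def similarity(a, b):
    # Assumed similarity measure, registered below as a SQL user-defined function.
    return SequenceMatcher(None, a or "", b or "").ratio()


class VirtualDocumentManager:
    """Document management on top of the database (hypothetical API)."""

    def __init__(self, db):
        self.db = db

    def store(self, doc_id, name):
        self.db.execute(
            "INSERT INTO documents (id, name) VALUES (?, ?)", (doc_id, name)
        )


class SearchManager:
    """Searching and scoring; the measure itself runs inside the database."""

    def __init__(self, db):
        self.db = db

    def search(self, anchor_name):
        rows = self.db.execute(
            "SELECT id, similarity(name, ?) AS score "
            "FROM documents ORDER BY score DESC, id",
            (anchor_name,),
        )
        return [(doc_id, round(score, 2)) for doc_id, score in rows]


class Gateway:
    """Single entry point: all client traffic is routed through handle()."""

    def __init__(self):
        self.db = sqlite3.connect(":memory:")
        self.db.execute("CREATE TABLE documents (id TEXT PRIMARY KEY, name TEXT)")
        self.db.create_function("similarity", 2, similarity)
        self.documents = VirtualDocumentManager(self.db)
        self.searcher = SearchManager(self.db)

    def handle(self, command, **args):
        if command == "store":
            self.documents.store(args["doc_id"], args["name"])
            return {"status": "ok"}
        if command == "search":
            return {"status": "ok", "results": self.searcher.search(args["anchor"])}
        return {"status": "error", "message": f"unknown command: {command}"}
```

In this sketch the client never touches the two managers or the database directly, mirroring the claim's gateway that handles all communication, while scoring is pushed down into the database through the registered function.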
39. A method for performing similarity searching, comprising the steps of:
creating a search schema document by a virtual document manager;
generating one or more query commands by a gateway;
executing one or more query commands in a search manager and relational database management system for determining the degree of similarity between an anchor document and search documents; and
assembling a result document containing document similarity scores of between 0.00 and 1.00.
Dependent claims: 40, 41, 42, 43, 44, 45, 46, 47
Specification