Similarity search engine for use with relational databases
First Claim
1. A software system implemented by instructions in a computer system for performing similarity searching, comprising:
- a gateway that handles all communication between a client and the software system, between the gateway and a virtual document manager and between the gateway and a search manager;
the virtual document manager connected between the gateway and a relational database management system that receives command from the gateway and provides schema and document management;
the search manager connected between the gateway and the relational database management system that receives commands from the gateway, initiates attribute token similarity searches in the relational database management system using prestored attribute token similarity measure algorithms, receives attribute token similarity search results from the relational database management system, aggregates the attribute token similarity search results into document similarity search results sets, and forwards the document similarity search result sets to the gateway;
the relational database management system that provides relational data management, document and attribute token similarity measure algorithm persistence, and attribute token similarity measure algorithm execution; and
the gateway that sends documents from the virtual document manager and document similarity search result sets from the search manager to the client.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention provides a system and method for defining a schema and sending a query to a Similarity Search Engine to determine a quantitative assessment of the similarity of attributes between an anchor record and one or more target records. The Similarity Search Engine makes a similarity assessment in a single pass through the target records having multiple relationship characteristics. The Similarity Search Engine is a server configuration that comprises a Gateway for command and response routing, a Virtual Document Manager for document generation, a Search Manager for document scoring, and an Relational Database Management System for providing data persistence, data retrieval and access to User Defined Functions. The Similarity Search Engine uses a unique command syntax based on the Extensible Markup Language to implement functions necessary for similarity searching and scoring.
-
Citations
14 Claims
-
1. A software system implemented by instructions in a computer system for performing similarity searching, comprising:
-
a gateway that handles all communication between a client and the software system, between the gateway and a virtual document manager and between the gateway and a search manager; the virtual document manager connected between the gateway and a relational database management system that receives command from the gateway and provides schema and document management; the search manager connected between the gateway and the relational database management system that receives commands from the gateway, initiates attribute token similarity searches in the relational database management system using prestored attribute token similarity measure algorithms, receives attribute token similarity search results from the relational database management system, aggregates the attribute token similarity search results into document similarity search results sets, and forwards the document similarity search result sets to the gateway; the relational database management system that provides relational data management, document and attribute token similarity measure algorithm persistence, and attribute token similarity measure algorithm execution; and the gateway that sends documents from the virtual document manager and document similarity search result sets from the search manager to the client. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A software method implemented by instructions in a computer system for performing similarity searching, comprising the steps of:
-
receiving a request by a gateway from a client for initiating a similarity search; creating a search schema document by a virtual document manager; generating one or more query commands by the gateway; executing one or more query commands in a search manager that causes a relational database management system to perform attribute token similarity searches; determining the degree of similarity between an anchor document and search documents by the search manager that aggregates attribute token similarity search scores from the relational database management system; and
assembles document similarity search result sets containing document similarity scores of between 0.00 and 1.00; andsending the similarity search sets to the client by the gateway. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13, 14)
-
Specification