Integrated full text search system and method
First Claim
1. A full-text search computer implemented system comprising:
- a plug-in component executed on a computer provided by a third party developer that defines a relevant score algorithm and a full-text index schema, wherein the relevance algorithm facilitates ranking matching documents and providing a list of documents in order of their relevance; and
a search component to receive and utilize the plug-in component to query data from a data store, populate an index in accordance with the provided index schema and utilize the index to generate a list of matching documents in order of their relevance as specified by the relevant score algorithm, the query data is modified by one or more components specified by the third party developer;
wherein the third party developer specified components are executed as separate daemon processes managed by an external host controller component from within a database management system.
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to an integrated full-text search architecture that integrates full text search with database server applications in a tight manner so that it becomes part of the server in all aspects including storage, metadata, query processing, back/up restore and resource management. This provides improved performance with respect to queries as well as improved manageability. Furthermore, the subject architecture is open and schema directed unlike other conventional full-text search architectures. Accordingly, the full-text search architecture enables expert users (e.g., search providers) to define their own index structures as well as ranking algorithms.
51 Citations
30 Claims
-
1. A full-text search computer implemented system comprising:
-
a plug-in component executed on a computer provided by a third party developer that defines a relevant score algorithm and a full-text index schema, wherein the relevance algorithm facilitates ranking matching documents and providing a list of documents in order of their relevance; and a search component to receive and utilize the plug-in component to query data from a data store, populate an index in accordance with the provided index schema and utilize the index to generate a list of matching documents in order of their relevance as specified by the relevant score algorithm, the query data is modified by one or more components specified by the third party developer; wherein the third party developer specified components are executed as separate daemon processes managed by an external host controller component from within a database management system. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A full-text indexing computer implemented system comprising:
-
a gatherer component to retrieve a document from a data store; a producer pipeline component that parses the structure and text of the retrieved document in accordance with a plurality of third party developer specified components; and a consumer pipeline component that receives data from the producer pipeline component and persists data to an inverted index, upon receipt of data, the consumer pipeline component consumes the data and takes action via executing a message through transacted message queues; the inverted index is modified by the third party developer specified components wherein the third party developer specified components are executed as separate daemon processes managed by an external host controller component from within a database management system. - View Dependent Claims (9, 10, 11)
-
-
12. A full text query computer implemented system tightly integrated with a database management system comprising:
-
a parser component that tokenizes received queries; an execution plan generation system the generates an execution plan based on tokens received from the parser component and a ranking algorithm provided by a third party developer via a ranking plug-in component, wherein the ranking algorithm ranks documents from most to least relevant; an execution engine component that utilizes the execution plan to search an index and produce a list of matching documents in order as specified by the ranking agorithm; the execution plan is modified by the third party developer specified component wherein the third party developer specified component is executed as separate daemon process managed by an external host controller component from within a database management system. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19)
-
-
20. A computer implemented method of employing a customized full-text query comprising:
-
retrieving a full-text indexing schema and ranking algorithm from a plug-in component provided by a third party developer; populating an index in accordance with the provided indexing schema; receiving a query; generating a list of documents utilizing the index; displaying the list of documents by ranked in accordance with their relevance by the ranking algorithm, wherein the list of documents are displayed in order from most to least relevant; and modifying the query by the third party developer specified component wherein the third party developer specified component is executed as separate daemon process managed by an external host controller component from within a database management system. - View Dependent Claims (21)
-
-
22. A customized indexing computer implemented methodology comprising:
-
retrieving a document from a data source; removing document formatting data and emitting text chunks; parsing the text chunks into keywords; persisting the keywords to an index, the index schema being defined by a third party developer; employing a ranking algorithm to facilitate ranking matching documents, wherein the ranking algorithm ranks documents from most to least relevant; and displaying a list of documents by rank in accordance with the ranking algorithm; and modifying the index by the third party developer specified component wherein the third party developer specified component is executed as separate daemon process managed by an external host controller component from within a database management system. - View Dependent Claims (23, 24, 25, 26)
-
-
27. A full text search computer implemented methodology comprising:
-
receiving a search request; generating a query expression in response to the search request which includes a third party developer specified ranking algorithm for determining the relevance of result documents, wherein the ranking algorithm ranks documents form most to least relevant; modifying the query expression to include, remove, add, or modify keyword terms, wherein the query expression is modified by one or more components specified by a third party developer and the third party developer specified components are executed as separate daemon processes managed by an external host controller component from within a database management system; and displaying a list of documents by rank in accordance with the ranking algorithm. - View Dependent Claims (28, 29, 30)
-
Specification