CONTENT DATA INDEXING WITH CONTENT ASSOCIATIONS
First Claim
1. In a computing system having access to multiple content entities, each content entity including searchable content, a method for building a database for facilitating searching and retrieving of content entities in an efficient manner that returns results of content entities expected to be found, the method comprising:
- creating a skeleton database for storing a search index table and one or more other tables for facilitating a search for content entities within one or more content sources;
inserting index data into the skeleton database, the index data including index entries pointing to content within the one or more content sources;
processing content entities from a first content source and inserting data associated with the content entities into the search index table;
adding to the skeleton database associations between content entities of the one or more content sources and processing related content entities identified by the associations; and
outputting the skeleton database into an output file that includes the search index table and the one or more other tables, and detaching the output file from a build server used to create the skeleton database.
3 Assignments
0 Petitions
Accused Products
Abstract
A full text indexing system is provided for processing content associated with data applications such as encyclopedia and dictionary applications. A build process collects data from various sources, processes the data into constituent parts, including alternative word sets, and stores the constituent parts in structured database tables. A run-time process is used to query the database tables and the results in order to provide effective matches in an efficient manner. Run-time processing is optimized by preprocessing all steps that are query-independent during the build process. A double word table representing all possible word pair combinations for each index entry and an alternative word table are used to further optimize runtime processing.
100 Citations
20 Claims
-
1. In a computing system having access to multiple content entities, each content entity including searchable content, a method for building a database for facilitating searching and retrieving of content entities in an efficient manner that returns results of content entities expected to be found, the method comprising:
-
creating a skeleton database for storing a search index table and one or more other tables for facilitating a search for content entities within one or more content sources;
inserting index data into the skeleton database, the index data including index entries pointing to content within the one or more content sources;
processing content entities from a first content source and inserting data associated with the content entities into the search index table;
adding to the skeleton database associations between content entities of the one or more content sources and processing related content entities identified by the associations; and
outputting the skeleton database into an output file that includes the search index table and the one or more other tables, and detaching the output file from a build server used to create the skeleton database. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. In a computing system having access to multiple content entities, each content entity including searchable content, a method performing a run-time search for tokens related to key terms input in a user query, the method comprising:
-
receiving a user query, the user query including one or more key terms input by a user for searching and retrieving content entities;
converting the user query to a tokenized query;
sending the original user query and the tokenized query to a database;
in the database, conducting an exact match on the user query;
in the database, conducting an exact match on the tokenized query;
identifying all terms in the user query and tokenized query which are valid words in the one or more search indexes;
searching the one or more search indexes with the key terms of the user query; and
returning to a user a result set of all content entities matching the user query and the tokenized query. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification