Search engine with hierarchically stored indices
First Claim
1. A method for indexing data items in a database, the method comprising:
- retrieving data items associated with a respective ranking from a database, wherein the ranking is based at least in part on a relevance score for the data item;
mapping the data items on to at least a first tier and a second tier based on the respective rankings of the data items;
producing at least a first and a second sub-index from the primary index based on the mapping;
storing the first sub-index in a first plurality of search nodes logically arranged in a first plurality of columns; and
storing the second sub-index in a second plurality of search nodes logically arranged in a second plurality of columns.
3 Assignments
0 Petitions
Accused Products
Abstract
A search engine comprising a crawler which crawls the WWW and stores pages found on the WWW in a database. An indexer indexes the pages in the database to produce a primary index. A document mapping section maps pages in the database into a plurality of tiers based on a ranking of the pages. The ranking may be based on portions of the pages which have a relatively higher value context. A processor produces a plurality of sub-indices from the primary index based on the mapping. The sub-indices are stored in a search node cluster. The cluster is a matrix of search nodes logically arranged in a plurality of rows and columns. Search nodes in the same column include the same sub-index. Search nodes in the same row include distinct sub-indices. A search query received by a user is sent to a dispatcher which, in turn, forwards the query to the first tier of search nodes. A fall through algorithm is disclosed which indicates when the dispatcher should forward the search query to other tiers of search nodes.
82 Citations
17 Claims
-
1. A method for indexing data items in a database, the method comprising:
-
retrieving data items associated with a respective ranking from a database, wherein the ranking is based at least in part on a relevance score for the data item; mapping the data items on to at least a first tier and a second tier based on the respective rankings of the data items; producing at least a first and a second sub-index from the primary index based on the mapping; storing the first sub-index in a first plurality of search nodes logically arranged in a first plurality of columns; and storing the second sub-index in a second plurality of search nodes logically arranged in a second plurality of columns. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for searching a database, the method comprising:
-
retrieving data items associated with a respective ranking from a database, wherein the ranking is based at least in part on a relevance score for the data item; mapping data items on to at least a first tier and a second tier based on the respective rankings of the data items; producing at least a first and a second sub-index from the primary index based on the mapping; storing the first sub-index in a first plurality of search nodes logically arranged in a first plurality of columns; storing the second sub-index in a second plurality of search nodes logically arranged in a second plurality of columns; receiving a search query; and searching the first tier for result data items relating to the search query and providing one or more of the result data items to a user. - View Dependent Claims (9, 10, 11)
-
-
12. A system for indexing a database, the system comprising:
-
a crawler which crawls the database to find data items; an indexer which receives the data items associated with a respective ranking, wherein the ranking is based at least in part on a relevance score for the data item, produces a primary index; a document mapping section which maps data items on to at least a first and a second tier based on the respective rankings of the data items; a processor which produces at least a first and a second sub-index from the primary index based on the mapping; a first plurality of search nodes logically arranged in a first plurality of columns for storing the first sub-index; and a second plurality of search nodes logically arranged in a second plurality of columns for storing the second sub-index. - View Dependent Claims (17)
-
-
13. A search engine comprising:
-
a crawler which crawls a database to find data items; an indexer which receives the data items associated with a respective ranking wherein the ranking is based at least in part on a relevance score for the data item, produces a primary index; a document mapping section which maps data items on to at least a first and a second tier based on the respective rankings of the data items; a processor which produces at least a first and a second sub-index from the primary index based on the mapping; a first plurality of search nodes logically arranged in a first plurality of columns for storing the first sub-index; a second plurality of search nodes logically arranged in a second plurality of columns for storing the second sub-index; and a dispatcher which receives a query and forwards the query to a search node in the first plurality of search nodes. - View Dependent Claims (14, 15, 16)
-
Specification