Search engine with hierarchically stored indices
First Claim
1. A method for indexing data items in a database, the method comprising:
- retrieving data items from a database;
producing a primary index of the data items;
mapping the data items on to at least a first tier and a second tier based on respective rankings of the data items;
producing at least a first and a second sub-index from the primary index based on the mapping; and
storing the at least a first and second sub-index in different search nodes.
3 Assignments
0 Petitions
Accused Products
Abstract
A search engine comprising a crawler which crawls the WWW and stores pages found on the WWW in a database. An indexer indexes the pages in the database to produce a primary index. A document mapping section maps pages in the database into a plurality of tiers based on a ranking of the pages. The ranking may be based on portions of the pages which have a relatively higher value context. A processor produces a plurality of sub-indices from the primary index based on the mapping. The sub-indices are stored in a search node cluster. The cluster is a matrix of search nodes logically arranged in a plurality of rows and columns. Search nodes in the same column include the same sub-index. Search nodes in the same row include distinct sub-indices. A search query received by a user is sent to a dispatcher which, in turn, forwards the query to the first tier of search nodes. A fall through algorithm is disclosed which indicates when the dispatcher should forward the search query to other tiers of search nodes.
-
Citations
16 Claims
-
1. A method for indexing data items in a database, the method comprising:
-
retrieving data items from a database;
producing a primary index of the data items;
mapping the data items on to at least a first tier and a second tier based on respective rankings of the data items;
producing at least a first and a second sub-index from the primary index based on the mapping; and
storing the at least a first and second sub-index in different search nodes. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for searching a database, the method comprising:
-
retrieving data items from a database;
producing a primary index of the data items;
mapping data items on to at least a first tier and a second tier based on respective rankings of the data items;
producing at least a first and a second sub-index from the primary index based on the mapping;
storing the at least a first and second sub-index in different search nodes;
receiving a search query; and
searching the first tier for result data items relating to the search query. - View Dependent Claims (9, 10)
-
-
11. A system for indexing a database, the system comprising:
-
a crawler which crawls the database to find data items;
an indexer which receives the data items and produces a primary index;
a document mapping section which maps data items on to at least a first and a second tier based on respective rankings of the data items;
a processor which produces at least a first and a second sub-index from the primary index based on the mapping;
a first search node which stores the first sub-index; and
a second search node which stores the second sub-index.
-
-
12. A search node cluster for enabling a search of a database, the cluster comprising:
-
search nodes logically arranged in a plurality of columns and plurality of rows;
all search nodes in any one of the columns including substantially the same information;
all search nodes in any one of the rows including distinct information;
the search nodes in the rows being logically divided into at least a first and a second tier;
the search nodes in the first tier including an index for a first portion of the database; and
the search nodes in the second tier including an index for a second portion of the database;
whereinthe data in the first and second tier is based on respective rankings of the information in the first and second portion of the database.
-
-
13. A search engine comprising:
-
a crawler which crawls a database to find data items;
an indexer which receives the data items and produces a primary index;
a document mapping section which maps data items on to at least a first and a second tier based on respective rankings of the data items;
a processor which produces at least a first and a second sub-index from the primary index based on the mapping;
a first search node which stores the first sub-index;
a second search node which stores the second sub-index; and
a dispatch which receives a query and forwards the query to the first search node. - View Dependent Claims (14, 15, 16)
-
Specification