EFFICIENT FORWARD RANKING IN A SEARCH ENGINE
First Claim
1. A computer-implemented method for using a forward index to extract information for ranking documents based on a search query, the method comprising:
- receiving a search query;
parsing the search query to identify one or more atoms;
creating a token map of query tokens using the one or more atoms parsed from the search query;
for a first document, identifying, in a first entry of a forward index, document tokens in a token stream corresponding to the first document that match the query tokens in the token map;
for the document tokens that match the query tokens, updating a data structure with a position in the token stream corresponding to the first document of each of the document tokens that match the query tokens; and
utilizing data in the data structure, extracting ranking information for ranking calculations from the first entry of the forward index.
3 Assignments
0 Petitions
Accused Products
Abstract
Methods and computer storage media are provided for generating entries for documents in a forward index. A document and its document identification are received, in addition to static features that are query-independent. The document is parsed into tokens to form a token stream corresponding to the document. Relevant data used to calculate rankings of document is identified and a position of the data is determined. The entry is then generated from the document identification, the token stream of the document, the static features, and the positional information of the relevant data. The entry is stored in the forward index.
6 Citations
20 Claims
-
1. A computer-implemented method for using a forward index to extract information for ranking documents based on a search query, the method comprising:
-
receiving a search query; parsing the search query to identify one or more atoms; creating a token map of query tokens using the one or more atoms parsed from the search query; for a first document, identifying, in a first entry of a forward index, document tokens in a token stream corresponding to the first document that match the query tokens in the token map; for the document tokens that match the query tokens, updating a data structure with a position in the token stream corresponding to the first document of each of the document tokens that match the query tokens; and utilizing data in the data structure, extracting ranking information for ranking calculations from the first entry of the forward index. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. One or more computer storage memory storing computer-useable instructions that, when used by one or more computing devices, cause the one or more computing devices to perform a method for using a forward index to extract information for ranking documents based on a search query, the method comprising:
-
receiving a search query; parsing the search query to identify one or more atoms; creating a token map of query tokens using the one or more atoms parsed from the search query; for a first document, identifying, in a first entry of a forward index, document tokens in a token stream corresponding to the first document that match the query tokens in the token map; for the document tokens that match the query tokens, updating a data structure with a position in the token stream corresponding to the first document of each of the document tokens that match the query tokens; and utilizing data in the data structure, extracting ranking information for ranking calculations from the first entry of the forward index. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
-
18. A system for using a forward index to extract information for ranking documents based on a search query, the system comprising:
-
an index generator having one or more processors and one or more computer-readable storage media; and a forward index coupled with the index generator, wherein the index generator is configured for; receiving a search query; parsing the search query to identify one or more atoms; creating a token map of query tokens using the one or more atoms parsed from the search query; for a first document, identifying, in a first entry of the forward index, document tokens in a token stream corresponding to the first document that match the query tokens in the token map; for the document tokens that match the query tokens, updating a data structure with a position in the token stream corresponding to the first document of each of the document tokens that match the query tokens; and utilizing data in the data structure, extracting ranking information for ranking calculations from the first entry of the forward index. - View Dependent Claims (19, 20)
-
Specification