Weight-based stemming for improving search quality
First Claim
1. A computer-implemented method comprising:
- receiving a search query;
identifying a first original query term based on the query;
identifying a first expanded query term related to the first original query term;
determining a first lexical distance between the first original query term and the first expanded query term;
determining a first weight for the first expanded query term based on the determined first lexical distance;
identifying a plurality of documents, from among a corpus of documents, as each relevant to the search query, the plurality of documents including a first document identified based on an inclusion of the first expanded query term;
ranking the plurality of documents, with the ranking of the first document being based upon the calculated first weight; and
generating a response to the search query identifying two or more of the plurality of documents, ordered according to the ranking.
2 Assignments
0 Petitions
Accused Products
Abstract
A technique including receiving a search query; identifying a first original query term based on the query; identifying a first expanded query term related to the first original query term; determining a first lexical distance between the first original query term and the first expanded query term; determining a first weight for the first expanded query term based on the determined first lexical distance; identifying a plurality of documents, from among a corpus of documents, as each relevant to the search query, the plurality of documents including a first document identified based on its inclusion of the first expanded query term; ranking the plurality of documents, with the ranking of the first document being based upon the calculated first weight; and generating a response to the search query identifying two or more of the plurality of documents, ordered according to the ranking.
-
Citations
20 Claims
-
1. A computer-implemented method comprising:
-
receiving a search query; identifying a first original query term based on the query; identifying a first expanded query term related to the first original query term; determining a first lexical distance between the first original query term and the first expanded query term; determining a first weight for the first expanded query term based on the determined first lexical distance; identifying a plurality of documents, from among a corpus of documents, as each relevant to the search query, the plurality of documents including a first document identified based on an inclusion of the first expanded query term; ranking the plurality of documents, with the ranking of the first document being based upon the calculated first weight; and generating a response to the search query identifying two or more of the plurality of documents, ordered according to the ranking. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A search system comprising:
-
a query expansion engine programmed to receive a search query; identify a first original query term based on the query; identify a first expanded query term related to the first original query term; determine a first lexical distance between the first original query term and the first expanded query term; and determine a first weight for the first expanded query term based on the determined first lexical distance; a search system programmed to identify a plurality of documents, from among a corpus of documents, as each relevant to the search query, the plurality of documents including a first document identified based on an inclusion of the first expanded query term; and a ranking engine programmed to rank the plurality of documents, with the ranking of the first document being based upon the calculated first weight, wherein the search system is further programmed to generate a response to the search query identifying two or more of the plurality of documents, ordered according to the ranking. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification