Query Intent in Information Retrieval
Abstract
Inferring query intent in information retrieval is described. In an example, reformulations of an initial query by a user are used to create a query neighborhood, and the query neighborhood is used to identify a set of possibly related queries. First and higher order reformulations of the initial query may be used to expand the query neighborhood. In an example, precision can be improved by reducing the query neighborhood to more closely related queries; for example, two queries can be connected if they often lead to clicks on the same document. In another example, two queries can be connected using a random walk, and all pairs of queries that are not connected by a random walk of less than a fixed threshold are removed. The connected queries can be used to form clusters, and weights can be applied in order to determine the most likely related queries.
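The final step of the abstract, forming clusters from connected queries and weighting them, can be illustrated with a minimal sketch. It assumes that a cluster is a connected component of the filtered graph and that a cluster's weight is the sum of its edge counts; the abstract fixes neither choice, and the function names and sample queries are hypothetical.

```python
from collections import defaultdict

def cluster_queries(edges):
    """Group queries into clusters, taken here to be connected components.

    `edges` maps a (query_a, query_b) pair to a hypothetical weight, for
    example how often that reformulation was observed.
    """
    adjacency = defaultdict(set)
    for a, b in edges:
        adjacency[a].add(b)
        adjacency[b].add(a)

    clusters, seen = [], set()
    for start in sorted(adjacency):
        if start in seen:
            continue
        # Depth-first traversal collects every query connected to `start`.
        component, stack = set(), [start]
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(adjacency[node] - component)
        seen |= component
        clusters.append(component)
    return clusters

def rank_clusters(edges, clusters):
    """Order clusters by total edge weight, heaviest first (assumed scoring)."""
    def weight(cluster):
        return sum(w for (a, b), w in edges.items()
                   if a in cluster and b in cluster)
    return sorted(clusters, key=weight, reverse=True)

# Illustrative edges that survived filtering; the data is made up.
edges = {
    ("jaguar", "jaguar car"): 5,
    ("jaguar car", "jaguar xf price"): 3,
    ("jaguar cat", "jaguar habitat"): 2,
}
ranked = rank_clusters(edges, cluster_queries(edges))
```

Under this scoring, the heaviest cluster is treated as the most likely intent; other weightings (for example, normalizing by cluster size) would fit the same structure.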
20 Claims
1. A method for use at a search engine arranged to receive queries and to provide search results for those queries, the search results comprising links to documents, the method comprising:
receiving a query;
accessing query reformulation data and using that data to create a query neighborhood around the received query, the query neighborhood being a graph data structure in memory comprising a node representing the received query and links connecting that node to other nodes representing observed reformulations of that query;
accessing evidence about queries related to the received query comprising a record of query-document pairs;
using a processor to filter the query neighborhood by editing the graph data structure on the basis of the accessed evidence to produce and store in memory a reformulation graph;
using the reformulation graph to identify queries related to the received query.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
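The steps of claim 1 can be illustrated with a minimal sketch. It assumes the reformulation data is a list of (query, reformulated query) pairs and the evidence is a list of (query, clicked document) pairs. The function names, the `max_order` and `min_shared` parameters, and the shared-click filtering rule are all hypothetical choices for illustration, not the claimed implementation.

```python
from collections import defaultdict, deque

def build_neighborhood(reformulations, query, max_order=2):
    """Collect reformulation edges within `max_order` steps of `query`."""
    successors = defaultdict(set)
    for source, target in reformulations:
        successors[source].add(target)

    edges, depth = set(), {query: 0}
    frontier = deque([query])
    while frontier:
        node = frontier.popleft()
        if depth[node] == max_order:
            continue  # do not expand beyond the requested order
        for nxt in successors[node]:
            edges.add((node, nxt))
            if nxt not in depth:
                depth[nxt] = depth[node] + 1
                frontier.append(nxt)
    return edges

def filter_neighborhood(edges, clicks, min_shared=1):
    """Keep an edge only if both queries share >= `min_shared` clicked documents."""
    docs = defaultdict(set)
    for q, d in clicks:
        docs[q].add(d)
    return {(a, b) for a, b in edges if len(docs[a] & docs[b]) >= min_shared}

def related_queries(reformulation_graph, query):
    """Return queries still reachable from `query` in the filtered graph."""
    adjacency = defaultdict(set)
    for a, b in reformulation_graph:
        adjacency[a].add(b)
    found, stack = set(), [query]
    while stack:
        for nxt in adjacency[stack.pop()]:
            if nxt not in found and nxt != query:
                found.add(nxt)
                stack.append(nxt)
    return found

# Made-up logs for illustration only.
reformulations = [("jaguar", "jaguar car"), ("jaguar", "jaguar cat"),
                  ("jaguar car", "jaguar xf"), ("jaguar", "jaguars nfl")]
clicks = [("jaguar", "jaguar.com"), ("jaguar car", "jaguar.com"),
          ("jaguar car", "cars.example/xf"), ("jaguar xf", "cars.example/xf"),
          ("jaguar cat", "wiki.example/Jaguar")]

neighborhood = build_neighborhood(reformulations, "jaguar")
graph = filter_neighborhood(neighborhood, clicks)
related = related_queries(graph, "jaguar")
```

In this toy data the edges to "jaguar cat" and "jaguars nfl" are dropped because those queries share no clicked document with "jaguar", so only the car-related reformulations remain.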
15. A query generation apparatus comprising:
an input arranged to receive a query to be issued to a search engine to retrieve documents;
a memory holding a query neighborhood for the received query, the query neighborhood being a graph data structure comprising a node representing the received query and links connecting that node to other nodes representing observed reformulations of that query;
a processor arranged to access evidence about queries related to the received query comprising a record of query-document pairs;
the processor being arranged to filter the query neighborhood by editing the graph data structure on the basis of the accessed evidence to produce a reformulation graph;
the processor being arranged to use the reformulation graph to generate queries related to the received query.
Dependent claims: 16, 17
18. A method for use at a search engine arranged to receive queries and to provide search results for those queries, the search results comprising links to documents, the method comprising:
receiving a query;
accessing query reformulation data and using that data to create a query neighborhood around the received query, the query neighborhood being a graph data structure in memory comprising a node representing the received query and links connecting that node to other nodes representing observed reformulations of that query;
accessing a click graph comprising nodes connected by edges, each node representing any of a document and a query and each edge representing at least one observed click;
using a processor to filter the query neighborhood by editing the query neighborhood on the basis of the click graph to produce and store in memory a reformulation graph;
using the reformulation graph to identify queries related to the received query.
Dependent claims: 19, 20
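The click graph of claim 18 can be pictured as a bipartite graph of query and document nodes. The sketch below is one hedged reading: it keeps a neighborhood edge only when the two queries are joined in the click graph by a path of at most `max_steps` edges. That bounded-path test is an assumed, deterministic stand-in for the random-walk threshold mentioned in the abstract, and all names and data are hypothetical.

```python
from collections import defaultdict, deque

def build_click_graph(click_log):
    """Undirected bipartite graph: ("q", query) nodes link to ("d", document) nodes."""
    graph = defaultdict(set)
    for query, document in click_log:
        graph[("q", query)].add(("d", document))
        graph[("d", document)].add(("q", query))
    return graph

def within_steps(graph, source, target, max_steps):
    """True if `target` is reachable from `source` in at most `max_steps` edges."""
    seen, frontier = {source}, deque([(source, 0)])
    while frontier:
        node, steps = frontier.popleft()
        if node == target:
            return True
        if steps == max_steps:
            continue  # path budget exhausted along this branch
        for nxt in graph.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                frontier.append((nxt, steps + 1))
    return False

def filter_by_click_graph(neighborhood, graph, max_steps=2):
    """Drop neighborhood edges whose queries are not close in the click graph."""
    return {(a, b) for a, b in neighborhood
            if within_steps(graph, ("q", a), ("q", b), max_steps)}

# Made-up click log and neighborhood for illustration only.
click_log = [("jaguar", "jaguar.com"), ("jaguar car", "jaguar.com"),
             ("jaguar car", "cars.example/xf"), ("jaguar xf", "cars.example/xf"),
             ("jaguar cat", "wiki.example/Jaguar")]
neighborhood = {("jaguar", "jaguar car"), ("jaguar", "jaguar xf"),
                ("jaguar", "jaguar cat")}

click_graph = build_click_graph(click_log)
strict = filter_by_click_graph(neighborhood, click_graph, max_steps=2)
loose = filter_by_click_graph(neighborhood, click_graph, max_steps=4)
```

With `max_steps=2`, two queries must share a clicked document directly; raising the limit to 4 also admits queries linked through one intermediate query, which trades precision for recall.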
Specification