Substantially similar queries
First Claim
Patent Images
1. A method comprising the following computer-executable acts:
- analyzing a relationship between a first query and a second query based at least in part upon search results previously selected by users, wherein the search results previously selected by the users were presented to the users in response to submission of the first query and/or the second query to a search engine, wherein analyzing the relationship between the first query and the second query comprises;
accessing a data repository that comprises a computer-implemented bipartite graph that includes a first set of nodes and a second set of nodes, wherein the first set of nodes represents queries and the second set of nodes represents URLs, wherein the first set of nodes includes a first node that is representative of the first query and a second node that is representative of the second query, and wherein the graph further comprises edges that are weighted to indicate relationships between queries and URLs;
initiating a random walk at the first node; and
determining a number of steps in the random walk until the second node is reached in the random walk, wherein a step is from a node in the first set of nodes to another node in the first set of nodes;
determining whether the first query is substantially similar to the second query based at least in part upon the number of steps in the random walk until the second node is reached in the random walk; and
generating correlation data that correlates the first query and the second query if the first query and second query are determined to be substantially similar.
2 Assignments
0 Petitions
Accused Products
Abstract
A system described herein includes analyzer component that analyzes queries submitted by users and corresponding URLs selected by the users, wherein the queries include a first query and a second query, and wherein the analyzer component determines that the first query and the second query are substantially similar queries. The system additionally includes a correlator component that, responsive to the analyzer component determining that the first query and the second query are substantially similar, generates correlation data that indicates that the first and second queries are substantially similar.
56 Citations
20 Claims
-
1. A method comprising the following computer-executable acts:
-
analyzing a relationship between a first query and a second query based at least in part upon search results previously selected by users, wherein the search results previously selected by the users were presented to the users in response to submission of the first query and/or the second query to a search engine, wherein analyzing the relationship between the first query and the second query comprises; accessing a data repository that comprises a computer-implemented bipartite graph that includes a first set of nodes and a second set of nodes, wherein the first set of nodes represents queries and the second set of nodes represents URLs, wherein the first set of nodes includes a first node that is representative of the first query and a second node that is representative of the second query, and wherein the graph further comprises edges that are weighted to indicate relationships between queries and URLs; initiating a random walk at the first node; and determining a number of steps in the random walk until the second node is reached in the random walk, wherein a step is from a node in the first set of nodes to another node in the first set of nodes; determining whether the first query is substantially similar to the second query based at least in part upon the number of steps in the random walk until the second node is reached in the random walk; and generating correlation data that correlates the first query and the second query if the first query and second query are determined to be substantially similar. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system, comprising:
-
a processor; a data repository that includes a computer-implemented bipartite graph, wherein the bipartite graph includes a first set of nodes that represent queries, a second set of nodes that represent URLs, and weighted edges that represent relationships between queries and URLs, wherein the edges are weighted based at least in part upon a number of user selections of URLs given particular queries, wherein the first set of nodes includes a first node that represents a first query and a second node that represents a second query; a memory that comprises a plurality of components that are executed by the processor, the plurality of components comprising; an analyzer component that analyzes queries submitted by users and corresponding URLs selected by the users, wherein the queries include the first query and the second query, and wherein the analyzer component initiates a random walk at the first node in the bi-partite graph and counts a number of steps taken during the random walk until the random walk reaches the second node in the bipartite graph, the analyzer component determining that the first query and the second query are substantially similar based at least in part upon the number of steps taken during the random walk; and a correlator component that, responsive to the analyzer component determining that the first query and the second query are substantially similar, generates correlation data that indicates that the first and second queries are substantially similar. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A computer-readable data storage device comprising instructions that, when executed by a processor, cause the processor to perform acts comprising:
-
receiving a first query; accessing a computer-implemented bipartite graph, wherein the bipartite graph includes a first set of nodes that represent queries and a second set of nodes that represent URLs, wherein the first set of nodes includes a first node that is representative of the first query and wherein the bipartite graph further includes a first weighted edge that couples the first node to at least one node in the second set of nodes and includes weighted edges that couple nodes in the first set of nodes with nodes in the second set of nodes; initiating a random walk from the first node, wherein edges in the random walk are selected pseudo-randomly while considering weights of the edges; and outputting a second query that is determined to be substantially similar to the first query based at least in part upon a number of steps taken during the random walk between the first node and a second node that represents the second query. - View Dependent Claims (19, 20)
-
Specification