Systems and methods for analyzing and clustering search queries
First Claim
1. A method for analyzing search query relationships, the method comprising the following operations performed by one or more processors:
- receiving, over an electronic network, log data relating to a plurality of search queries received from users;
generating a click graph representing relationships among a plurality of queries and a plurality of visited query results associated with each of the plurality of queries,wherein the click graph depicts at least one first layer relationship between a first query and a second query in the plurality of queries, the first layer relationship indicating that at least one of the plurality of visited query results is associated with both the first query and the second query, andfurther wherein the click graph depicts at least one second layer relationship between the first query and second query in the plurality of queries, the second layer relationship indicating that each of the first and second queries has a first layer relationship to a third query;
computing a numeric value representing a degree of the at least one second layer relationship;
identifying temporal similarities between at least one pair of the plurality of queries, the temporal similarities determined based on a temporal distance between peaks in frequency of occurrence for both queries in the at least one pair of the plurality of queries;
evaluating the at least one pair of queries based on the generated click graph and the identified temporal similarities to determine whether the at least one pair of queries are related; and
designating the queries in the at least one pair of queries as related based on the computed value being greater than zero and the temporal distance being below a threshold value.
7 Assignments
0 Petitions
Accused Products
Abstract
Computerized systems and methods are disclosed for analyzing search query relationships and managing electronic content. In accordance with one implementation, log data pertaining to a plurality of queries may be received over an electronic network. A click graph may be generated representing one or more relationships between the queries. Further, temporal similarities may be identified between the queries, for example, by looking at peaks in frequency of queries over time. A pair of search queries may be evaluated based on the generated click graph and the identified temporal similarities to determine whether the queries in the pair are related.
25 Citations
30 Claims
-
1. A method for analyzing search query relationships, the method comprising the following operations performed by one or more processors:
-
receiving, over an electronic network, log data relating to a plurality of search queries received from users; generating a click graph representing relationships among a plurality of queries and a plurality of visited query results associated with each of the plurality of queries, wherein the click graph depicts at least one first layer relationship between a first query and a second query in the plurality of queries, the first layer relationship indicating that at least one of the plurality of visited query results is associated with both the first query and the second query, and further wherein the click graph depicts at least one second layer relationship between the first query and second query in the plurality of queries, the second layer relationship indicating that each of the first and second queries has a first layer relationship to a third query; computing a numeric value representing a degree of the at least one second layer relationship; identifying temporal similarities between at least one pair of the plurality of queries, the temporal similarities determined based on a temporal distance between peaks in frequency of occurrence for both queries in the at least one pair of the plurality of queries; evaluating the at least one pair of queries based on the generated click graph and the identified temporal similarities to determine whether the at least one pair of queries are related; and designating the queries in the at least one pair of queries as related based on the computed value being greater than zero and the temporal distance being below a threshold value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for analyzing search query relationships, the system comprising:
-
a server configured to receive, over an electronic network, log data relating to a plurality of search queries received from users; and a processor configured to; generate a click graph representing relationships among a plurality of queries and a plurality of visited query results associated with each of the plurality of queries, wherein the click graph depicts at least one first layer relationship between a first query and a second query in the plurality of queries, the first layer relationship indicating that at least one of the plurality of visited query results is associated with both the first query and the second query, and further wherein the click graph depicts at least one second layer relationship between the first query and second query in the plurality of queries, the second layer relationship indicating that each of the first and second queries has a first layer relationship to a third query; compute a numeric value representing a degree of the at least one second layer relationship; identify temporal similarities between at least one pair of the plurality of queries, the temporal similarities determined based on a temporal distance between peaks in frequency of occurrence for both queries in the at least one pair of the plurality of queries; evaluate the at least one pair of queries based on the generated click graph and the identified temporal similarities to determine whether the at least one pair of queries are related; and designate the queries in the at least one pair of queries as related based on the computed value being greater than zero and the temporal distance being below a threshold value. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20, 26, 27, 28)
-
-
21. A computer-readable storage medium including instructions for analyzing search query relationships, which, when executed by at least one processor, cause the processor to perform steps comprising:
-
receiving, over an electronic network, log data relating to a plurality of search queries received from users; generating a click graph representing relationships among a plurality of queries and a plurality of visited query results associated with each of the plurality of queries, wherein the click graph depicts at least one first layer relationship between a first query and a second query in the plurality of queries, the first layer relationship indicating that at least one of the plurality of visited query results is associated with both the first query and the second query, and further wherein the click graph depicts at least one second layer relationship between the first query and second query in the plurality of queries, the second layer relationship indicating that each of the first and second queries has a first layer relationship to a third query; computing a numeric value representing a degree of the at least one second layer relationship; identifying temporal similarities between at least one pair of the plurality of queries, the temporal similarities determined based on a temporal distance between peaks in frequency of occurrence for both queries in the at least one pair of the plurality of queries; evaluating the at least one pair of queries based on the generated click graph and the identified temporal similarities to determine whether the at least one pair of queries are related; and designating the queries in the at least one pair of queries as related based on the computed value being greater than zero and the temporal distance being below a threshold value. - View Dependent Claims (22, 23, 24, 25, 29, 30)
-
Specification