Click-through log mining
First Claim
Patent Images
1. A method for providing keyword searches, implemented at least in part by a computing device, the method comprising:
- processing raw search click-through log data based at least in part on set time intervals including a month, a day, and a year, to generate a click-through log;
generating ordered query keywords from the raw search click-through log data in which the ordered query keywords include query-uniform resource locator pairs;
constructing a bipartite graph based at least in part on the query-uniform resource locator pairs in which one set of vertices corresponds to queries and a second set of vertices corresponds to uniform resource locators;
based at least in part on the bipartite graph, utilizing an algorithm to compute(a) similarities between queries in which the similarities of the queries are based at least in part on visiting similar web pages associated with the uniform resource locators, and(b) similarities between the web pages associated with the uniform resource locators in which the similarities of the web pages are based at least in part on being visited by similar type queries to capture related phrases, wherein the algorithm comprises determining relationships between the queries and the web pages by iteratively computing similarities between the queries and the web pages;
identifying advertising keywords based at least in part on the similarities of the queries and the similarities of the web pages used to capture the related phrases by using a keyword expansion file associated with the set time intervals to expand the queries to include the advertising keywords according to a query-uniform resource locator (URL) correlation, the advertising keywords are based at least in part on a bidding criteria for advertisements; and
suggesting the related phrases that have similar page-click behaviors based at least in part on the bidding criteria for the advertisements.
2 Assignments
0 Petitions
Accused Products
Abstract
Click-through log mining is described. Raw search click-through log data is processed to generate ordered query keywords, utilizing an algorithm to expand user-submitted keywords to include high frequency user queries, managing the keywords for a keyword expansion file, analyzing the algorithm performance on a bidding criteria, and identifying related phrases with similar page-click behaviors for advertisements.
-
Citations
17 Claims
-
1. A method for providing keyword searches, implemented at least in part by a computing device, the method comprising:
-
processing raw search click-through log data based at least in part on set time intervals including a month, a day, and a year, to generate a click-through log; generating ordered query keywords from the raw search click-through log data in which the ordered query keywords include query-uniform resource locator pairs; constructing a bipartite graph based at least in part on the query-uniform resource locator pairs in which one set of vertices corresponds to queries and a second set of vertices corresponds to uniform resource locators; based at least in part on the bipartite graph, utilizing an algorithm to compute (a) similarities between queries in which the similarities of the queries are based at least in part on visiting similar web pages associated with the uniform resource locators, and (b) similarities between the web pages associated with the uniform resource locators in which the similarities of the web pages are based at least in part on being visited by similar type queries to capture related phrases, wherein the algorithm comprises determining relationships between the queries and the web pages by iteratively computing similarities between the queries and the web pages; identifying advertising keywords based at least in part on the similarities of the queries and the similarities of the web pages used to capture the related phrases by using a keyword expansion file associated with the set time intervals to expand the queries to include the advertising keywords according to a query-uniform resource locator (URL) correlation, the advertising keywords are based at least in part on a bidding criteria for advertisements; and suggesting the related phrases that have similar page-click behaviors based at least in part on the bidding criteria for the advertisements. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer-readable storage media comprising computer-readable instructions executed on a computing device, the computer-readable instructions comprising instructions for:
-
processing raw search click-through log data based at least in part on set time intervals including a month, a day, and a year, to generate a click-through log; generating ordered query keywords from the raw search click-through log data in which the ordered query keywords include query-uniform resource locator pairs; constructing a bipartite graph based at least in part on the query-uniform resource locator pairs in which one set of vertices corresponds to queries and a second set of vertices corresponds to uniform resource locators; based at least in part on the bipartite graph, utilizing an algorithm to compute (a) similarities between queries in which the similarities of the queries are based at least in part on visiting similar web pages associated with the uniform resource locators, and (b) similarities between the web pages associated with the uniform resource locators in which the similarities of the web pages are based at least in part on being visited by similar type queries to capture related phrases, wherein the algorithm comprises determining relationships between the queries and the web pages by iteratively computing similarities between the queries and the web pages; identifying advertising keywords based at least in part on the similarities of the queries and the similarities of the web pages used to capture the related phrases for a keyword expansion file associated with the set time intervals to expand the queries to include the advertising keywords according to a query-uniform resource locator (URL) correlation, the advertising keywords are based at least in part on a bidding criteria for advertisements; and suggesting the related phrases that have similar page-click behaviors based at least in part on the bidding criteria for the advertisements. - View Dependent Claims (11, 12, 13)
-
-
14. A keyword suggestion system, comprising:
-
a processor; a memory coupled to the processor, wherein the processor is configured for; processing raw search click-through log data based at least in part on set time intervals including a month, a day, and a year, to generate a click-through log, the ordered query keywords include keywords that are frequently submitted by users; generating ordered query keywords from the raw search click-through log data in which the ordered query keywords include query-uniform resource locator pairs; constructing a bipartite graph based at least in part on the query-uniform resource locator pairs in which one set of vertices corresponds to queries and a second set of vertices corresponds to uniform resource locators; based at least in part on the bipartite graph, utilizing an algorithm to compute (a) similarities between queries in which the similarities of the queries are based at least in part on visiting similar web pages associated with the uniform resource locators, and (b) similarities between the web pages associated with the uniform resource locators in which the similarities of the web pages are based at least in part on being visited by similar type queries to capture related phrases, wherein the algorithm comprises determining relationships between the queries and the web pages by iteratively computing similarities between the queries and the web pages; identifying advertising keywords based at least in part on the similarities of the queries and the similarities of the web pages used to capture the related phrases by using a keyword expansion file associated with the set time intervals to expand the queries to include the advertising keywords according to a query-uniform resource locator (URL) correlation, the advertising keywords are based at least in part on a bidding criteria for advertisements, wherein the bidding criteria comprises at least one of a click-through rate or a revenue per search; and suggesting the related phrases that have similar page-click behaviors based at least in part on the bidding criteria for the advertisements. - View Dependent Claims (15, 16, 17)
-
Specification