Verifying relevance between keywords and Web site contents
First Claim
1. A method for verifying relevance between terms and Web site contents, the method comprising:
- retrieving site contents from a bid URL;
formulating expanded term(s) semantically and/or contextually related to bid term(s), generating content similarity and expanded similarity measurements from respective combinations of the bid term(s), the site contents, and the expanded terms, the similarity measurements indicating relatedness between respective ones of the bid term(s), site contents, and/or expanded terms;
calculating category similarity measurements between the expanded terms and the site contents in view of a similarity classifier, the similarity classifier having been trained from mined web site content associated with directory data;
calculating a confidence value from combined ones of multiple similarity measurements, the combined ones comprising content, expanded, and category similarity measurements, the confidence value providing an objective measure of relevance between the bid term(s) and the site contents.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for verifying relevance between terms and Web site contents are described. In one aspect, site contents from a bid URL are retrieved. Expanded term(s) semantically and/or contextually related to bid term(s) are calculated. Content similarity and expanded similarity measurements are calculated from respective combinations of the bid term(s), the site contents, and the expanded terms. Category similarity measurements between the expanded terms and the site contents are determined in view of a trained similarity classifier. The trained similarity classifier having been trained from mined web site content associated with directory data. A confidence value providing an objective measure of relevance between the bid term(s) and the site contents is determined from the content, expanded, and category similarity measurements evaluating the multiple similarity scores in view of a trained relevance classifier model.
-
Citations
45 Claims
-
1. A method for verifying relevance between terms and Web site contents, the method comprising:
-
retrieving site contents from a bid URL;
formulating expanded term(s) semantically and/or contextually related to bid term(s), generating content similarity and expanded similarity measurements from respective combinations of the bid term(s), the site contents, and the expanded terms, the similarity measurements indicating relatedness between respective ones of the bid term(s), site contents, and/or expanded terms;
calculating category similarity measurements between the expanded terms and the site contents in view of a similarity classifier, the similarity classifier having been trained from mined web site content associated with directory data;
calculating a confidence value from combined ones of multiple similarity measurements, the combined ones comprising content, expanded, and category similarity measurements, the confidence value providing an objective measure of relevance between the bid term(s) and the site contents. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A computer-readable medium comprising computer-executable instructions for verifying relevance between terms and Web site contents, the computer-executable instructions comprising instructions for:
-
retrieving site contents from a bid URL;
formulating expanded term(s) semantically and/or contextually related to bid term(s), generating content similarity and expanded similarity measurements from respective combinations of the bid term(s), the site contents, and the expanded terms, the similarity measurements indicating relatedness between respective ones of the bid term(s), site contents, and/or expanded terms;
calculating category similarity measurements between the expanded terms and the site contents in view of a similarity classifier, the similarity classifier having been trained from mined web site content associated with directory data;
calculating a confidence value from combined ones of multiple similarity measurements, the combined ones comprising content, expanded, and category similarity measurements, the confidence value providing an objective measure of relevance between the bid term(s) and the site contents. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A computing device for verifying relevance between terms and Web site contents, the computing device comprising:
-
a processor; and
a memory coupled to the processor, the memory comprising computer-program instructions executable by the processor for;
retrieving site contents from a bid URL;
formulating expanded term(s) semantically and/or contextually related to bid term(s), generating content similarity and expanded similarity measurements from respective combinations of the bid term(s), the site contents, and the expanded terms, the similarity measurements indicating relatedness between respective ones of the bid term(s), site contents, and/or expanded terms;
calculating category similarity measurements between the expanded terms and the site contents in view of a similarity classifier, the similarity classifier having been trained from mined web site content associated with directory data;
calculating a confidence value from combined ones of multiple similarity measurements, the combined ones comprising content, expanded, and category similarity measurements, the confidence value providing an objective measure of relevance between the bid term(s) and the site contents. - View Dependent Claims (26, 27, 28, 29, 30, 31, 32, 33, 34, 35)
-
-
36. A computing device for verifying relevance between terms and Web site contents, the computing device comprising:
-
retrieving means to obtain site contents from a bid URL;
formulating means to identify expanded term(s) semantically and/or contextually related to bid term(s), generating means to create content similarity and expanded similarity measurements from respective combinations of the bid term(s), the site contents, and the expanded terms, the similarity measurements indicating relatedness between respective ones of the bid term(s), site contents, and/or expanded terms;
calculating means to determine category similarity measurements between the expanded terms and the site contents in view of a similarity classifier, the similarity classifier having been trained from mined web site content associated with directory data;
calculating means to generate a confidence value from combined ones of multiple similarity measurements, the combined ones comprising content, expanded, and category similarity measurements, the confidence value providing an objective measure of relevance between the bid term(s) and the site contents. - View Dependent Claims (37, 38, 39, 40, 41, 42, 43, 44)
-
-
45. A computing device as recited in 44, wherein the identifying means further comprise:
-
generating means to generate a set of term clusters from term vectors based on calculated term similarity, the term vectors being generated from search engine results of submitted historical queries, each historical query having a relatively low frequency of occurrence as compared to other query terms in a query log; and
evaluating means to evaluate the site contents in view of term(s) specified by the term clusters to identify one or more semantically and/or contextually related terms, the terms being the one or more other terms.
-
Specification