Propagating information among web pages
Abstract
Web pages of a Website may be processed to improve search results. For example, information likely to pertain to more than just the Web page it is directly associated with may be identified. One or more other, related Web pages to which such information is likely to pertain are also identified. The identified information is associated with the identified other Web page(s), and this association is saved in a way that affects a search result score of the Web page(s).
44 Claims
1. A computer-implemented method comprising:
- selecting a candidate term from among one or more terms in a first web page of a web site, wherein the first web page is not a home or root web page of the web site;
- determining that the candidate term is uncommon, wherein an uncommon term is a term that occurs with less than a threshold frequency in a collection of web pages; and
- in an index of web pages that identifies terms and, for each term, identifies one or more web pages that are associated with the term, updating an association of the candidate term with a home or root web page of the web site to generate an updated index of web pages, including updating a boost value associated with the association of the candidate term with the home or root web page in the index of web pages.
Dependent claims: 2, 3, 4, 5
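The steps of claim 1 can be illustrated with a minimal sketch. It assumes a toy in-memory index that maps each term to a dictionary of page URL to boost value, and it measures frequency as the fraction of pages in the collection that contain the term. The function names, the example URLs, the 0.5 threshold, and the fixed boost increment are all hypothetical; the claim does not specify any of them.

```python
from collections import defaultdict

def document_frequency(term, collection):
    """Fraction of pages in the collection that contain the term."""
    if not collection:
        return 0.0
    hits = sum(1 for terms in collection.values() if term in terms)
    return hits / len(collection)

def is_uncommon(term, collection, threshold):
    """A term is uncommon if it occurs with less than the threshold frequency."""
    return document_frequency(term, collection) < threshold

def propagate_to_home(index, term, home_url, boost_increment=1.0):
    """Update (or create) the term -> home page association and raise its boost."""
    index[term][home_url] = index[term].get(home_url, 0.0) + boost_increment
    return index

# Hypothetical collection: page URL -> set of terms on that page.
collection = {
    "http://example.com/": {"welcome", "products"},
    "http://example.com/widgets/zorblatt": {"zorblatt", "products"},
    "http://example.org/": {"welcome", "news"},
    "http://example.org/about": {"welcome", "about"},
}

# Build the initial index with a neutral boost of 0.0 for every association.
index = defaultdict(dict)
for url, terms in collection.items():
    for t in terms:
        index[t][url] = 0.0

first_page = "http://example.com/widgets/zorblatt"  # not the home page
home = "http://example.com/"
for term in sorted(collection[first_page]):
    if is_uncommon(term, collection, threshold=0.5):
        propagate_to_home(index, term, home)
```

In this sketch only "zorblatt" (present on one of four pages) falls below the assumed threshold, so only its association with the home page receives a boost; "products" appears on half the pages and is left alone.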
6. A system comprising:
- one or more processors;
- at least one input device; and
- one or more storage devices storing processor-executable instructions which, when executed by the one or more processors, cause the one or more processors to perform operations comprising:
  - selecting a candidate term from among one or more terms in a first web page of a web site, wherein the first web page is not a home or root web page of the web site;
  - determining that the candidate term is uncommon, wherein an uncommon term is a term that occurs with less than a threshold frequency in a collection of web pages; and
  - in an index of web pages that identifies terms and, for each term, identifies one or more web pages that are associated with the term, updating an association of the candidate term with a home or root web page of the web site to generate an updated index of web pages, including updating a boost value associated with the association of the candidate term with the home or root web page in the index of web pages.
Dependent claims: 7, 8, 9, 10
11. A computer-implemented method comprising:
- selecting a candidate term from among one or more terms in a first web page of a web site, wherein the first web page is not a home or root web page of the web site;
- determining that the candidate term is uncommon, wherein an uncommon term is a term that occurs with less than a threshold frequency in a collection of web pages;
- identifying in the web site a second web page that is above the first web page in a hierarchy of the web site and that is within a predetermined distance from the first web page in the hierarchy; and
- in an index of web pages that identifies terms and, for each term, identifies one or more web pages that are associated with the term, updating an association of the candidate term with the second web page of the web site to generate an updated index of web pages, including updating a boost value associated with the association of the candidate term with the second web page in the index of web pages to generate the updated index of web pages.
Dependent claims: 12, 13, 14, 15, 16, 17
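The "second web page ... above the first web page in a hierarchy ... within a predetermined distance" step of claim 11 can be sketched as follows. This is only one possible reading: it assumes the site hierarchy mirrors the URL path, so that each path segment removed is one level up. The claim does not say how the hierarchy is derived, and the helper name `ancestors_within` and the example URLs are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit

def ancestors_within(url, max_distance):
    """Return URLs of pages above `url`, nearest first, at most max_distance levels up.

    Assumption: the hierarchy is the URL path, one level per path segment.
    """
    parts = urlsplit(url)
    segments = [s for s in parts.path.split("/") if s]
    result = []
    for distance in range(1, max_distance + 1):
        if distance > len(segments):
            break  # already at the root; nothing further above
        kept = segments[: len(segments) - distance]
        path = "/" + "/".join(kept) + ("/" if kept else "")
        result.append(urlunsplit((parts.scheme, parts.netloc, path, "", "")))
    return result

# Candidate second pages for a first page three levels deep, distance limit 2.
candidates = ancestors_within("http://example.com/a/b/c.html", 2)
```

Any page returned by such a helper could then serve as the second web page whose association with the candidate term is updated in the index.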
18. A system comprising:
- one or more processors;
- at least one input device; and
- one or more storage devices storing processor-executable instructions which, when executed by the one or more processors, cause the one or more processors to perform operations comprising:
  - selecting a candidate term from among one or more terms in a first web page of a web site, wherein the first web page is not a home or root web page of the web site;
  - determining that the candidate term is uncommon, wherein an uncommon term is a term that occurs with less than a threshold frequency in a collection of web pages;
  - identifying in the web site a second web page that is above the first web page in a hierarchy of the web site and that is within a predetermined distance from the first web page in the hierarchy; and
  - in an index of web pages that identifies terms and, for each term, identifies one or more web pages that are associated with the term, updating an association of the candidate term with the second web page of the web site to generate an updated index of web pages, including updating a boost value associated with the association of the candidate term with the second web page in the index of web pages to generate the updated index of web pages.
Dependent claims: 19, 20, 21, 22, 23, 24
25. A computer-implemented method comprising:
- selecting a candidate term from among one or more terms in a first web page of a web site, wherein the first web page is not a home or root web page of the web site;
- determining that the candidate term is uncommon, wherein an uncommon term is a term that occurs with less than a threshold frequency in a collection of web pages; and
- in an index of web pages that identifies terms and, for each term, identifies one or more web pages that are associated with the term, updating an association of the candidate term with a home or root web page of the web site to generate an updated index of web pages, wherein the index of web pages has no association between the candidate term and the home or root web page of the web site, and wherein updating the association of the candidate term with the home or root web page of the web site comprises adding an association of the candidate term with the home or root web page of the web site to generate the updated index of web pages;
- receiving a search query including the candidate term; and
- determining that the home or root web page satisfies the query if the updated index is used to perform the search requested by the search query.
Dependent claims: 26, 27, 28
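Claim 25 differs from claim 1 in that the index initially has no association between the candidate term and the home page, and the update adds one so that a query containing the term is satisfied by the home page. A minimal sketch of that before/after behavior follows. It assumes an index that maps each term to a set of page URLs and a simple "every query term is associated with the page" test; both are illustrative assumptions, as are the names `add_association` and `satisfies`.

```python
def add_association(index, term, page):
    """Return an updated copy of the index with `term` associated with `page`."""
    updated = {t: set(pages) for t, pages in index.items()}
    updated.setdefault(term, set()).add(page)
    return updated

def satisfies(index, query_terms, page):
    """True if every query term is associated with the page in the index."""
    return all(page in index.get(t, set()) for t in query_terms)

# Original index: the uncommon term is associated only with the deep page.
index = {"zorblatt": {"http://example.com/widgets/zorblatt"}}
home = "http://example.com/"

updated = add_association(index, "zorblatt", home)
before = satisfies(index, ["zorblatt"], home)    # original index
after = satisfies(updated, ["zorblatt"], home)   # updated index
```

Returning a copy rather than mutating in place keeps the original and updated indexes side by side, which makes the "if the updated index is used" condition of the claim easy to see.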
29. A system comprising:
- one or more processors;
- at least one input device; and
- one or more storage devices storing processor-executable instructions which, when executed by the one or more processors, cause the one or more processors to perform operations comprising:
  - selecting a candidate term from among one or more terms in a first web page of a web site, wherein the first web page is not a home or root web page of the web site;
  - determining that the candidate term is uncommon, wherein an uncommon term is a term that occurs with less than a threshold frequency in a collection of web pages; and
  - in an index of web pages that identifies terms and, for each term, identifies one or more web pages that are associated with the term, updating an association of the candidate term with a home or root web page of the web site to generate an updated index of web pages;
  - receiving a search query including the candidate term; and
  - determining that the home or root web page satisfies the query if the updated index is used to perform the search requested by the search query.
Dependent claims: 30, 31, 32
33. A computer-implemented method comprising:
- selecting a candidate term from among one or more terms in a first web page of a web site, wherein the first web page is not a home or root web page of the web site;
- determining that the candidate term is uncommon, wherein an uncommon term is a term that occurs with less than a threshold frequency in a collection of web pages;
- identifying in the web site a second web page that is above the first web page in a hierarchy of the web site and that is within a predetermined distance from the first web page in the hierarchy; and
- in an index of web pages that identifies terms and, for each term, identifies one or more web pages that are associated with the term, updating an association of the candidate term with the second web page of the web site to generate an updated index of web pages, wherein the index of web pages has no association between the candidate term and the second web page of the web site, and wherein updating the association of the candidate term with the second web page of the web site comprises adding an association of the candidate term with the second web page of the web site to generate the updated index of web pages;
- receiving a search query including the candidate term; and
- determining that the second web page satisfies the query if the updated index is used to perform the search requested by the search query.
Dependent claims: 34, 35, 36, 37, 38
39. A system comprising:
- one or more processors;
- at least one input device; and
- one or more storage devices storing processor-executable instructions which, when executed by the one or more processors, cause the one or more processors to perform operations comprising:
  - selecting a candidate term from among one or more terms in a first web page of a web site, wherein the first web page is not a home or root web page of the web site;
  - determining that the candidate term is uncommon, wherein an uncommon term is a term that occurs with less than a threshold frequency in a collection of web pages;
  - identifying in the web site a second web page that is above the first web page in a hierarchy of the web site and that is within a predetermined distance from the first web page in the hierarchy; and
  - in an index of web pages that identifies terms and, for each term, identifies one or more web pages that are associated with the term, updating an association of the candidate term with the second web page of the web site to generate an updated index of web pages, wherein the index of web pages has no association between the candidate term and the second web page of the web site, and wherein updating the association of the candidate term with the second web page of the web site comprises adding an association of the candidate term with the second web page of the web site to generate the updated index of web pages;
  - receiving a search query including the candidate term; and
  - determining that the second web page satisfies the query if the updated index is used to perform the search requested by the search query.
Dependent claims: 40, 41, 42, 43, 44
Specification