ENRICHED DOCUMENT REPRESENTATIONS USING AGGREGATED ANCHOR TEXT
Abstract
A system and method for aggregating anchor text over the web graph and using the aggregated anchor text to enrich document representations. For a target page, its internal inlinks, which are pages that point to the target page and lie within the site containing the target page, are identified first. Then external anchors that point to the internal inlinks from pages outside of the site are identified. The anchor text of the external anchors is collected, weighted, stored, and used to enrich document representations. The method not only reduces the number of pages with no anchor text, but also adds lines of anchor text to URLs.
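The aggregation described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names `site_of` and `aggregate_anchor_text`, the `(source, destination, anchor text)` tuple layout, and the choice to treat the host name as the "site" are all assumptions made for illustration. The weighting step is left out because this section does not say how weights are computed.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def site_of(url: str) -> str:
    """Return the host name of a URL.

    Assumption: a "site" is identified by its host name; the source
    text does not define the term more precisely.
    """
    return urlsplit(url).netloc.lower()


def aggregate_anchor_text(target_url: str, links: list) -> dict:
    """Collect external anchor text that points at internal inlinks of a target.

    `links` is a list of (source_url, destination_url, anchor_text) tuples,
    a hypothetical flat representation of the web graph.
    Returns a dict mapping each internal inlink to the list of anchor texts
    of external anchors that point to it.
    """
    target_site = site_of(target_url)

    # Internal inlinks: pages on the target's site that link to the target.
    internal_inlinks = {
        src
        for src, dst, _ in links
        if dst == target_url and src != target_url and site_of(src) == target_site
    }

    # External anchors: links from other sites that point to an internal inlink.
    aggregated = defaultdict(list)
    for src, dst, text in links:
        if dst in internal_inlinks and site_of(src) != target_site:
            aggregated[dst].append(text)
    return dict(aggregated)


if __name__ == "__main__":
    example_links = [
        ("http://a.example/index", "http://a.example/page", "next"),
        ("http://b.example/blog", "http://a.example/index", "Acme home"),
        ("http://c.example/list", "http://a.example/index", "acme widgets"),
        ("http://a.example/other", "http://a.example/index", "home"),
    ]
    print(aggregate_anchor_text("http://a.example/page", example_links))
```

In this sketch the target page `http://a.example/page` has no external anchor of its own, yet it still receives two lines of anchor text through its internal inlink `http://a.example/index`. This is the effect the abstract describes as reducing the number of pages with no anchor text.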
20 Claims
1. A computer implemented method comprising:
- receiving a URL of a target page;
- identifying at least one internal inlink, which is a page pointing to the target page and within a site containing the target page;
- identifying at least one external anchor that points to the at least one internal inlink from a page outside of the site;
- collecting anchor text of the at least one external anchor; and
- storing in a database the external anchor text of the at least one internal inlink as aggregated anchor text of the target page.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A computer system comprising:
- a processor for receiving a URL of a target page;
  identifying at least one internal inlink, which is a page pointing to the target page and within a site containing the target page;
  identifying at least one external anchor that points to the first internal inlink from a page outside of the site; and
  collecting anchor text of the at least one external anchor; and
- a data storage device for storing the external anchor text of the at least one internal inlink as aggregated anchor text of the target page.
Dependent claims: 11
12. A computer program product comprising a computer-readable medium having instructions which, when performed by a computer, perform a method comprising:
- receiving a URL of a target page;
- identifying at least one internal inlink, which is a page pointing to the target page and within a site containing the target page;
- identifying at least one external anchor that points to the at least one internal inlink from a page outside of the site;
- collecting anchor text of the at least one external anchor; and
- storing in a database the external anchor text of the at least one internal inlink as aggregated anchor text of the target page.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20
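The final step of the claimed method, storing the collected external anchor text in a database as aggregated anchor text of the target page, might look like the following sketch. It uses Python's standard `sqlite3` module purely for illustration. The table name `aggregated_anchor_text`, its three columns, and both function names are hypothetical; the claims do not prescribe any schema or database product.

```python
import sqlite3


def store_aggregated_anchor_text(conn, target_url, anchors_by_inlink):
    """Persist anchor text keyed by target page; return the number of rows written.

    `anchors_by_inlink` maps an internal inlink URL to the list of external
    anchor texts collected for it (a hypothetical input shape).
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS aggregated_anchor_text ("
        "target_url TEXT NOT NULL, "
        "internal_inlink TEXT NOT NULL, "
        "anchor_text TEXT NOT NULL)"
    )
    rows = [
        (target_url, inlink, text)
        for inlink, texts in anchors_by_inlink.items()
        for text in texts
    ]
    with conn:  # commits on success, rolls back on error
        conn.executemany(
            "INSERT INTO aggregated_anchor_text VALUES (?, ?, ?)", rows
        )
    return len(rows)


def load_aggregated_anchor_text(conn, target_url):
    """Return all stored anchor text for a target page, in insertion order."""
    cursor = conn.execute(
        "SELECT anchor_text FROM aggregated_anchor_text "
        "WHERE target_url = ? ORDER BY rowid",
        (target_url,),
    )
    return [row[0] for row in cursor]


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    store_aggregated_anchor_text(
        connection,
        "http://a.example/page",
        {"http://a.example/index": ["Acme home", "acme widgets"]},
    )
    print(load_aggregated_anchor_text(connection, "http://a.example/page"))
```

Keeping the internal inlink in each row is a design choice of this sketch. It leaves room for a later step to treat anchor text differently depending on which inlink it arrived through.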
Specification