GLOBAL ANCHOR TEXT PROCESSING
First Claim
1. A computer-implemented method for building a search index:
- while building the search index and using the search index to respond to one or more search requests,maintaining an anchor information store, wherein each entry of the anchor information store identifies a referring document, a target document, and anchor text associated with a link from the referring document to the target document;
receiving a document for processing;
locating one or more entries in the anchor information store for which the document to be processed is identified as the target document;
retrieving anchor text from each of the identified entries; and
storing the retrieved anchor text in an entry of the search index for the document.
1 Assignment
0 Petitions
Accused Products
Abstract
Provided are techniques for building a search index. While building the search index and using the search index to respond to one or more search requests, an anchor information store is maintained, wherein each entry of the anchor information store identifies a referring document, a target document, and anchor text associated with a link from the referring document to the target document; a document is received for processing; one or more entries in the anchor information store for which the document to be processed is identified as the target document are located; anchor text is retrieved from each of the identified entries; and the retrieved anchor text is stored in an entry of the search index for the document.
32 Citations
36 Claims
-
1. A computer-implemented method for building a search index:
while building the search index and using the search index to respond to one or more search requests, maintaining an anchor information store, wherein each entry of the anchor information store identifies a referring document, a target document, and anchor text associated with a link from the referring document to the target document; receiving a document for processing; locating one or more entries in the anchor information store for which the document to be processed is identified as the target document; retrieving anchor text from each of the identified entries; and storing the retrieved anchor text in an entry of the search index for the document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
11. A computer-implemented method for maintaining a search index, comprising:
-
maintaining an anchor information store, wherein each entry of the anchor information store identifies a referring document, a target document, and anchor text associated with a link from the referring document to the target document; maintaining a rebuild agenda, wherein each entry in the rebuild agenda identifies a target document; and asynchronously, for each target document identified in each entry in the rebuild agenda, using the anchor information store to find anchor text for that target document; and updating the anchor text in an entry of the search index for the target document, wherein the anchor text for the target document is updated incrementally. - View Dependent Claims (12)
-
-
13. A computer program product comprising a computer useable medium including a computer readable program, wherein the computer readable program when executed on a computer causes the computer to:
while building the search index and using the search index to respond to one or more search requests, maintain an anchor information store, wherein each entry of the anchor information store identifies a referring document, a target document, and anchor text associated with a link from the referring document to the target document; receive a document for processing; locate one or more entries in the anchor information store for which the document to be processed is identified as the target document; retrieve anchor text from each of the identified entries; and store the retrieved anchor text in an entry of the search index for the document. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22)
-
23. A computer program product comprising a computer useable medium including a computer readable program, wherein the computer readable program when executed on a computer causes the computer to:
-
maintain an anchor information store, wherein each entry of the anchor information store identifies a referring document, a target document, and anchor text associated with a link from the referring document to the target document; maintain a rebuild agenda, wherein each entry in the rebuild agenda identifies a target document; and asynchronously, for each target document identified in each entry in the rebuild agenda, use the anchor information store to find anchor text for that target document; and update the anchor text in an entry of the search index for the target document, wherein the anchor text for the target document is updated incrementally. - View Dependent Claims (24)
-
-
25. A system for building a search index, comprising:
logic capable of performing operations, the operations comprising; while building the search index and using the search index to respond to one or more search requests, maintaining an anchor information store, wherein each entry of the anchor information store identifies a referring document, a target document, and anchor text associated with a link from the referring document to the target document; receiving a document for processing; locating one or more entries in the anchor information store for which the document to be processed is identified as the target document; retrieving anchor text from each of the identified entries; and storing the retrieved anchor text in an entry of the search index for the document. - View Dependent Claims (26, 27, 28, 29, 30, 31, 32, 33, 34)
-
35. A system for building a search index, comprising:
logic capable of performing operations, the operations comprising; maintaining an anchor information store, wherein each entry of the anchor information store identifies a referring document, a target document, and anchor text associated with a link from the referring document to the target document; maintaining a rebuild agenda, wherein each entry in the rebuild agenda identifies a target document; and asynchronously, for each target document identified in each entry in the rebuild agenda, using the anchor information store to find anchor text for that target document; and updating the anchor text in an entry of the search index for the target document, wherein the anchor text for the target document is updated incrementally. - View Dependent Claims (36)
Specification