Click distance determination
First Claim
1. A computer-implemented method for determining a click distance associated with documents on a network, comprising:
- storing document and link information for the documents;
within the document and link information, additionally storing a specialized word in association with a target document, wherein the specialized word designates a source document corresponding to the target document;
including the specialized word in an inverted index, wherein the locally stored inverted index relates the specialized word with an identifier of the target document; and
assigning a click distance to the source document when an inverted index is queried for the target document according to a query that passes in the specialized word.
2 Assignments
0 Petitions
Accused Products
Abstract
An efficient determination of a click distance value is made for each document in a corpus of documents from data included in a locally-stored inverted index. The click distance is measurement of the number clicks or user navigations from a first document on the network to another document. Specialized words are included in the locally-stored inverted index. The specialized words relate source documents to a set of target documents. A click distance is assigned to a source document when an inverted index is queried for the corresponding set of target documents according to a query that passes in one of the specialized words. The process is repeated for each document in the corpus of documents.
142 Citations
20 Claims
-
1. A computer-implemented method for determining a click distance associated with documents on a network, comprising:
-
storing document and link information for the documents;
within the document and link information, additionally storing a specialized word in association with a target document, wherein the specialized word designates a source document corresponding to the target document;
including the specialized word in an inverted index, wherein the locally stored inverted index relates the specialized word with an identifier of the target document; and
assigning a click distance to the source document when an inverted index is queried for the target document according to a query that passes in the specialized word. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system, comprising:
-
a document interface that is arranged to provide access to documents stored on a network;
an anchor text table that is arranged to store document and link information corresponding to the documents on the network, wherein the anchor text table includes records of target documents and their associated anchor text;
specialized words that are appended to the anchor text associated with each target document, wherein the specialized word is configured to identify a source document corresponding to each target document;
an inverted index that is arranged to list words included in anchor text and the target documents associated with each word, such that the specialized words are also listed in the inverted index with the target documents associated with each specialized word; and
a client interface that is arranged to implement a search engine, wherein the search engine determines a click distance associated with each document stored on the network by incrementing a click distance value associated with each document stored on the network when a query to the inverted index for the target documents correspond to the document stored on the network is made. - View Dependent Claims (16, 17, 18, 19)
-
-
20. A computer-readable medium that includes computer-executable instructions for determining click distance, the instructions comprising:
-
storing document and link information for documents on a network such that a network graph representing the network is initiated in memory;
storing each document represented in the network graph in a queue when the document has a click distance value that is different from a first click distance value; and
when the queue is not empty;
retrieving a document from the queue, determining target documents associated with the retrieved document by querying an anchor index, wherein the anchor index includes specialized words that are arranged to associate the documents on the network with their target documents;
assigning a click distance for each of the target documents associated with the retrieved document, wherein each target document is updated with a new click distance value other than the first click distance value when each target document'"'"'s click distance is greater than the click distance associated with the removed document plus a variable, and adding each of the target documents to the queue that have been updated.
-
Specification