Feature diffusion across hyperlinks
First Claim
Patent Images
1. A computer including a data storage device including a computer usable medium having computer usable code means for ranking documents in a set of documents in response to a query, the computer usable code means having:
- computer readable code means for identifying a reference to a second document in a first document;
computer readable code means for receiving a lexical distance, the lexical distance defining a number of document terms;
computer readable code means for receiving a query including one or more query terms; and
computer readable code means for determining a number of times at least one of the query terms is present in the first document within the lexical distance of the reference to the second document, for ranking the documents based thereon.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for ranking wide area computer network (e.g., Web) pages by popularity in response to a query. Further, using a query and the response thereto from a search engine, the system and method finds additional key words that might be good extended search terms, essentially generating a local thesaurus on the fly at query time.
150 Citations
36 Claims
-
1. A computer including a data storage device including a computer usable medium having computer usable code means for ranking documents in a set of documents in response to a query, the computer usable code means having:
-
computer readable code means for identifying a reference to a second document in a first document; computer readable code means for receiving a lexical distance, the lexical distance defining a number of document terms; computer readable code means for receiving a query including one or more query terms; and computer readable code means for determining a number of times at least one of the query terms is present in the first document within the lexical distance of the reference to the second document, for ranking the documents based thereon. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer program device comprising:
-
a computer program storage device readable by a digital processing apparatus; and a program means on the program storage device and including instructions executable by the digital processing apparatus for performing method steps for finding key words in a set of documents, the method steps comprising; receiving the set of documents; determining referring documents and referred-to documents in the set of documents, the referring documents being documents in the set containing references to referred-to documents; for each document term in a referring document, determining the number of times the document term appears within a predetermined distance of a reference to a referred-to document; and ranking at least some of the document terms in the documents based on the respective number of times. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer-implemented method for finding associations in computer-stored documents between document terms and query topics represented by one or more query terms, the documents having respective references to referred-to documents, the method comprising the steps of:
-
receiving at least a list of documents in response to the query terms; and when a document term and a reference to a referred-to document are both found in a document within a predetermined distance of a query term, outputting a signal representative of an association between the document term and the query topic. - View Dependent Claims (18, 19, 20, 21, 22, 23)
-
-
24. A computer including a data storage device including a computer usable medium having computer usable code means for finding associations in computer-stored documents between document terms and query topics represented by one or more query terms, the documents having respective references to referred-to documents, the computer usable code means having:
-
computer readable code means for receiving at least a list of documents in response to the query terms; and computer readable code means for outputting, when a document term and a reference to a referred-to document are both found in a document within a predetermined distance of a query term, a signal representative of an association between the document term and the query topic. - View Dependent Claims (25, 26, 27, 28)
-
-
29. A computer including a data storage device including a computer usable medium having computer usable code means for finding key words in a set of documents, the computer usable code means having:
-
computer readable code means for receiving the set of documents; computer readable code means for determining referring documents and referred-to documents in the set of documents, the referring documents being documents in the set containing references to referred-to documents; computer readable code means for determining, for each document term in a referring document, the number of times the document term appears within a predetermined distance of a reference to a referred-to document; and computer readable code means for ranking at least some of the document terms in the documents based on the respective number of times. - View Dependent Claims (30, 31, 32, 33, 34)
-
-
35. A computer including a data storage device including a computer usable medium having computer usable code means for ranking documents in a set of documents in response to a query, the computer usable code means having
computer readable code means for receiving a set "U" of documents; -
computer readable code means for, for at least one test document "u" in the set "U", defining as neighbor documents "N(u)" documents in the set "U" that include at least one reference to the test document "u"; computer readable code means for determining, for at least one document term in at least one neighbor document "N(u)", whether the at least one document term is within a predetermined distance of a reference in the neighbor document "N(u)" to the test document "u"; and computer readable code means for outputting a signal in response to the means for determining whether the at least one document term is within a predetermined distance of a reference. - View Dependent Claims (36)
-
Specification