Information search using knowledge agents
First Claim
Patent Images
1. A method for searching a corpus of linked documents containing terms, comprising:
- defining a knowledge domain;
identifying a set of reference documents in the corpus pertinent to the domain;
inputting a search query;
searching the corpus to find one or more of the documents in the corpus that contain information relevant to the query;
evaluating a textual resemblance between the found documents and the reference documents so as to assign respective textual scores to the found documents;
assessing links between the found documents and the reference documents so as to assign respective topological scores to the found documents; and
ranking the found documents with respect to their relevance to the domain responsive to the textual scores and the topological scores.
2 Assignments
0 Petitions
Accused Products
Abstract
A method for searching a corpus of documents, such as the World Wide Web, includes defining a knowledge domain and identifying a set of reference documents in the corpus pertinent to the domain. Upon inputting a query, the corpus is searched using the set of reference documents to find one or more of the documents in the corpus that contain information in the domain relevant to the query. The set of reference documents is updated with the found documents that are most relevant to the domain. The updated set is used in searching the corpus for information in the domain relevant to subsequent queries.
181 Citations
5 Claims
-
1. A method for searching a corpus of linked documents containing terms, comprising:
-
defining a knowledge domain;
identifying a set of reference documents in the corpus pertinent to the domain;
inputting a search query;
searching the corpus to find one or more of the documents in the corpus that contain information relevant to the query;
evaluating a textual resemblance between the found documents and the reference documents so as to assign respective textual scores to the found documents;
assessing links between the found documents and the reference documents so as to assign respective topological scores to the found documents; and
ranking the found documents with respect to their relevance to the domain responsive to the textual scores and the topological scores. - View Dependent Claims (2, 3)
-
-
4. Apparatus for searching a corpus of linked documents containing terms, comprising:
-
a memory, adapted to store an identification of a set of reference documents in the corpus pertinent to a predefined knowledge domain; and
a search processor, which responsive to receiving a query as input, is adapted to search the corpus to find one or more of the documents in the corpus that contain information relevant to the query, to evaluate a textual resemblance between the found documents and the reference documents so as to assign respective textual scores to the found documents, to assess links between the found documents and the reference documents so as to assign respective topological scores to the found documents, and to rank the found documents with respect to their relevance to the domain responsive to the textual scores and the topological scores.
-
-
5. A computer software product for searching a corpus of documents, the product comprising a computer-readable medium in which program instructions are stored, which instructions, when read by a computer, cause the computer to receive a definition of a knowledge domain and an identification of a set of reference documents in the corpus pertinent to the domain, and further cause the computer, responsive to a query, to search the corpus to find one or more of the documents in the corpus that contain information relevant to the query, to evaluate a textual resemblance between the found documents and the reference documents to assign respective textual scores to the found documents, to assess links between the found documents and the reference documents to assign respective topological scores to the found documents, and to rank the found documents with respect to their relevance to the domain responsive to the textual scores and the topological scores.
Specification