Information search using knowledge agents
First Claim
Patent Images
1. A method for searching a corpus of documents, comprising:
- defining a knowledge domain;
identifying a set of reference documents in the corpus pertinent to the domain;
inputting a first query;
searching the corpus using the set of reference documents to find one or more of the documents in the corpus that contain information in the domain relevant to the first query; and
adding at least one of the found documents to the set of reference documents for use in searching the corpus for information in the domain relevant to a second, subsequent query.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for searching a corpus of documents, such as the World Wide Web, includes defining a knowledge domain and identifying a set of reference documents in the corpus pertinent to the domain. Upon inputting a query, the corpus is searched using the set of reference documents to find one or more of the documents in the corpus that contain information in the domain relevant to the query. The set of reference documents is updated with the found documents that are most relevant to the domain. The updated set is used in searching the corpus for information in the domain relevant to subsequent queries.
-
Citations
34 Claims
-
1. A method for searching a corpus of documents, comprising:
-
defining a knowledge domain;
identifying a set of reference documents in the corpus pertinent to the domain;
inputting a first query;
searching the corpus using the set of reference documents to find one or more of the documents in the corpus that contain information in the domain relevant to the first query; and
adding at least one of the found documents to the set of reference documents for use in searching the corpus for information in the domain relevant to a second, subsequent query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A method for searching a corpus of documents containing terms, comprising:
-
defining a knowledge domain;
identifying a set of reference documents in the corpus pertinent to the domain;
finding lexical characteristics of the terms in the reference documents;
inputting a search query;
refining the search query using the lexical characteristics; and
searching the corpus to find information in the domain responsive to the refined query. - View Dependent Claims (16, 17)
-
-
18. A method for searching a corpus of linked documents containing terms, comprising:
-
defining a knowledge domain;
identifying a set of reference documents in the corpus pertinent to the domain;
inputting a search query;
searching the corpus to find one or more of the documents in the corpus that contain information relevant to the query;
evaluating a textual resemblance between the found documents and the reference documents so as to assign respective textual scores to the found documents;
assessing links between the found documents and the reference documents so as to assign respective topological scores to the found documents; and
ranking the found documents with respect to their relevance to the domain responsive to the textual scores and the topological scores. - View Dependent Claims (19, 20)
-
-
21. Apparatus for searching a corpus of documents, comprising:
-
a memory, adapted to store an identification of a set of reference documents in the corpus pertinent to a predefined knowledge domain; and
a search processor, which responsive to receiving a first query as input, is adapted to search the corpus using the set of reference documents to find one or more of the documents in the corpus that contain information in the domain relevant to the first query, and to add at least one of the found documents to the set of reference documents stored in the memory for use in searching the corpus for information in the domain relevant to a second, subsequent query. - View Dependent Claims (22, 23, 24, 25, 26, 27)
-
-
28. Apparatus for searching a corpus of documents containing terms, comprising:
-
a memory, adapted to store an identification of a set of reference documents in the corpus pertinent to a predefined knowledge domain; and
a search processor, which is adapted to find lexical characteristics of the terms in the reference documents, and responsive to receiving a query as input, is adapted to refine the search query using the lexical characteristics and to search the corpus to find information in the domain responsive to the refined query.
-
-
29. Apparatus for searching a corpus of linked documents containing terms, comprising:
-
a memory, adapted to store an identification of a set of reference documents in the corpus pertinent to a predefined knowledge domain; and
a search processor, which responsive to receiving a query as input, is adapted to search the corpus to find one or more of the documents in the corpus that contain information relevant to the query, to evaluate a textual resemblance between the found documents and the reference documents so as to assign respective textual scores to the found documents, to assess links between the found documents and the reference documents so as to assign respective topological scores to the found documents, and to rank the found documents with respect to their relevance to the domain responsive to the textual scores and the topological scores.
-
- 30. A computer software product for searching a corpus of documents, the product comprising a computer-readable medium in which program instructions are stored, which instructions, when read by a computer, cause the computer to receive a definition of a knowledge domain and an identification of a set of reference documents in the corpus pertinent to the domain, and further cause the computer, responsive to a first query, to search the corpus using the set of reference documents to find one or more of the documents in the corpus that contain information in the domain relevant to the first query, and to add at least one of the found documents to the set of reference documents for use in searching the corpus for information in the domain relevant to a second, subsequent query.
-
33. A computer software product for searching a corpus of documents, the product comprising a computer-readable medium in which program instructions are stored, which instructions, when read by a computer, cause the computer to receive a definition of a knowledge domain and an identification of a set of reference documents in the corpus pertinent to the domain and to find lexical characteristics of the terms in the reference documents, and further cause the computer, responsive to a query, to refine the search query using the lexical characteristics and to search the corpus to find information in the domain responsive to the refined query.
-
34. A computer software product for searching a corpus of documents, the product comprising a computer-readable medium in which program instructions are stored, which instructions, when read by a computer, cause the computer to receive a definition of a knowledge domain and an identification of a set of reference documents in the corpus pertinent to the domain, and further cause the computer, responsive to a query, to search the corpus to find one or more of the documents in the corpus that contain information relevant to the query, to evaluate a textual resemblance between the found documents and the reference documents to assign respective textual scores to the found documents, to assess links between the found documents and the reference documents to assign respective topological scores to the found documents, and to rank the found documents with respect to their relevance to the domain responsive to the textual scores and the topological scores.
Specification