SYSTEM AND METHOD FOR HIGH PRECISION AND HIGH RECALL RELEVANCY SEARCHING
First Claim
1. A computer-implemented method for identifying one or more relevant documents, comprising:
- generating, by a computer, a filter for identifying a relevant document based on an initial relevance rule related to a set of documents;
applying, by the computer, the filter to the set of documents thereby identifying a subset of relevant documents;
receiving, by the computer from an assessor, the subset of relevant documents comprising an identification of key information;
generating, by the computer, an updated relevance rule based on the key information and the initial relevance rule;
generating, by the computer, a query based on the updated relevance rule for identifying relevant documents within the set of documents; and
outputting, by the computer, the set of documents within which the relevant documents have been identified.
4 Assignments
0 Petitions
Accused Products
Abstract
A method and system for performing high precision and high recall relevancy searching is provided. According to embodiments of the present invention, a relevance rule is generated based on a user model and language from within one or more relevant and non-relevant documents. A query is created based on the relevance rule wherein the query may be applied to a corpus to identify relevant and non-relevant documents. The relevance rule may be iteratively refined in order to increase the accuracy of the query. The resulting query may be used by a litigator during the discovery phase of a litigation to respond to a request for production.
24 Citations
12 Claims
-
1. A computer-implemented method for identifying one or more relevant documents, comprising:
-
generating, by a computer, a filter for identifying a relevant document based on an initial relevance rule related to a set of documents; applying, by the computer, the filter to the set of documents thereby identifying a subset of relevant documents; receiving, by the computer from an assessor, the subset of relevant documents comprising an identification of key information; generating, by the computer, an updated relevance rule based on the key information and the initial relevance rule; generating, by the computer, a query based on the updated relevance rule for identifying relevant documents within the set of documents; and outputting, by the computer, the set of documents within which the relevant documents have been identified. - View Dependent Claims (2, 3, 5, 6)
-
-
4. The computer-implemented method of claim I, wherein generating the updated relevance rule comprises:
-
identifying, by the computer, a conflict between the key information and the initial relevance rule, providing, by the computer, the key information and the initial relevance rule to an assessor to resolve the conflict, and receiving, by the computer, an updated relevance rule wherein the initial relevance rule has been altered to resolve the conflict between the initial relevance rule and the key information.
-
-
7. A system for identifying one or more relevant documents, comprising:
-
a user modeling module configured to; generate a filter for identifying a relevant document based on an initial relevance rule related to a set of documents, generate an updated relevance rule based on the key information and the initial relevance rule, generate a query based on the updated relevance rule for identifying relevant documents within the set of documents, an assessment module configured to; apply the filter to the set of documents thereby identifying a subset of relevant documents, and receive from an assessor, the subset of relevant documents comprising an identification of key information, and a classification module configured to output the set of documents wherein the relevant documents have been identified. - View Dependent Claims (8, 9, 10, 11, 12)
-
Specification