Methods for enhancing efficiency and cost effectiveness of first pass review of documents
First Claim
1. A method for reviewing a collection of documents to identify relevant documents from the collection, the method comprising:
- running a search of the collection of documents, the search being based on a plurality of query terms and returning a subset of responsive documents from the collection;
determining a corresponding probability of relevancy for each document in the responsive documents subset; and
removing from the responsive documents subset, documents that do not reach a threshold probability of relevancy.
20 Assignments
0 Petitions
Accused Products
Abstract
Methods for reviewing a collection of documents to identify relevant documents from the collection are provided. A search of the collection can be run based on query terms, to return a subset of responsive documents. A probability of relevancy can be determined for a document in the returned subset, and the document is removed from the subset if it does not reach a threshold probability of relevancy. Documents in a thread of a correspondence (for example, an e-mail) in the responsive documents subset can be added to the responsive documents subset. Further, an attachment to a document in the responsive documents subset can be added to the responsive documents subset. A statistical technique can be applied to determine whether remaining documents in the collection meet a predetermined acceptance level.
138 Citations
37 Claims
-
1. A method for reviewing a collection of documents to identify relevant documents from the collection, the method comprising:
-
running a search of the collection of documents, the search being based on a plurality of query terms and returning a subset of responsive documents from the collection; determining a corresponding probability of relevancy for each document in the responsive documents subset; and removing from the responsive documents subset, documents that do not reach a threshold probability of relevancy. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A method for reviewing a collection of documents to identify relevant documents from the collection, the method comprising:
-
running a search of the collection of documents, based on a plurality of query terms, the search returning a subset of responsive documents in the collection; automatically identifying a correspondence between a sender and a recipient, in the responsive documents subset; automatically determining one or more additional documents which are in a thread of the correspondence, the additional documents not being in the responsive documents subset; and adding the additional documents to the responsive documents subset. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A method for reviewing a collection of documents to identify relevant documents from the collection, the method comprising:
-
running a search of the collection of documents, based on a plurality of query terms, the search returning a subset of responsive documents in the collection; automatically determining whether any of the responsive documents in the responsive documents subset includes an attachment that is not in the subset; and adding the attachment to the responsive documents subset. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36)
-
-
37. A method for reviewing a collection of documents to identify relevant documents from the collection, the method comprising:
-
running a search of the collection of documents, based on a plurality of query terms, the search returning a subset of responsive documents from the collection; randomly selecting a predetermined number of documents from a remainder of the collection of documents not in the responsive documents subset; determining whether the randomly selected documents include additional relevant documents; identifying one or more specific terms in the additional responsive documents that render the documents relevant; expanding the query terms with the specific terms; and re-running the search with the expanded query terms.
-
Specification