System And Method For Generating A Reference Set For Use During Document Review
First Claim
Patent Images
1. A method for generating a reference set for use during document review, comprising:
- obtaining a collection of unclassified documents;
applying selection criteria to the collection and selecting those unclassified documents that satisfy the selection criteria as reference set candidates;
assigning a classification code to each reference set candidate; and
forming a reference set from the classified reference set candidates, wherein the reference set comprises coded documents that are quality controlled and shared between one or more reviewers.
7 Assignments
0 Petitions
Accused Products
Abstract
A system and method for providing generating reference sets for use during document review is provided. A collection of unclassified documents is obtained. Selection criteria are applied to the document collection and those unclassified documents that satisfy the selection criteria are selected as reference set candidates. A classification code is assigned to each reference set candidate. A reference set is formed from the classified reference set candidates. The reference set is quality controlled and shared between one or more users.
-
Citations
20 Claims
-
1. A method for generating a reference set for use during document review, comprising:
-
obtaining a collection of unclassified documents; applying selection criteria to the collection and selecting those unclassified documents that satisfy the selection criteria as reference set candidates; assigning a classification code to each reference set candidate; and forming a reference set from the classified reference set candidates, wherein the reference set comprises coded documents that are quality controlled and shared between one or more reviewers. - View Dependent Claims (2, 3, 4)
-
-
5. A system for generating a reference set for use during document review, comprising:
-
a collection of unclassified documents; a selection module to apply selection criteria to the collection and to select those unclassified documents that satisfy the selection criteria as reference set candidates; a classification module to assign a classification code to each reference set candidate; and a data module to form a reference set from the classified reference set candidates, wherein the reference set comprises coded document that are quality controlled and shared between one or more reviewers. - View Dependent Claims (6, 7, 8)
-
-
9. A method for generating a reference set via clustering, comprising:
-
obtaining a collection of documents; grouping the documents into clusters of documents; selecting one or more documents from at least one cluster as reference set candidates; assigning a classification code to each of the reference set candidates; and grouping the classified reference set candidates as the reference set. - View Dependent Claims (10, 11, 12)
-
-
13. A method for generating a reference set via seed documents, comprising:
-
obtaining a collection of documents; identifying one or more seed documents; comparing the seed documents to the document collection and identifying those documents similar to the seed documents as reference set candidates; applying a size threshold to the reference set candidates; and grouping the reference set candidates as the reference set when the size threshold is satisfied. - View Dependent Claims (14, 15, 16)
-
-
17. A method for generating a training set for use during document review, comprising:
-
assigning classification codes to a set of documents; receiving further classification codes assigned to the same set of documents; comparing the classification code for at least one document with the further classification code for that document; determining whether a disagreement exists between the assigned classification code and the further classification code for at least one document; identifying those documents with disagreeing classification codes as training set candidates; applying a stop threshold to the training set candidates; and grouping the training set candidates as a training set when the stop threshold is satisfied. - View Dependent Claims (18, 19, 20)
-
Specification