Integration and combination of random sampling and document batching

  • US 9,785,634 B2
  • Filed: 06/04/2011
  • Issued: 10/10/2017
  • Est. Priority Date: 06/04/2011
  • Status: Active Grant
First Claim

1. A method of integrated batching and random sampling of documents for enhanced functionality within document review processes, comprising:

  • receiving a batching request, the batching request including;

    a population size that corresponds to a number of a total amount of documents available for sampling; and

    an acceptable margin of error;

    computing a random sample size from the batching request;

    randomly selecting a subset of documents from the total amount of documents available for sampling, a number of the randomly selected subset of documents corresponding to the random sample size, a set of excluded documents being documents in the total amount of documents available for sampling that are not included in the randomly selected subset of documents;

    determining a range of relevant documents within the set of excluded documents by determining a population of relevant documents within the set of excluded documents, the determination performed by;

      receiving a query regarding the total amount of documents,

      applying a hypothesis test to the randomly selected subset of documents to calculate a first response to the query for the randomly selected subset of documents, and

      utilizing the first response to calculate a second response to the query for the population of excluded but relevant documents within the total amount of documents;

    randomly grouping the randomly selected subset of documents into a plurality of batches for assignment to a plurality of review nodes, at least one review node being a machine review node and at least one node being a human review node;

    assigning each of the randomly grouped batches to a review node of the plurality of review nodes for review of the respective batch;

    determining a range of excluded but relevant documents for both a batch of machine reviewed documents and a batch of human reviewed documents;

    comparing the ranges together to determine a difference between machine reviewed documents and human reviewed documents; and

    utilizing machine document review if the difference is less than a threshold amount.
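
The sample-size, random-selection, and random-batching steps recited above can be illustrated with a minimal Python sketch. The claim does not state which formula turns a population size and margin of error into a sample size. This sketch assumes Cochran's formula with a finite population correction, a 95% confidence level (z = 1.96), and a worst-case proportion of 0.5. The function names `sample_size` and `select_and_batch`, the `seed` parameter, and the round-robin batching are hypothetical choices made for illustration, not taken from the patent.

```python
import math
import random


def sample_size(population_size, margin_of_error, z=1.96, p=0.5):
    """Assumed formula: Cochran's sample size with finite population correction."""
    # Sample size for an effectively infinite population.
    n0 = (z ** 2) * p * (1 - p) / (margin_of_error ** 2)
    # Shrink it for a finite population of the given size.
    n = n0 / (1 + (n0 - 1) / population_size)
    return min(population_size, math.ceil(n))


def select_and_batch(doc_ids, margin_of_error, batch_count, seed=None):
    """Draw a random sample, record the excluded set, and split the sample into batches.

    doc_ids must be a list of unique, hashable document identifiers.
    """
    rng = random.Random(seed)  # seed is only for reproducible illustration
    n = sample_size(len(doc_ids), margin_of_error)

    # Randomly select the subset; everything else is the excluded set.
    sample = rng.sample(doc_ids, n)
    chosen = set(sample)
    excluded = [d for d in doc_ids if d not in chosen]

    # Shuffle, then deal round-robin so batch sizes differ by at most one.
    rng.shuffle(sample)
    batches = [sample[i::batch_count] for i in range(batch_count)]
    return sample, excluded, batches


if __name__ == "__main__":
    docs = list(range(10_000))
    sample, excluded, batches = select_and_batch(docs, 0.05, batch_count=4, seed=1)
    print(len(sample), len(excluded), [len(b) for b in batches])
```

Each resulting batch could then be assigned to a human or machine review node. The hypothesis test, the range estimates, and the threshold comparison are left out here because the claim does not specify them in enough detail to sketch without guessing.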

  • 6 Assignments