Electronic discovery system with statistical sampling
First Claim
1. A computer-implemented method comprising:
obtaining, by at least one data processor and by a sampler tool, data from a plurality of applications and data sources used to implement a software-based electronic discovery workflow, the electronic discovery workflow being used to categorize each of a plurality of documents for potential production in connection with litigation discovery, the obtained data characterizing a current state of at least one aspect of the electronic discovery workflow;
generating, by the at least one data processor, at least one benchmark input field on a graphical user interface, each of the at least one benchmark input field accepting a user-defined benchmark describing a desired statistical feature of the data;
displaying, by the at least one data processor, on the graphical user interface, a recommended sample size indicating a number of documents that are likely needed to achieve at least one of the desired statistical features;
calculating, by the at least one data processor, based on the user-defined benchmarks and the data, at least one metric, each of the at least one metric corresponding to an actual statistical feature of the data with the recommended sample size;
displaying, by the at least one data processor, on the graphical user interface, the at least one metric;
generating, by the at least one data processor, a report based on the at least one metric;
displaying, on the graphical user interface, the report to a user;
providing, by the at least one data processor, data characterizing the report to a data store;
updating, by the at least one data processor with a write-back operation to the data store, an updated report based on detected changes in the at least one benchmark input field;
providing, by the at least one data processor, updated data characterizing the updated report; and
displaying, on the graphical user interface, the updated report to the user.
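The claim does not state how the recommended sample size or the resulting metric is computed. As a minimal sketch only, assuming the user-defined benchmarks are a confidence level and a margin of error, one common choice is Cochran's formula with a finite population correction. The function names, parameters, and the choice of formula below are illustrative assumptions and are not taken from the patent.

```python
import math
from statistics import NormalDist


def recommended_sample_size(population: int, confidence: float,
                            margin: float, p: float = 0.5) -> int:
    """Hypothetical helper: documents to sample for a desired confidence
    level and margin of error (Cochran's formula, finite population
    correction). p = 0.5 is the most conservative prevalence assumption."""
    z = NormalDist().inv_cdf(0.5 + confidence / 2.0)  # two-sided z-score
    n0 = (z * z * p * (1.0 - p)) / (margin * margin)  # infinite-population size
    n = n0 / (1.0 + (n0 - 1.0) / population)          # finite population correction
    return math.ceil(n)


def achieved_margin(population: int, sample_size: int,
                    positives: int, confidence: float) -> float:
    """Hypothetical metric: the margin of error actually obtained once
    `positives` of the `sample_size` reviewed documents carry the category
    of interest (normal approximation)."""
    p_hat = positives / sample_size
    z = NormalDist().inv_cdf(0.5 + confidence / 2.0)
    fpc = math.sqrt((population - sample_size) / (population - 1))
    return z * math.sqrt(p_hat * (1.0 - p_hat) / sample_size) * fpc
```

Under these assumptions, a collection of 100,000 documents with benchmarks of 95% confidence and a 5% margin of error yields a recommended sample of 383 documents. The achieved margin can then be recomputed from the reviewed sample and shown next to the benchmark.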
Abstract
A sampler tool for an electronic discovery system implementing an electronic discovery workflow is provided. The sampler tool defines a workflow, identifies a data set, then calculates results and generates reports and archival records. In particular, the sampler tool calculates and/or obtains various metrics characterizing the workflow and generates reports based on such metrics. These reports can serve as historical snapshots which can be used for quality control purposes and/or to provide support regarding the efficacy of the data selection process or large-scale document review in the context of the electronic discovery process. Related apparatus, systems, techniques and articles are also described.
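The idea of reports serving as historical snapshots can be illustrated with a small sketch. It assumes an append-only, in-memory store in which each change to the benchmarks or metrics is written back as a new versioned record. The class names, fields, and storage model are hypothetical and are not described in the patent.

```python
import json
from dataclasses import dataclass, field, asdict
from datetime import datetime, timezone


@dataclass(frozen=True)
class ReportSnapshot:
    """One immutable report record (hypothetical layout)."""
    version: int
    created_at: str
    benchmarks: dict
    metrics: dict


@dataclass
class ReportStore:
    """Hypothetical append-only store standing in for the data store."""
    snapshots: list = field(default_factory=list)

    def write_back(self, benchmarks: dict, metrics: dict) -> ReportSnapshot:
        latest = self.snapshots[-1] if self.snapshots else None
        # Only record a new snapshot when something actually changed.
        if (latest is not None and latest.benchmarks == benchmarks
                and latest.metrics == metrics):
            return latest
        snapshot = ReportSnapshot(
            version=len(self.snapshots) + 1,
            created_at=datetime.now(timezone.utc).isoformat(),
            benchmarks=dict(benchmarks),  # copy so later edits do not alter history
            metrics=dict(metrics),
        )
        self.snapshots.append(snapshot)
        return snapshot

    def export(self) -> str:
        """Serialize the full history, e.g. for an archival record."""
        return json.dumps([asdict(s) for s in self.snapshots], indent=2)
```

Keeping earlier versions rather than overwriting them is one way to preserve the audit trail that quality control would rely on. An implementation could equally overwrite in place and archive separately.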
24 Claims
13. A non-transitory computer program product storing instructions, which when executed by at least one data processor of at least one computing system, result in operations comprising:
obtaining, by a sampler tool, data from a plurality of applications and data sources used to implement a software-based electronic discovery workflow, the electronic discovery workflow being used to categorize each of a plurality of documents for potential production in connection with litigation discovery, the obtained data characterizing a current state of at least one aspect of the electronic discovery workflow;
generating at least one benchmark input field on a graphical user interface, each of the at least one benchmark input field accepting a user-defined benchmark describing a desired statistical feature of the data;
displaying, on the graphical user interface, a recommended sample size indicating a number of documents that are likely needed to achieve at least one of the desired statistical features;
calculating, based on the user-defined benchmarks and the data, at least one metric, each of the at least one metric corresponding to an actual statistical feature of the data with the recommended sample size;
displaying, on the graphical user interface, the at least one metric;
generating a report based on the at least one metric;
displaying, on the graphical user interface, the report to a user;
providing data characterizing the report to a data store;
updating, by a write-back operation to the data store, an updated report based on detected changes in the at least one benchmark input field;
providing updated data characterizing the updated report; and
displaying, on the graphical user interface, the updated report to the user.
24. A system comprising:
at least one data processor; and
memory storing instructions, which when executed by the at least one data processor, result in operations comprising:
obtaining, by a sampler tool, data from a plurality of applications and data sources used to implement a software-based electronic discovery workflow, the electronic discovery workflow being used to categorize each of a plurality of documents for potential production in connection with litigation discovery, the obtained data characterizing a current state of at least one aspect of the electronic discovery workflow;
generating at least one benchmark input field on a graphical user interface, each of the at least one benchmark input field accepting a user-defined benchmark describing a desired statistical feature of the data;
displaying, on the graphical user interface, a recommended sample size indicating a number of documents that are likely needed to achieve at least one of the desired statistical features;
calculating, based on the user-defined benchmarks and the data, at least one metric, each of the at least one metric corresponding to an actual statistical feature of the data with the recommended sample size;
displaying, on the graphical user interface, the at least one metric;
generating a report based on the at least one metric;
displaying, on the graphical user interface, the report to a user;
providing data characterizing the report to a data store;
updating, by a write-back operation to the data store, an updated report based on detected changes in the at least one benchmark input field;
providing updated data characterizing the updated report; and
displaying, on the graphical user interface, the updated report to the user.
Specification