Document Classification and Characterization
First Claim
1. A method comprising:
- receiving data characterizing each of a plurality of documents within a document set;
grouping the plurality of documents into a plurality of stacks using one or more grouping algorithms;
identifying a prime document for each stack, the prime document including attributes representative of the entire stack;
providing data characterizing documents for each stack including at least the identified prime document to at least one human reviewer;
receiving user-generated input from the human reviewer categorizing each provided document;
sending data characterizing supplemental documents within a stack other than the provided documents to at least one human reviewer for quality control; and
selecting randomized and stratified supplemental documents whose data is sent to at least one human reviewer for quality control based on an algorithm designed to select documents based on their likelihood to require remediation.
10 Assignments
0 Petitions
Accused Products
Abstract
Data is received that characterizes each of a plurality of documents within a document set. Based on this data, the plurality of documents are grouped into a plurality of stacks using one or more grouping algorithms. A prime document is identified for each stack that includes attributes representative of the entire stack. Subsequently, provision of data is provided that characterizes documents for each stack including at least the identified prime document to at least one human reviewer. User-generated input from the human reviewer is later received that categorized each provided document and data characterizing the user-generated input can then be provided. Related apparatus, systems, techniques and articles are also described.
4 Citations
20 Claims
-
1. A method comprising:
-
receiving data characterizing each of a plurality of documents within a document set; grouping the plurality of documents into a plurality of stacks using one or more grouping algorithms; identifying a prime document for each stack, the prime document including attributes representative of the entire stack; providing data characterizing documents for each stack including at least the identified prime document to at least one human reviewer; receiving user-generated input from the human reviewer categorizing each provided document; sending data characterizing supplemental documents within a stack other than the provided documents to at least one human reviewer for quality control; and selecting randomized and stratified supplemental documents whose data is sent to at least one human reviewer for quality control based on an algorithm designed to select documents based on their likelihood to require remediation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. An article of manufacture comprising:
-
computer executable instructions stored on non-transitory computer readable media, which, when executed by a computer, causes the computer to perform operations comprising; receiving data characterizing each of a plurality of documents within a document set; grouping the plurality of documents into a plurality of stacks using one or more grouping algorithms; identifying a prime document for each stack, the prime document including attributes representative of the entire stack; providing data characterizing documents for each stack including at least the identified prime document to at least one human reviewer; receiving user-generated input from the human reviewer categorizing each provided document; sending data characterizing supplemental documents within a stack other than the provided documents to at least one human reviewer for quality control;
anselecting randomized and stratified supplemental documents whose data is sent to at least one human reviewer for quality control based on an algorithm designed to select documents based on their likelihood to require remediation. - View Dependent Claims (16, 17, 18, 19)
-
-
20. A system comprising:
-
at least one data processor; and memory storing instructions, which when executed by the at least one data processor, result in operations comprising; receiving data characterizing each of a plurality of documents within a document set; grouping the plurality of documents into a plurality of stacks using one or more grouping algorithms; identifying a prime document for each stack, the prime document including attributes representative of the entire stack; providing data characterizing documents for each stack including at least the identified prime document to at least one human reviewer; receiving user-generated input from the human reviewer categorizing each provided document; sending data characterizing supplemental documents within a stack other than the provided documents to at least one human reviewer for quality control;
anselecting randomized and stratified supplemental documents whose data is sent to at least one human reviewer for quality control based on an algorithm designed to select documents based on their likelihood to require remediation.
-
Specification