
Document classification and characterization

  • US 9,703,863 B2
  • Filed: 03/11/2013
  • Issued: 07/11/2017
  • Est. Priority Date: 01/26/2011
  • Status: Active Grant
First Claim

1. A method comprising:

    receiving, by at least one data processor, data characterizing each of a plurality of documents within a document set;

    grouping, by the at least one data processor, the plurality of documents into a plurality of stacks using one or more grouping algorithms;

    identifying, by the at least one data processor, a prime document for each stack, the prime document including attributes representative of the entire stack;

    providing, by the at least one data processor, data characterizing documents for each stack including at least the identified prime document to at least one human reviewer;

    receiving, by the at least one data processor, user-generated input from the at least one human reviewer categorizing each provided document;

    sending, by the at least one data processor, data characterizing supplemental documents within a stack other than the provided documents to enable the at least one human reviewer to review a digital representation of such supplemental documents for quality control; and

    selecting, by the at least one data processor, randomized and stratified supplemental documents whose data is sent to the at least one human reviewer for quality control based on an algorithm designed to select documents based on their likelihood to require remediation, the selecting comprising:

    stratifying the supplemental documents by weighting the corresponding documents by tier representation; and

    randomizing the supplemental documents within pre-defined parameters comprising the weighting.
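
The steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation; it only shows one plausible reading of the workflow: group documents into stacks, pick a prime document per stack, then draw a stratified, randomized quality-control sample from the remaining (supplemental) documents. The `Document` fields (`stack_key`, `tier`, `score`), the function names, and the tier weights are all hypothetical and chosen for illustration; the claim does not specify a grouping algorithm, how tiers are defined, or how the weighting is computed.

```python
import random
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Document:
    doc_id: str
    stack_key: str  # hypothetical grouping key, e.g. a near-duplicate hash
    tier: str       # hypothetical tier label used for stratification
    score: float    # hypothetical "representativeness" score


def group_into_stacks(docs):
    """Group documents into stacks by their (assumed) grouping key."""
    stacks = defaultdict(list)
    for doc in docs:
        stacks[doc.stack_key].append(doc)
    return dict(stacks)


def pick_prime(stack):
    """Pick the document with the highest (assumed) representativeness score."""
    return max(stack, key=lambda d: d.score)


def select_for_qc(supplemental, tier_weights, sample_size, rng):
    """Stratify by tier weight, then sample at random within each tier.

    Each tier's quota is its share of the total weight times sample_size,
    rounded and capped at the tier's size, so the result may contain
    slightly more or fewer than sample_size documents.
    """
    by_tier = defaultdict(list)
    for doc in supplemental:
        by_tier[doc.tier].append(doc)

    total = sum(tier_weights.get(tier, 0.0) for tier in by_tier)
    if total <= 0:
        return []

    selected = []
    for tier in sorted(by_tier):
        members = by_tier[tier]
        share = tier_weights.get(tier, 0.0) / total
        quota = min(len(members), round(sample_size * share))
        selected.extend(rng.sample(members, quota))
    return selected


# Example run on made-up data: one stack, two tiers.
docs = [Document(f"doc{i}", "stack-1", "high" if i % 2 else "low", i / 10)
        for i in range(10)]
stacks = group_into_stacks(docs)
prime = pick_prime(stacks["stack-1"])
supplemental = [d for d in stacks["stack-1"] if d != prime]
qc_sample = select_for_qc(supplemental, {"high": 3.0, "low": 1.0}, 4,
                          random.Random(0))
print(prime.doc_id, [d.doc_id for d in qc_sample])
```

In this sketch the tier weights stand in for the claim's "likelihood to require remediation": a tier with a larger weight contributes proportionally more documents to the quality-control sample, while the choice of documents inside each tier remains random.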
