×

SYSTEM AND METHOD FOR PROVIDING TECHNOLOGY ASSISTED DATA REVIEW WITH OPTIMIZING FEATURES

  • US 20180113935A1
  • Filed: 12/20/2017
  • Published: 04/26/2018
  • Est. Priority Date: 03/13/2013
  • Status: Active Grant
First Claim
Patent Images

1. An electronic document system, comprising:

  • a processor;

    a data store including a plurality of documents;

    a non-transitory computer readable medium, comprising instructions for;

    generating a document map for the plurality of documents within the data store using a topic-related generative model for the plurality of documents by clustering the plurality of documents into topics based on the topic-related generative model;

    selecting a control set of documents from the plurality of documents, wherein the control set of documents is selected from a first strata of the plurality of documents and a second strata of the plurality of documents;

    sending the control set of documents to a user;

    receiving a control set metric regarding the control set of documents from the user, wherein the control set metric includes an indicator of responsiveness for each of the documents of the control set of documents;

    the data review system performing the steps of;

    a) determining a responsiveness score for each of the plurality of documents according to a scoring algorithm including determining a document responsiveness probability for the document, determining a weighted topic score for the document for each of a set of topics in the topic-related generative model based on the document responsiveness probability and a topic-document weight between the topic and the document, generating an initial responsiveness score based on the topic-document weights of the document for each topic and the weighted topic score, and normalizing the document responsiveness probability based on the initial responsiveness score to determine the responsiveness score for the document;

    b) determining a set of responsive documents and a set of non-responsive documents of the plurality of documents based on the responsiveness score determined for each of the plurality of documents and a decision boundary score;

    c) determining a confidence score for the data review system using the responsiveness score for each of the documents of the control set and the indicator of responsiveness for each of the control set documents received from the user;

    d) selecting one or more of the plurality of documents based on the responsiveness scores of the plurality of documents, wherein the responsiveness score of each of the one or more selected documents is at or near the decision boundary score;

    e) presenting the one or more selected documents to the user;

    f) receiving an indicator of responsiveness from the user for each of the selected documents;

    g) refining the scoring algorithm based on the indicator of responsiveness for each of the selected document; and

    h) generating a desired confidence score for the document system and presenting the set of responsive documents to the user when the desired confidence score for the document system is achieved, wherein the confidence score for the document system is determined by comparing the responsiveness score for the documents of the control set to the indicator of responsiveness for the documents of the control set received from the user.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×