×

Data processing systems for automated classification of personal information from documents and related methods

  • US 10,614,247 B2
  • Filed: 09/20/2019
  • Issued: 04/07/2020
  • Est. Priority Date: 06/10/2016
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented data processing method for automatically classifying personal information in an electronic document and generating a sensitivity score for the electronic document based on the classification, the method comprising:

  • receiving, by one or more processors, the electronic document for analysis;

    using one or more natural language processing techniques, by the one or more processors, to decompose data from the electronic document into;

    one or more structured objects; and

    one or more values for each of the one or more structured objects;

    classifying, by the one or more processors, the each of the one or more structured objects in the electronic document based on one or more attributes of the one or more structured objects;

    categorizing, by the one or more processors, the each of the one or more structured objects based on a sensitivity of the one or more structured objects;

    rating, by the one or more processors, an accuracy of the categorization, wherein rating the accuracy of the categorization comprises;

    receiving a second electronic document that is related to the electronic document;

    using the one or more natural language processing techniques, by one or more processors, to decompose data from the second electronic document into;

    one or more second structured objects; and

    one or more second values for each of the one or more structured objects;

    classifying, by one or more processors, each of the one or more second structured objects in the second electronic document based on one or more second attributes of the one or more second structured objects;

    categorizing, by one or more processors, each of the one or more second structured objects based on a sensitivity of the one or more second structured objects; and

    comparing the categorization of the one or more structured objects with the categorization of the one or more second structured objects; and

    rating the accuracy based on the comparison; and

    generating, by the one or more processors, a sensitivity score for the electronic document based at least in part on the categorized one or more structured objects and the associated one or more values.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×