×

Layered masking of content

  • US 10,546,154 B2
  • Filed: 10/27/2017
  • Issued: 01/28/2020
  • Est. Priority Date: 03/28/2017
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • receiving content including a token;

    storing one or more regular expressions for the content, wherein each regular expression comprises a sequence of symbols and characters expressing a string or pattern;

    determining, by a computer system based on matching the one or more regular expressions with the token, a first confidence score indicating a probability that the token includes personally identifiable information (PII), the first confidence score being associated with the regular expression;

    storing a lookup table that includes one or more tokens for known PII;

    determining, by the computer system based on matching the token with tokens in the lookup table, a second confidence score indicating a probability that the token includes PII, the second confidence score being associated with a term in the lookup table that is an exact match of the token;

    storing a model for determining a third confidence score indicating a probability that the token includes PII, wherein the model is generated using a machine learning training algorithm;

    determining, by the computer system based on inputting the token into the model, the third confidence score;

    masking the token by the computer system based on the first confidence score, the second confidence score and the third confidence score; and

    providing, by the computer system as data of improved privacy, the content including the masked token to a content consuming device.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×