×

Removing personal information from text using a neural network

  • US 10,169,315 B1
  • Filed: 04/27/2018
  • Issued: 01/01/2019
  • Est. Priority Date: 04/27/2018
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for removing personal information from text using a neural network, the method comprising:

  • obtaining the neural network, wherein the neural network is configured to process the text and select a label from a plurality of possible labels for each word of the text, wherein each label corresponds to a class of words, and wherein at least one label corresponds to a class of words to be removed from the text;

    receiving the text;

    obtaining a word embedding for each word of the text, where a word embedding represents a word in a vector space;

    computing a context vector for each word of the text by processing the word embeddings with a first layer of the neural network, where a context vector for a given word includes information about words before or after the given word;

    computing label scores for each word of the text by processing each of the context vectors with a second layer of the neural network, wherein each label score indicates a match between a word and a class of words;

    selecting a label for each word of the text by processing the label scores with a third layer of the neural network; and

    generating redacted text by replacing a first word of the text with a first label corresponding to the first word.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×