
Protecting confidential information

  • US 10,061,937 B1
  • Filed: 06/28/2017
  • Issued: 08/28/2018
  • Est. Priority Date: 06/28/2017
  • Status: Active Grant
First Claim

1. A method comprising:

    receiving, by one or more computer processors, from a first computer, text generated by a user;

    identifying, by one or more computer processors, in the text generated by the user, one or more confidential information registered in a dictionary, wherein the dictionary contains a plurality of registered confidential information and a plurality of substitute word corresponding to the plurality of registered confidential information;

    retrieving, by one or more computer processors, from the dictionary, one or more substitute words corresponding to each identified registered confidential information of the one or more confidential information registered in the dictionary;

    identifying, by one or more computer processors, in the text generated by the user, whether one or more words are potentially confidential based, at least in part, on a text analysis of the text generated by the user;

    generating, by one or more computer processors, one or more words for each of the one or more potentially confidential words, wherein the generating comprises;

    determining, by one or more computer processors, for each of the one or more potentially confidential words, the registered confidential information associated with a shortest edit distance;

    retrieving, by one or more computer processors, from the dictionary, the one or more retrieved substitute words to replace the registered confidential information associated with the shortest edit distance;

    determining, by one or more computer processors, a category of the one or more retrieved substitute words to replace the registered confidential information associated with the shortest edit distance;

    retrieving, by one or more computer processors, a list of unused words in the category of the one or more retrieved substitute words to replace the registered confidential information associated with the shortest edit distance;

    selecting, by one or more computer processors, one or more words from the list of unused words in the category of the one or more retrieved substitute words to replace the registered confidential information associated with the shortest edit distance based, at least in part, the text analysis identifying a highest topic index of the selected one or more words from the list of unused words; and

    sending, by one or more computer processors, to the first computer, a proposed protected text, wherein the proposed protected text includes the text generated by the user with each of the identified registered confidential information included with each of the one or more retrieved substitute words to replace the identified confidential information and each of the one or more potentially confidential words included with each of the one or more generated words to replace the one or more potentially confidential words.
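The steps recited in claim 1 can be illustrated with a minimal Python sketch. This is not the patented implementation. The claim does not specify the text analysis, how the topic index is computed, or the tokenization, so the sketch takes the set of potentially confidential words and a per-word topic score as inputs, splits the text on whitespace, and uses Levenshtein distance as the edit distance. All names (`Entry`, `protect`, `unused_by_category`, `topic_index`) and the sample data are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """Dictionary record: substitute word plus its category (hypothetical layout)."""
    substitute: str
    category: str


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance, assumed here as the claim's 'edit distance'."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def generate_substitute(word, dictionary, unused_by_category, topic_index):
    """Generate a replacement for one potentially confidential word."""
    # Registered confidential term with the shortest edit distance.
    nearest = min(dictionary,
                  key=lambda term: edit_distance(word.lower(), term.lower()))
    # Category of the substitute word registered for that term.
    category = dictionary[nearest].category
    # Among unused words in that category, pick the highest topic index.
    candidates = unused_by_category[category]
    return max(candidates, key=lambda w: topic_index.get(w, 0.0))


def protect(text, dictionary, potentially_confidential,
            unused_by_category, topic_index):
    """Build the proposed protected text from the user's text."""
    out = []
    for token in text.split():  # whitespace tokenization is an assumption
        if token in dictionary:
            out.append(dictionary[token].substitute)
        elif token in potentially_confidential:
            out.append(generate_substitute(token, dictionary,
                                           unused_by_category, topic_index))
        else:
            out.append(token)
    return " ".join(out)


# Hypothetical sample data, for illustration only.
dictionary = {"Apollo": Entry("Banana", "fruit"), "Acme": Entry("Paris", "city")}
unused_by_category = {"fruit": ["Cherry", "Mango"], "city": ["Oslo", "Lima"]}
topic_index = {"Cherry": 0.2, "Mango": 0.7, "Oslo": 0.5, "Lima": 0.1}

protected = protect("Apollo ships with Apolo support from Acme",
                    dictionary, {"Apolo"}, unused_by_category, topic_index)
print(protected)  # Banana ships with Mango support from Paris
```

In this example, "Apolo" is not registered but is one edit away from the registered term "Apollo", so it inherits the "fruit" category and receives the unused fruit word with the highest topic score.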
