×

Methods, devices and systems for data augmentation to improve fraud detection

  • US 10,664,656 B2
  • Filed: 06/20/2018
  • Issued: 05/26/2020
  • Est. Priority Date: 06/20/2018
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for modifying an original electronic text document of a corpus of electronic text documents, comprising:

  • receiving the original electronic text document in a computer having a memory;

    repeatedly translating the received original electronic text document, using at least one machine translation engine, wherein each translated electronic text document is used as a basis for a subsequent translation into another language;

    re-translating a last-translated electronic text document back into an original language of the original electronic text document;

    transforming the re-translated electronic text document by selecting at least one word therein and substituting a respective synonym for each selected word to generate a synonym-replaced electronic text document;

    transforming the synonym-replaced electronic text document by selecting at least one word therein and substituting a respective misspelled word for each selected word to generate an modified electronic text document;

    computing a similarity measure between the original electronic text document and the modified electronic text document;

    determining whether the computed similarity measure is at least as great as a predetermined similarity threshold; and

    storing the modified electronic text document in the memory if the computed similarity threshold is greater than or equal to the predetermined similarity threshold and not storing the modified electronic text document in the memory if the computed similarity threshold is less than the predetermined similarity threshold.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×