×

Systems and methods for identifying spam messages using subject information

  • US 9,647,975 B1
  • Filed: 09/28/2016
  • Issued: 05/09/2017
  • Est. Priority Date: 06/24/2016
  • Status: Active Grant
First Claim
Patent Images

1. A system for identifying a spam email message, the system comprising:

  • a computing platform including computing hardware of at least one processor, a memory operably coupled to the at least one processor and configured to store instructions invoked by the at least one processor, an operating system implemented on the computing hardware, and input/output facilities;

    a rules database configured to store a plurality of ratio determination rules including a set of conditions for a text string for which the rules are applied to determine an n-value of words in a gram and a k-value of words to skip in an input text;

    a vectors database configured to store a plurality of known vectors, wherein the plurality of known vectors are classified by thematic category;

    instructions that, when executed on the computing platform, cause the computing platform to implement;

    a message processing tool configured to receive an email message via the input/output facilities, the email message containing a subject field,a gram building tool configured to build a k-skip-n-gram set of word combinations according to the ratio of the k-value and the n-value for the subject field as the input text as determined by the ratio determination rules in the rules database,a vector building tool configured to receive, from the gram building tool, the k-skip-n-gram set of word combinations, and build a vector for each k-skip-n-gram word combination, anda spam identification tool configured to determine a spam presence threshold based on the cosine similarity for each k-skip-n-gram word combination and the plurality of known vectors for the particular email message subject field thematic category, and determine that the email message contains spam when the spam presence threshold is exceeded.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×