×

Selecting pattern matching segments for electronic communication clustering

  • US 10,216,837 B1
  • Filed: 12/29/2014
  • Issued: 02/26/2019
  • Est. Priority Date: 12/29/2014
  • Status: Active Grant
First Claim
Patent Images

1. A system comprising one or more processors operably coupled with non-transitory memory that stores instructions that, in response to execution of the instructions by the one or more processors, cause the one or more processors to:

  • identify a set of pattern matching segments that match at least one of a corpus of email addresses;

    determine a measure of coverage of each of the set of pattern matching segments across the corpus of email addresses;

    determine one or more measures of flexibility associated with each of the set of pattern matching segments based on a count of wildcard characters within each pattern matching segment relative to a count of fixed text characters within each pattern matching segment;

    determine, based on the measure of coverage and the one or more measures of flexibility associated with each of the set of pattern matching segments, a score associated with each pattern matching segment;

    select, based on scores associated with the pattern matching segments, one or more of the pattern matching segments that satisfy one or more thresholds that are automatically adjusted;

    group a corpus of emails into a plurality of clusters based on a comparison of the one or more selected pattern matching segments to email addresses associated with the corpus of emails;

    analyze emails of a given cluster of the plurality of clusters grouped based on a given pattern matching segment to identify content that is transient among the emails of the given cluster;

    generate, for the given cluster, a data extraction template that is usable to extract transient content from subsequent emails that include sender email addresses that match the given pattern matching segment; and

    apply the data extraction template to a subsequent email having a sender address that matches the given pattern matching segment to extract transient data from a subject or body of the subsequent email, wherein the extracted transient data is output to a user via an output device of a computing device operated by the user.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×