×

Clustering communications based on classification

  • US 10,007,717 B2
  • Filed: 09/18/2014
  • Issued: 06/26/2018
  • Est. Priority Date: 09/18/2014
  • Status: Active Grant
First Claim
Patent Images

1. A computer implemented method, comprising:

  • identifying a plurality of classification terms indicative of a classification;

    identifying a corpus of communications from one or more databases, the corpus of communications including a plurality of communications that are not labeled with an association to the classification;

    determining a cluster of the communications based on occurrence of one or more of the classification terms in the communications of the cluster;

    subsequent to determining the cluster, determining a feature set based on the communications of the cluster, wherein determining the feature set comprises;

    determining one or more features that are based on content that appears in a plurality of the communications of the cluster,wherein the content is in addition to the classification terms used in determining the cluster, andwherein determining the features based on the content that is in addition to the classification terms comprises determining the features based on the content appearing in the plurality of the communications of the cluster;

    assigning the feature set to an indication of the classification; and

    using the assigned feature set to classify an additional communication with the classification or using the assigned feature set to select a data extraction parser, for the classification, for the additional communication.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×