Method and system for data mining of short message streams

  • US 9,558,165 B1
  • Filed: 08/19/2012
  • Issued: 01/31/2017
  • Est. Priority Date: 08/19/2011
  • Status: Expired due to Fees
First Claim

1. A computer-implemented method for summarizing a message stream, the method comprising the steps of:

    defining a communications channel with one or more key words, wherein defining the communications channel comprises specifying one or more key words that are used to extract a message from the message stream, the message stream comprising at least two messages;

    extracting one or more messages from the message stream based on the defined channel, wherein extracting one or more messages from the message stream based on the defined channel comprises filtering one or more messages from the message stream using the defined channel as a filter for selecting a message to be extracted for additional processing;

    removing common words from the one or more extracted messages;

    building a word order graph for the one or more extracted messages, the word order graph tracking sequencing of words found within each extracted message;

    using an algorithm to find commonly occurring word clusters within each extracted message, wherein the algorithm reviews each extracted message for at least two-word clusters with a predetermined pair-frequency, the pair-frequency comprising a number of times that words appear together in an extracted message;

    pruning the word clusters to reduce a total number of word clusters;

    ranking one or more surviving clusters to determine an order of presentation;

    arranging each word cluster into a natural order based on the word order graph; and

    displaying the word clusters as a summary of the message stream.
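
The steps recited above can be illustrated with a minimal Python sketch. It is not the patented implementation: the claim does not name a clustering algorithm, a pruning rule, or a ranking metric, so everything concrete here is an assumption made for illustration. In particular, `summarize`, `COMMON_WORDS`, `min_pair_freq`, and `max_clusters` are hypothetical names; clusters are formed as connected components over word pairs that co-occur in at least `min_pair_freq` extracted messages; pruning is approximated by keeping at most `max_clusters` clusters; and the word order graph is reduced to counts of how often one word precedes another.

```python
from collections import Counter, defaultdict
from itertools import combinations

# Hypothetical stop-word list; the claim only says "common words".
COMMON_WORDS = {"the", "a", "an", "is", "in", "on", "of", "to", "and", "for", "at"}


def summarize(stream, keywords, min_pair_freq=2, max_clusters=3):
    """Return word clusters summarizing the messages that match the channel."""
    keys = {k.lower() for k in keywords}

    # Define the channel and extract matching messages, minus common words.
    extracted = []
    for message in stream:
        words = message.lower().split()
        if keys.intersection(words):
            extracted.append([w for w in words if w not in COMMON_WORDS])

    # Word order graph: precedes[(a, b)] counts messages where a comes before b.
    # Pair frequency: number of extracted messages containing both words.
    precedes = Counter()
    pair_freq = Counter()
    for words in extracted:
        unique = list(dict.fromkeys(words))  # dedupe, keep first-seen order
        for a, b in combinations(unique, 2):
            precedes[(a, b)] += 1
            pair_freq[frozenset((a, b))] += 1

    # Cluster (assumption): connected components over frequent pairs.
    frequent = {p: n for p, n in pair_freq.items() if n >= min_pair_freq}
    parent = {}

    def find(word):
        parent.setdefault(word, word)
        while parent[word] != word:
            parent[word] = parent[parent[word]]  # path compression
            word = parent[word]
        return word

    for pair in frequent:
        a, b = pair
        parent[find(a)] = find(b)

    groups = defaultdict(set)
    for word in list(parent):
        groups[find(word)].add(word)

    # Rank by total pair frequency inside the cluster, then prune by truncation.
    def score(cluster):
        return sum(n for p, n in frequent.items() if p <= cluster)

    ranked = sorted(groups.values(), key=lambda c: (-score(c), sorted(c)))
    surviving = ranked[:max_clusters]

    # Arrange each cluster so words that usually come first are listed first.
    def natural_order(cluster):
        def lead(word):
            return sum(precedes[(word, o)] - precedes[(o, word)]
                       for o in cluster if o != word)
        return sorted(cluster, key=lambda w: (-lead(w), w))

    return [" ".join(natural_order(c)) for c in surviving]


stream = [
    "The power outage in downtown Seattle is bad",
    "Seattle power outage hits downtown",
    "I love coffee in the morning",
    "Huge power outage reported in Seattle",
]
# Display the clusters as the summary of the "seattle" channel.
print(summarize(stream, ["seattle"]))  # ['power outage seattle downtown']
```

With the sample stream, the third message is filtered out because it does not contain the channel key word, and the four words that co-occur in at least two of the remaining messages form a single cluster. A production system would likely need tokenization that handles punctuation and a more careful pruning rule than simple truncation.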
