Realtime data stream cluster summarization and labeling system

  • US 20170255536A1
  • Filed: 12/08/2016
  • Published: 09/07/2017
  • Est. Priority Date: 03/15/2013
  • Status: Active Grant
First Claim

1. A method for generating topic labels from statistical topic models, comprising:

  • receiving a collection of topics, associated topic word probabilities for a given topic in conjunction with a statistical topic model and a set of documents associated with each topic;

  • truncating a document set to include documents having an aggregate topic word probability that meets truncation criteria;

  • reweighting the probabilities for each topic word in the truncated document set for a given topic based on the frequency that the topic word appears across the collection of topics;

  • determining, for each document in the truncated document set for the given topic, an aggregate topic word probability;

  • identifying topic fragments in each document in a truncated document set based on the topic words to create user friendly and highly descriptive topic labels.
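
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the claim does not specify the truncation criteria, the reweighting formula, or how fragments are found, so the sketch assumes a fixed score threshold, an IDF-style log weight over topics, and fragments defined as maximal runs of consecutive topic words. All function names and the toy data are hypothetical.

```python
import math
from collections import Counter


def aggregate_probability(tokens, word_probs):
    """Sum the topic-word probabilities of every token in a document."""
    return sum(word_probs.get(t, 0.0) for t in tokens)


def truncate(docs, word_probs, threshold):
    """Keep documents whose aggregate probability meets the threshold.
    A fixed threshold is an assumed truncation criterion."""
    return [d for d in docs if aggregate_probability(d, word_probs) >= threshold]


def reweight(topics, topic_id):
    """Down-weight words that occur in many topics (assumed IDF-style weight)."""
    n_topics = len(topics)
    topic_count = Counter(w for probs in topics.values() for w in probs)
    return {w: p * math.log(1.0 + n_topics / topic_count[w])
            for w, p in topics[topic_id].items()}


def fragments(tokens, word_probs):
    """Yield maximal runs of consecutive topic words (assumed fragment rule)."""
    run = []
    for t in tokens:
        if t in word_probs:
            run.append(t)
        else:
            if run:
                yield tuple(run)
            run = []
    if run:
        yield tuple(run)


def label_topic(topics, docs_by_topic, topic_id, threshold):
    """Return the best-scoring fragment as a label, or None if no fragment exists."""
    kept = truncate(docs_by_topic[topic_id], topics[topic_id], threshold)
    weights = reweight(topics, topic_id)
    scores = Counter()
    for doc in kept:
        doc_score = aggregate_probability(doc, weights)
        for frag in fragments(doc, weights):
            # Each occurrence votes with its document's reweighted score.
            scores[frag] += doc_score
    if not scores:
        return None
    best = max(scores, key=lambda f: (scores[f], f))  # deterministic tie-break
    return " ".join(best)


# Toy input: two topics with word probabilities, plus tokenized documents.
topics = {
    "t0": {"solar": 0.4, "panel": 0.3, "energy": 0.2, "new": 0.1},
    "t1": {"stock": 0.5, "market": 0.3, "energy": 0.1, "new": 0.1},
}
docs_by_topic = {
    "t0": [
        "the solar panel design cuts cost".split(),
        "solar panel output rose again".split(),
        "new energy policy and solar subsidies".split(),
        "a new report arrived".split(),
    ],
    "t1": ["the stock market fell".split()],
}

print(label_topic(topics, docs_by_topic, "t0", threshold=0.5))  # solar panel
```

In this toy run the document "a new report arrived" is dropped at the truncation step, and the words "new" and "energy" are down-weighted because they appear in both topics, so the fragment "solar panel" is selected as the label.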
