
Apparatus for automatic theme detection from unstructured data

  • US 10,372,741 B2
  • Filed: 03/01/2013
  • Issued: 08/06/2019
  • Est. Priority Date: 03/02/2012
  • Status: Active Grant
First Claim

1. A system comprising:

  • a repository of unstructured documents stored in a computing system;

    a natural language processor configured to perform language processing; and

    a non-transitory computer-readable storage medium comprising instructions that, when executed, enable a computing system to detect themes within the unstructured documents by:

    removing noise words from the unstructured documents, to yield clean documents;

    initiate a sentiment computation component configured to determine sentiment of each word in the clean documents by:

    assigning to each word in the clean documents at least one of a positive sentiment, a negative sentiment, and neutral sentiment, to yield assigned sentiments;

    determining a sentiment probability of a section of the unstructured data based on the assigned sentiments of words in the section, to yield assigned sectional sentiment; and

    determining an overall sentiment probability distribution for the unstructured documents based on the assigned sectional sentiment of multiple sections of the unstructured documents;

    initiate a theme detection component configured to:

    discover themes based on topics with neutral sentiment when the topics are located in a section of the unstructured documents with a sentiment probability that is greater than an overall sentiment probability distribution;

    assign labels to each discovered theme;

    identify patterns that describe each theme;

    identify instances of the themes within individual documents of the unstructured documents based on a presence of the patterns in the individual documents; and

    organize the themes in a hierarchy using the instances of the themes; and

    initiate a user interface configured to:

    allow an operator to initiate theme detection by the theme detection component; and

    allow an operator to view and interact with results of the theme detection, wherein the results comprise at least one of the assigned labels, the patterns, and the hierarchy.
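
The sentiment and theme-discovery steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The word lists, the function names, the treatment of each section as a plain string, the use of a mean of sectional distributions as the "overall sentiment probability distribution", and the reading of "greater than" as a per-label comparison are all assumptions made for illustration; the claim does not specify any of them. Labeling, pattern identification, and hierarchy construction are not covered.

```python
import re
from collections import Counter

# Hypothetical lexicons: the claim does not name any word lists.
NOISE_WORDS = {"the", "a", "an", "is", "was", "and", "but", "of", "to", "it"}
POSITIVE = {"great", "love", "excellent"}
NEGATIVE = {"terrible", "slow", "broken"}
LABELS = ("positive", "negative", "neutral")


def clean(section):
    """Lowercase and tokenize a section, then drop noise words."""
    tokens = re.findall(r"[a-z']+", section.lower())
    return [w for w in tokens if w not in NOISE_WORDS]


def word_sentiment(word):
    """Assign exactly one sentiment label to a word (lexicon lookup)."""
    if word in POSITIVE:
        return "positive"
    if word in NEGATIVE:
        return "negative"
    return "neutral"


def sentiment_distribution(words):
    """Fraction of words carrying each sentiment label in one section."""
    counts = Counter(word_sentiment(w) for w in words)
    total = sum(counts.values()) or 1  # avoid division by zero on empty sections
    return {label: counts[label] / total for label in LABELS}


def discover_theme_candidates(documents):
    """documents: list of documents, each a list of section strings.

    Returns the overall distribution and a Counter of neutral words found
    in sections whose positive or negative share exceeds the overall share.
    """
    sections = [clean(s) for doc in documents for s in doc]
    dists = [sentiment_distribution(words) for words in sections]

    # Assumption: "overall" is the mean of the sectional distributions.
    n = len(dists) or 1
    overall = {label: sum(d[label] for d in dists) / n for label in LABELS}

    candidates = Counter()
    for words, dist in zip(sections, dists):
        # Assumption: a section qualifies when it is more positive or more
        # negative than the collection as a whole.
        if any(dist[label] > overall[label] for label in ("positive", "negative")):
            candidates.update(w for w in words if word_sentiment(w) == "neutral")
    return overall, candidates
```

For example, given the sections "The battery is terrible and slow." and "The screen is great." in one document and "It was a phone." in another, the neutral words in the two sentiment-bearing sections (battery, screen) become theme candidates, while the word from the purely neutral section (phone) does not.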
