Data analytics system and methods for text data

  • US 10,275,444 B2
  • Filed: 07/15/2016
  • Issued: 04/30/2019
  • Est. Priority Date: 07/15/2016
  • Status: Active Grant
First Claim

1. A device, comprising:

  • a processing system including a processor; and

    a memory that stores executable instructions that, when executed by the processing system, facilitate performance of operations, comprising:

    performing a statistical natural language processing analysis on a plurality of text documents to determine a plurality of topics, wherein prior to performing the statistical natural language processing analysis, a training is performed on sample documents to determine parameters for the statistical natural language processing analysis;

    creating a proper subset of topics from the plurality of topics, based on user input;

    mapping a topic in the proper subset of topics to each document in the plurality of text documents, thereby creating a plurality of topic-document pairs;

    for each topic-document pair of the plurality of topic-document pairs, identifying a bias from text in a corresponding document of the topic-document pair;

    creating clusters of topics from the proper subset of topics, wherein each cluster of topics is determined from the bias of each topic-document pair and a frequency of occurrence of each topic in the document identified by the topic-document pair, and wherein the clusters of topics have an image configuration based on the bias and the frequency of occurrence that distinguishes one cluster from another; and

    generating presentable content depicting each cluster of the clusters of topics according to a corresponding image configuration, wherein the image configuration specifies that an area for each cluster of topics is subdivided into separate sub-areas for each topic, wherein the sub-area for each topic represents a frequency of occurrence of that topic.
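The operations recited in claim 1 can be illustrated with a small sketch. This is a toy under stated assumptions, not the patented implementation: the claim does not name an algorithm, so the trained statistical topic analysis is replaced here by simple keyword matching, the bias is approximated with a hypothetical word-list sentiment score computed per document, and clusters are formed by grouping on the bias label. All names (DOCS, SELECTED_TOPICS, bias_of, and so on) and the sample data are invented for illustration.

```python
from collections import defaultdict

# Hypothetical sample data; none of these names or values come from the patent.
DOCS = {
    "d1": "billing was great but billing support was slow",
    "d2": "coverage is terrible and coverage drops often",
    "d3": "great coverage and great billing",
}
SELECTED_TOPICS = ["billing", "coverage"]  # stands in for the user-chosen proper subset
POSITIVE = {"great"}                       # assumed sentiment word lists
NEGATIVE = {"slow", "terrible", "drops"}


def bias_of(text):
    """Crude stand-in for bias identification: compare positive and negative word counts."""
    words = text.split()
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def topic_document_pairs(docs, topics):
    """Build (topic, doc_id, frequency, bias) for every topic that occurs in a document."""
    pairs = []
    for doc_id, text in docs.items():
        words = text.split()
        for topic in topics:
            freq = words.count(topic)
            if freq:
                pairs.append((topic, doc_id, freq, bias_of(text)))
    return pairs


def cluster_by_bias(pairs):
    """Group topics into clusters keyed by bias, summing topic frequency within each cluster."""
    clusters = defaultdict(lambda: defaultdict(int))
    for topic, _doc_id, freq, bias in pairs:
        clusters[bias][topic] += freq
    return {bias: dict(topics) for bias, topics in clusters.items()}


def sub_areas(clusters):
    """For each cluster, give every topic a share of the cluster area proportional to its frequency."""
    layout = {}
    for bias, topics in clusters.items():
        total = sum(topics.values())
        layout[bias] = {topic: freq / total for topic, freq in topics.items()}
    return layout


pairs = topic_document_pairs(DOCS, SELECTED_TOPICS)
clusters = cluster_by_bias(pairs)
areas = sub_areas(clusters)
```

Here `areas` maps each bias cluster to the fraction of that cluster's area assigned to each topic, which is the information a treemap-style renderer would need to subdivide a cluster into per-topic sub-areas. A real system would replace the keyword matching with a trained topic model and the word lists with a proper bias or sentiment classifier.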
