×

Systems and methods for identifying key phrase clusters within documents

  • US 10,180,929 B1
  • Filed: 10/13/2016
  • Issued: 01/15/2019
  • Est. Priority Date: 06/30/2014
  • Status: Active Grant
First Claim
Patent Images

1. An electronic device comprising:

  • a computer display;

    computer-readable storage media; and

    one or more processors configured to execute instructions to cause the electronic device to;

    obtain, based on a first user input, documents and a statistical model;

    segment contents of the documents into segments;

    determine frequencies at which the segments occur within the contents of the documents and store the frequencies in the computer-readable storage media;

    with the statistical model, determine modeled frequencies for the segments;

    compare the frequencies with the modeled frequencies;

    based on the comparison, determine statistical significance values for the segments;

    identify representative segments from the segments having statistical significance values exceeding a predetermined threshold value;

    cluster the documents into clusters, each cluster having identical or substantially identical representative segments;

    determine a label for each cluster;

    display within a graphical user interface a representation of the documents;

    receive a second user input and identify a set of clusters, from the clusters, associated with the second user input; and

    based on the received second user input, modify the graphical user interface to further includea representation of the second user input, andfor each of the clusters of the set of clusters;

    an indication of the label associated with the cluster, andan indication of the documents associated with the cluster,wherein the clusters of the set of clusters are grouped and displayed in separate portions of the graphical user interface.

View all claims
  • 8 Assignments
Timeline View
Assignment View
    ×
    ×