×

Document relationship analysis system

  • US 9,928,295 B2
  • Filed: 02/02/2015
  • Issued: 03/27/2018
  • Est. Priority Date: 01/31/2014
  • Status: Active Grant
First Claim
Patent Images

1. A system for analyzing relationships between documents, the system comprising:

  • a user interface;

    an ingest memory configured to store source documents retrieved from an external document source;

    a text index memory configured to store a text index;

    a cluster index memory configured to store document vectors associated with each source document;

    a text extraction pipeline automatically extracting text from source documents added to the ingest memory;

    a document vector calculator automatically computing document vectors for source documents by applying term weights to the extracted text associated with the source document, the document vector calculator generating a plurality of profile document vectors associated with profile documents selected for use in a query against a target dataset;

    an indexer automatically building an index of the extracted text and storing the text index in the text index memory;

    a dataset manager component generating a result dataset containing documents of interest from a target dataset containing selected source documents based on a query by evaluating similarities between each profile document vector and the document vector calculated for each source document in the target dataset; and

    a relationship analyzer component automatically selecting a visualization model for clustering the documents of interest based the number of documents of interest in the result dataset and rendering the result set using selected visualization model in the user interface.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×