Systems and methods for determining atypical language

  • US 9,690,849 B2
  • Filed: 03/07/2014
  • Issued: 06/27/2017
  • Est. Priority Date: 09/30/2011
  • Status: Active Grant
First Claim

1. A computer-implemented method comprising:

  • analyzing a first cluster of conceptually-related portions of text to identify a probability for each of the one or more portions of texts within the first cluster of conceptually-related portions of text, wherein the first cluster of conceptually-related portions of text comprises one or more financial documents and each of the one or more financial documents comprises one or more financial document sections and each of the one or more financial document sections comprises one or more sentences, and wherein the probability is calculated based on the number of occurrences of a given token of a given sentence of a given financial document of the first cluster of conceptually-related portions of text;

  • developing a model based on the one or more probabilities corresponding to the one or more portions of texts within the first cluster of conceptually-related portions of text;

  • calculating an abnormality score for each of the one or more sentences of the one or more financial document sections of a first identified conceptually-related portion of text as compared to the model; and

  • transmitting a second identified conceptually-related portion of text based upon the abnormality score satisfying a threshold.
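
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation; it only assumes one plausible reading of the claim. The function names (`build_model`, `token_probability`, `abnormality_score`, `flag_atypical`), the whitespace tokenizer, the add-one smoothing, and the use of average negative log-probability as the abnormality score are all hypothetical choices that the claim does not specify. The claim's "transmitting" step is reduced here to returning the sentences whose score meets the threshold.

```python
import math
from collections import Counter


def tokenize(sentence):
    # Assumption: lowercase whitespace tokenization; the claim does not define tokens.
    return sentence.lower().split()


def build_model(cluster_sentences):
    # Count token occurrences across every sentence in the cluster.
    counts = Counter(tok for s in cluster_sentences for tok in tokenize(s))
    total = sum(counts.values())
    return counts, total, len(counts)


def token_probability(model, token):
    # Assumption: add-one smoothing so unseen tokens get a small nonzero probability.
    counts, total, vocab_size = model
    return (counts[token] + 1) / (total + vocab_size + 1)


def abnormality_score(model, sentence):
    # Assumption: score = average negative log-probability of the sentence's tokens.
    # Higher values mean the sentence looks less like the cluster.
    tokens = tokenize(sentence)
    if not tokens:
        return 0.0
    return -sum(math.log(token_probability(model, t)) for t in tokens) / len(tokens)


def flag_atypical(model, sentences, threshold):
    # Stand-in for the "transmitting" step: keep sentences meeting the threshold.
    return [s for s in sentences if abnormality_score(model, s) >= threshold]


if __name__ == "__main__":
    # Hypothetical toy cluster of sentences from financial document sections.
    cluster = [
        "the company reported revenue growth",
        "the company reported net income",
        "the company reported revenue decline",
    ]
    model = build_model(cluster)
    candidates = [
        "the company reported revenue",
        "litigation uncertainty threatens solvency",
    ]
    for s in flag_atypical(model, candidates, threshold=2.5):
        print(s)
```

In this toy setup, a sentence built from tokens common in the cluster scores lower than one made up entirely of unseen tokens, so only the latter meets the threshold. The threshold value of 2.5 is arbitrary and chosen only for the example.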

  • 5 Assignments