Systems and Methods for Determining Lexical Associations Among Words in a Corpus

  • US 20150347385A1
  • Filed: 06/01/2015
  • Published: 12/03/2015
  • Est. Priority Date: 05/30/2014
  • Status: Active Grant
First Claim

1. A computer-implemented method of identifying one or more target words of a corpus that have a lexical relationship to a plurality of provided cue words, the method comprising:

  • receiving a plurality of cue words;

    analyzing the cue words and statistical lexical information derived from a corpus of documents with a processing system to determine candidate words that have a lexical association with the cue words, the statistical information including numerical values indicative of probabilities of word pairs appearing together as adjacent words in a well-formed text or appearing together within a paragraph of a well-formed text;

    for each candidate word,
      determining, using the processing system, a statistical association score between the candidate word and each of the cue words using numerical values included in the statistical information, and
      generating, using the processing system, an aggregate score for each of the candidate words based on the statistical association scores; and

    selecting one or more of the candidate words to be the one or more target words based on the aggregate scores of the candidate words.
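
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is an illustration only, not the implementation described in the patent. The claim does not name a particular association measure or aggregation rule, so the sketch assumes pointwise mutual information (PMI) over paragraph-level co-occurrence as the statistical association score and a plain sum as the aggregate score. The function names (`build_stats`, `pmi`, `select_targets`) and the toy corpus are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def build_stats(paragraphs):
    """Derive statistical lexical information from a corpus: per-word
    paragraph counts and counts of word pairs sharing a paragraph."""
    word_counts = Counter()
    pair_counts = Counter()
    for paragraph in paragraphs:
        words = set(paragraph.lower().split())
        word_counts.update(words)
        pair_counts.update(
            frozenset(pair) for pair in combinations(sorted(words), 2)
        )
    return word_counts, pair_counts, len(paragraphs)


def pmi(a, b, word_counts, pair_counts, n):
    """Assumed association score: PMI of two words over paragraphs.
    Pairs never seen together score 0.0 (a simplification of this sketch)."""
    joint = pair_counts[frozenset((a, b))]
    if joint == 0:
        return 0.0
    return math.log2(joint * n / (word_counts[a] * word_counts[b]))


def select_targets(cues, paragraphs, top_k=1):
    """Receive cue words, find candidates, score them, pick the top ones."""
    word_counts, pair_counts, n = build_stats(paragraphs)
    cues = [cue.lower() for cue in cues]

    # Candidate words: any non-cue word that co-occurs with at least one cue.
    candidates = {
        word
        for word in word_counts
        if word not in cues
        and any(pair_counts[frozenset((word, cue))] for cue in cues)
    }

    # One association score per (candidate, cue) pair, summed into an
    # aggregate score per candidate (the sum is an assumption).
    aggregate = {
        word: sum(pmi(word, cue, word_counts, pair_counts, n) for cue in cues)
        for word in candidates
    }

    # Select the highest-scoring candidates; ties break alphabetically.
    ranked = sorted(aggregate, key=lambda word: (-aggregate[word], word))
    return ranked[:top_k]


if __name__ == "__main__":
    toy_corpus = [
        "dog bone leash",
        "dog leash park",
        "bone leash dog",
        "park tree sun",
        "sun tree",
    ]
    print(select_targets(["dog", "bone"], toy_corpus, top_k=2))
```

Summation is only one possible aggregation. A product of scores, or a rule that requires a candidate to be associated with every cue word, would fit the same claim language and could rank candidates differently.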
