×

DOMAIN-SPECIFIC COMPUTATIONAL LEXICON FORMATION

  • US 20160179783A1
  • Filed: 03/05/2015
  • Published: 06/23/2016
  • Est. Priority Date: 12/23/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • extracting a candidate token sequence comprising one or more word tokens from an unstructured domain glossary comprising a plurality of entries associated with a domain;

    performing a look-up operation to retrieve language data for each word token in the candidate token sequence;

    annotating each word token in the candidate token sequence found by the look-up operation with corresponding retrieved language data to form an annotated sequence;

    performing a pattern match of the annotated sequence relative to a repository of patterns;

    identifying a best matching pattern from the repository of patterns to the annotated sequence based on matching criteria;

    refining the annotated sequence with lexical information associated with the best matching pattern as a refined annotated sequence; and

    outputting the candidate token sequence and the refined annotated sequence to a domain-specific computational lexicon file.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×