Extraction of lexical kernel units from a domain-specific lexicon

  • US 9,588,959 B2
  • Filed: 01/09/2015
  • Issued: 03/07/2017
  • Est. Priority Date: 01/09/2015
  • Status: Expired due to Fees
First Claim

1. A computer program product comprising:

  • a tangible storage medium readable by a processor and storing instructions for execution by the processor to perform a method comprising:

    receiving a candidate lexical kernel unit comprising a word token sequence that includes two or more words;

    retrieving domain terms that contain the two or more words from a terminology resource file of domain terms associated with a domain;

    analyzing the candidate lexical kernel unit and the retrieved domain terms to determine whether the candidate lexical kernel unit satisfies specified criteria for use as a building block by a natural-language processing (NLP) tool for building larger lexical units in the domain, each of the larger lexical units including a greater number of words than the candidate lexical kernel unit;

    identifying the candidate lexical kernel unit as a lexical kernel unit based on determining that the candidate lexical kernel unit satisfies the specified criteria; and

    outputting the lexical kernel unit to a domain-specific lexical kernel unit file for input to the NLP tool for use as a lexical resource in parsing natural language text in the domain, the parsing including identifying domain-specific terms in the natural language text in the domain.
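
The steps recited in claim 1 can be illustrated with a minimal sketch. This is not the patented implementation. The claim does not state what the "specified criteria" are, so the sketch assumes one hypothetical criterion: a candidate qualifies if it appears as a contiguous word sequence in at least `min_larger_terms` longer domain terms. All function names, the threshold, the one-term-per-line file layout, and the sample terms are assumptions made for illustration.

```python
from typing import Iterable, List


def tokenize(term: str) -> List[str]:
    """Lowercase and split on whitespace (a simplifying assumption)."""
    return term.lower().split()


def retrieve_domain_terms(candidate: List[str],
                          domain_terms: Iterable[str]) -> List[List[str]]:
    """Return the tokenized domain terms that contain every candidate word."""
    wanted = set(candidate)
    return [tokens for tokens in map(tokenize, domain_terms)
            if wanted.issubset(tokens)]


def contains_sequence(tokens: List[str], candidate: List[str]) -> bool:
    """True if candidate occurs as a contiguous sub-sequence of tokens."""
    n = len(candidate)
    return any(tokens[i:i + n] == candidate
               for i in range(len(tokens) - n + 1))


def is_kernel_unit(candidate: List[str],
                   retrieved: List[List[str]],
                   min_larger_terms: int = 2) -> bool:
    """Hypothetical criterion: enough larger terms are built on the candidate."""
    larger = [t for t in retrieved
              if len(t) > len(candidate) and contains_sequence(t, candidate)]
    return len(larger) >= min_larger_terms


def extract_kernel_units(candidates: Iterable[str],
                         domain_terms: Iterable[str],
                         min_larger_terms: int = 2) -> List[str]:
    """Run the receive / retrieve / analyze / identify steps per candidate."""
    terms = list(domain_terms)
    kernels = []
    for text in candidates:
        candidate = tokenize(text)
        if len(candidate) < 2:  # the claim requires two or more words
            continue
        retrieved = retrieve_domain_terms(candidate, terms)
        if is_kernel_unit(candidate, retrieved, min_larger_terms):
            kernels.append(" ".join(candidate))
    return kernels


def write_kernel_file(kernels: Iterable[str], path: str) -> None:
    """Output step: one kernel unit per line (file layout is an assumption)."""
    with open(path, "w", encoding="utf-8") as handle:
        for kernel in kernels:
            handle.write(kernel + "\n")


# Made-up sample data, not taken from the patent.
SAMPLE_TERMS = [
    "congestive heart failure",
    "acute heart failure",
    "heart failure",
    "failure of heart valve",
    "chronic kidney disease",
]
SAMPLE_CANDIDATES = ["heart failure", "kidney disease", "heart"]
```

Under this assumed criterion, "heart failure" would be kept because two longer terms contain it contiguously, "kidney disease" would be rejected because only one longer term does, and "heart" would be skipped as a single word. A real system would presumably apply richer linguistic or statistical tests at the analysis step.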
