
Computer-implemented method, program, and system for identifying non-self-descriptive terms in electronic documents

  • US 9,158,756 B2
  • Filed: 03/13/2013
  • Issued: 10/13/2015
  • Est. Priority Date: 03/30/2012
  • Status: Expired due to Fees
First Claim

1. A computer-implemented method for identifying a non-self-descriptive term in an electronic document, including a memory and a processor communicatively coupled to the memory, wherein the processor is configured to execute the steps of a method comprising:

  • acquiring a noun included in corpus data;

    calculating a qualifying level and qualified level in the corpus data related to each noun included in the corpus data;

    identifying one or more nouns included in the corpus data having a qualifying level and/or qualified level satisfying a predetermined condition; and

    presenting a term related to one or more of the nouns in the electronic document as a candidate for the non-self-descriptive term in the electronic document, wherein the qualified level of a first noun in the corpus data is calculated by:

    counting a number of occurrences (M) of the first noun in the corpus data;

    counting a number of times (Mb1) the first noun is qualified by a preposition in the corpus data;

    counting a number of times (Mb2) the first noun is qualified by a present or past participle in the corpus data;

    counting a number of times (Mb3) the first noun is qualified by a noun adjunct in the corpus data; and

    summing Mb1, Mb2 and Mb3 and dividing the sum by M to obtain the qualified level of the first noun in the corpus data.
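
The qualified-level calculation recited above reduces to the ratio (Mb1 + Mb2 + Mb3) / M. The following is a minimal Python sketch of that arithmetic only, not the patented implementation. It assumes the four counts have already been obtained from a parsed corpus. The names `NounCounts`, `qualified_level`, and `candidate_nouns` are hypothetical. The claim does not say what the "predetermined condition" is, so the threshold test (qualified level at or above a cutoff) is an assumption made for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class NounCounts:
    """Per-noun corpus counts (hypothetical container, not from the claim)."""
    occurrences: int      # M: total occurrences of the noun
    by_preposition: int   # Mb1: times qualified by a preposition
    by_participle: int    # Mb2: times qualified by a present or past participle
    by_noun_adjunct: int  # Mb3: times qualified by a noun adjunct


def qualified_level(counts: NounCounts) -> float:
    """Return (Mb1 + Mb2 + Mb3) / M for one noun."""
    if counts.occurrences <= 0:
        raise ValueError("M must be positive to compute a qualified level")
    qualified = (counts.by_preposition
                 + counts.by_participle
                 + counts.by_noun_adjunct)
    return qualified / counts.occurrences


def candidate_nouns(corpus_counts: dict[str, NounCounts],
                    threshold: float) -> list[str]:
    """Select nouns whose qualified level meets an assumed threshold.

    The ">= threshold" test stands in for the claim's unspecified
    "predetermined condition".
    """
    return sorted(noun for noun, counts in corpus_counts.items()
                  if qualified_level(counts) >= threshold)


# Illustrative counts only; these numbers are invented for the example.
example_counts = {
    "level": NounCounts(occurrences=40, by_preposition=20,
                        by_participle=6, by_noun_adjunct=10),
    "apple": NounCounts(occurrences=50, by_preposition=5,
                        by_participle=0, by_noun_adjunct=5),
}
selected = candidate_nouns(example_counts, threshold=0.5)
```

With these invented counts, "level" has a qualified level of 36/40 and "apple" has 10/50, so only "level" passes a cutoff of 0.5.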
