×

System and method of generating dictionary entries

  • US 7,254,530 B2
  • Filed: 09/26/2002
  • Issued: 08/07/2007
  • Est. Priority Date: 09/26/2001
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for automatically generating a dictionary based on a corpus of full text articles, comprising:

  • applying linguistic pattern analysis to the sentences in the corpus to extract <

    term, definition>

    pairs and identify sentences with candidate complex <

    term, definition>

    pairs;

    applying grammar analysis to the sentences with candidate complex <

    term, definition>

    pairs to extract <

    term, definition>

    pairs;

    storing the extracted <

    term, definition>

    pairs in a dictionary database;

    wherein the linguistic pattern analysis further comprises identifying sentences including text markers, and wherein the sentences including text markers are subjected to filtering to remove sentences not likely to include <

    term, definition>

    pairs, and wherein the filtering includes rules selected from the group including sentences with conjunctions at the beginning of a text marker, sentences having phrases indicative of explanation, and sentences having patterns indicative of enumeration.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×