×

Method and apparatus for learning information extraction patterns from examples

  • US 5,796,926 A
  • Filed: 06/06/1995
  • Issued: 08/18/1998
  • Est. Priority Date: 06/06/1995
  • Status: Expired due to Fees
First Claim
Patent Images

1. In a computer-based information extraction system having text as input and events as output, a method for learning information extraction patterns for use in logging events, said learning method comprising the steps of:

  • a) presenting an example sentence to a pattern learning engine;

    b) identifying to said pattern learning engine a valid event, said valid event comprising a set of syntactic constituents from within said example sentence; and

    c) determining whether said example sentence and its corresponding valid event is not already matched by any known pattern, wherein said determining step comprises;

    segmenting said example sentence into in a series of syntactic constituents, each said syntactic constituent containing a head word/head entity characterizing said syntactic constituent; and

    matching each said head word/entity and selected other syntactic properties of said series of syntactic constituents against said known patterns in order to verify plausibility of specific syntactic relationships between said syntactic constituents of the event under test;

    and, if the example sentence and its valid event are not matched by any of said known patterns,d) attempting to generalize one of said known patterns to match the example sentence with its corresponding valid event; and

    , if no acceptable resultant pattern is produced,e) building, in said learning engine, a new grammar pattern based on said example sentence and its corresponding valid event, for use in constructing subsequent valid events from subsequent input sentences which are input to said information extraction system.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×