×

Methods and systems relating to information extraction

  • US 20060253274A1
  • Filed: 04/24/2006
  • Published: 11/09/2006
  • Est. Priority Date: 05/05/2005
  • Status: Active Grant
First Claim
Patent Images

1. A method of training an information extraction system comprising:

  • employing a first corpus of annotated text;

    automatedly extracting information from a second corpus of unannotated text;

    automatedly populating a discriminative information extraction model based on the information extracted from the second corpus and the first corpus of annotated text;

    automatedly identifying from at least one of the first corpus, the second corpus, and a third corpus one or more word strings including words having an ambiguous relationship and providing the one or more word strings to a trainer for annotation; and

    automatedly updating the discriminative information extraction model based on annotations to the one or more word strings provided by the trainer.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×