×

Acquisition and application of contextual role knowledge for coreference resolution

  • US 20090326919A1
  • Filed: 11/18/2004
  • Published: 12/31/2009
  • Est. Priority Date: 11/18/2003
  • Status: Active Grant
First Claim
Patent Images

1. A method for associating anaphors with antecedents in a written work, the method comprising:

  • processing a training corpus containing textual documents that are topically related to the written work, said processing producing interpretive information useful to categorize noun phrases of the training corpus as independent or potentially anaphoric;

    identifying noun phrases within the written work;

    using the interpretive information, filtering those identified noun phrases to exclude noun phrases that can be identified to be independent in nature;

    identifying a set of potentially anaphoric noun phrases occurring in the written work;

    following said identifying a set of potentially anaphoric noun phrases, recognizing cases of unambiguous coreferences in the set of potentially anaphoric noun phrases, said recognizing associating a noun phrase with an antecedent for each case;

    following said recognizing, identifying coreference combinations for unrecognized noun phrases from the set of potentially anaphoric noun phrases, each coreference combination including an unrecognized noun phrase and a potential antecedent;

    applying a plurality of general knowledge sources and contextual role knowledge sources to the coreference combinations, wherein the contextual role knowledge sources include events and a manner of participation in the events to identify relatedness for the coreference combination at a thematic role level, said applying producing evidentiary values for the coreference combinations;

    applying a factor to each of the produced evidentiary values to favor more credible knowledge sources;

    for each unrecognized noun phrase, applying a probabilistic model to the produced evidentiary values associated with the noun phrase; and

    for each application of the probabilistic model to an unrecognized noun phrase, selecting either an antecedent for that unrecognized noun phrase, if the coreference of that antecedent has a corresponding evidentiary value above a selected threshold value, or no antecedent otherwise.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×