×

Applying a structured language model to information extraction

  • US 8,706,491 B2
  • Filed: 08/24/2010
  • Issued: 04/22/2014
  • Est. Priority Date: 05/20/2002
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method of training an information extraction system to extract information from a natural language input, comprising:

  • initializing a structured language model with syntactically annotated training data, the annotated training data including a parse tree for a sentence having syntactic labels comprising a frame label indicating an overall action being referred to by the sentence and slot labels identifying attributes of the action;

    training the structured language model by generating parses with the initialized structured language model using annotated training data that has semantic constituent labels with semantic constituent boundaries identified, wherein the structured language model is trained as a match constrained parser which generates a set of syntactic parses for a given word string that all match the constituent boundaries specified by the semantic parse, by determining whether unlabeled constituents that define the semantic parse are included in a set of constituents that define the syntactic parse, wherein any parses that do not match the constituent boundaries are discarded;

    replacing the syntactic labels in the parse tree with joint syntactic and semantic labels based on the generated parses excluding the discarded parses; and

    retraining the structured language model in which the structured language model generates parses that are constrained to identically match the semantic constituent labels of the joint syntactic and semantic labels and constrained to match all of the semantic constituent boundaries.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×