Applying a structured language model to information extraction
Abstract
One feature of the present invention uses the parsing capabilities of a structured language model in the information extraction process. During training, the structured language model is first initialized with syntactically annotated training data. The model is then trained by generating parses on semantically annotated training data enforcing annotated constituent boundaries. The syntactic labels in the parse trees generated by the parser are then replaced with joint syntactic and semantic labels. The model is then trained by generating parses on the semantically annotated training data enforcing the semantic tags or labels found in the training data. The trained model can then be used to extract information from test data using the parses generated by the model.
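The label-replacement step described in the abstract can be illustrated with a minimal sketch. It assumes a parse tree is a set of labeled word spans and that the semantic annotation maps some of those spans to semantic labels. The `Node` class, the `relabel` function, the hyphen-joined label format, and the example labels (`Travel`, `Destination`) are all hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Node:
    """One constituent of a parse tree (hypothetical representation)."""
    label: str                 # syntactic label, e.g. "NP"
    start: int                 # index of the first word covered
    end: int                   # index one past the last word covered
    children: List["Node"] = field(default_factory=list)


def relabel(node: Node, semantic_spans: Dict[Tuple[int, int], str]) -> Node:
    """Return a copy of the tree in which every constituent whose span
    carries a semantic annotation gets a joint "SYNTACTIC-SEMANTIC" label.
    The joint-label format is an assumption made for illustration."""
    semantic = semantic_spans.get((node.start, node.end))
    label = f"{node.label}-{semantic}" if semantic else node.label
    return Node(label, node.start, node.end,
                [relabel(child, semantic_spans) for child in node.children])


# Hypothetical input: the words "fly to Boston" with a syntactic parse
# and two semantically annotated spans.
tree = Node("S", 0, 3, [
    Node("VB", 0, 1),
    Node("PP", 1, 3, [Node("TO", 1, 2), Node("NP", 2, 3)]),
])
annotations = {(0, 3): "Travel", (2, 3): "Destination"}

joint = relabel(tree, annotations)
print(joint.label)                          # S-Travel
print(joint.children[1].children[1].label)  # NP-Destination
```

Constituents without a semantic annotation keep their purely syntactic label in this sketch; how the patented method labels such constituents is not stated in the abstract.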
6 Claims
1. A method of extracting information from a natural language input using a computer having a processor and memory, comprising:
- accessing a semantic schema with a structured language model, the semantic schema having a template with a structure of frames that corresponds to one or more pieces of information to be extracted for an application program;
- generating, with the processor, a candidate parse by parsing the natural language input with the structured language model, wherein, during generation, the structured language model generates hypothesis parses of a portion of the natural language input by applying the template and accepts only those hypothesis parses, as possible candidate parses, if the hypothesis parses completely match the structure of frames from the template, and discards all hypothesis parses, during construction of the hypothesis parses, that do not completely match the structure of frames from the templates, each accepted candidate parse including syntactic head words, and semantic labels, and using the head words and semantic labels in each accepted candidate parse to predict a next word in the natural language input, to obtain an overall parse for the natural language input, the overall parse having a semantic frame label and one or more constituents of the natural language input each having a semantic slot label, the overall parse being constrained based on the semantic schema accessed; and
- identifying, with the processor, an information extraction frame corresponding to the natural language input based on the frame label and filling in slots in the frame with the one or more constituents labeled by the slot labels.
Dependent claims: 2, 3.
4. An information extraction system for extracting information from a natural language speech input using a computer, comprising:
- a speech recognizer, including a structured language model, receiving the natural language speech input and generating a textual representation of the natural language speech input based on language modeling by the structured language model, the structured language model accessing a semantic schema having a template with a structure of frames that corresponds to one or more pieces of information to be extracted for an application program and parsing the textual representation, generating a plurality of parse hypotheses, to obtain one or more candidate semantic parse trees, each of the parse hypotheses being constrained during generation, by the template in the semantic schema so that all parse hypotheses that do not match the complete structure of frames in the template are discarded during their construction so that each candidate semantic parse tree obtained by the structured language model matches the structure of frames and all parse hypotheses which do not match the complete structure of frames in the template are rejected, wherein each candidate semantic parse tree has a structure with a semantic frame label and one or more semantic slot labels corresponding to constituents of the textual representation, the semantic frame and slot labels identifying the information to be extracted, and wherein a selected candidate parse includes head words and semantic labels that are used to predict a next word in the natural language input, to obtain an overall parse for the natural language input; and
- a processor, being a functional element of the computer, activated by the speech recognizer to facilitate parsing of the textual representation.
Dependent claims: 5, 6.
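The schema-constrained acceptance of parse hypotheses and the slot filling recited in the independent claims can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: the schema is modeled as a mapping from frame labels to permitted slot labels, "completely match" is read as "the frame is in the schema and every slot label belongs to that frame", and hypotheses are filtered after they are built, whereas the claims discard them during construction. Next-word prediction is not modeled. The names `Hypothesis`, `matches_template`, `extract`, and the example labels are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List, Optional, Tuple

# Hypothetical schema: frame label -> slot labels permitted in that frame.
Schema = Dict[str, FrozenSet[str]]


@dataclass(frozen=True)
class Hypothesis:
    """A parse hypothesis reduced to its semantic labels (assumed form)."""
    frame: str                          # semantic frame label
    slots: Tuple[Tuple[str, str], ...]  # (slot label, constituent words)
    score: float                        # model score; higher is better


def matches_template(hyp: Hypothesis, schema: Schema) -> bool:
    """Assumed reading of 'completely match': known frame, known slots."""
    allowed = schema.get(hyp.frame)
    return allowed is not None and all(slot in allowed for slot, _ in hyp.slots)


def extract(hyps: List[Hypothesis], schema: Schema) -> Optional[dict]:
    """Keep only schema-conforming hypotheses, pick the best-scoring one,
    and fill the frame's slots with the labeled constituents."""
    accepted = [h for h in hyps if matches_template(h, schema)]
    if not accepted:
        return None
    best = max(accepted, key=lambda h: h.score)
    return {"frame": best.frame, "slots": dict(best.slots)}


schema: Schema = {"Calendar": frozenset({"Person", "Time"})}
hypotheses = [
    Hypothesis("Calendar", (("Person", "Megan"), ("Time", "one thirty")), 0.6),
    Hypothesis("Calendar", (("City", "Megan"),), 0.9),  # slot not in frame
    Hypothesis("Weather", (), 0.95),                    # frame not in schema
]
print(extract(hypotheses, schema))
# {'frame': 'Calendar', 'slots': {'Person': 'Megan', 'Time': 'one thirty'}}
```

The two higher-scoring hypotheses are rejected because they fall outside the schema, so the lower-scoring but conforming parse supplies the extracted frame.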
Specification