Method for parsing natural language text with constituent construction links
First Claim
1. A method for improving a natural language parser to parse natural language text, the method comprising:
a) using the parser to generate at least one constituent construction link for at least one source word in at least one additional utterance;
where said constituent construction link consists of a source word, a target word, and a link action, where said link action can be chosen from a set of link action values where said values include at least 2 values from Append, Insert Below, Insert Above and Below;
b) using the at least one constituent construction link to generate at least one constituent tree structure that represents a sentence parse result for each additional utterance by performing determination steps and repeating the determination steps, where the determination steps include;
i. if this is an initial constituent construction link for the additional utterance, create a first word node for a first word of the additional utterance, and create a new node and make it a parent of the first word node;
ii. create a source word node for the source word;
iii. find a highest node above a target word of the constituent construction link wherein the target word is either a first child of the highest node, or wherein the target word is a descendent of the first child of the highest node and is also a descendent of the first child of all intervening nodes between the highest node and the target word, and designate the highest node as a highest right most node;
iv. add one or more nodes to the constituent tree structure at locations relative to the highest right most node if directed by the link action of the Constituent Construction Link;
v. attach the source word node of the source word to the constituent tree structure at a point relative to the highest right most node based on the link action of the Constituent Construction Link.
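The determination steps i through v above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation. The claim names the link actions but does not define what each one does, so the behavior given here for Append (attach the source word as a new last child of the highest node) and Insert Below (add a new node under the highest node and attach the source word beneath it) is assumed. Referring to words by their index in the utterance, raising an error when the target is not a first child, and all names (`Node`, `Link`, `LinkAction`, `highest_node_above`, `build_tree`, `to_brackets`) are likewise hypothetical choices made for this sketch.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List, Optional


class LinkAction(Enum):
    # Only two of the action values named in the claim are modeled here.
    APPEND = "Append"
    INSERT_BELOW = "Insert Below"


@dataclass(eq=False)
class Node:
    label: str = ""  # an empty label marks an internal (non-word) node
    children: List["Node"] = field(default_factory=list)
    parent: Optional["Node"] = field(default=None, repr=False)

    def add_child(self, child: "Node") -> None:
        child.parent = self
        self.children.append(child)


@dataclass
class Link:
    source: int  # index of the source word in the utterance (assumption)
    target: int  # index of the target word in the utterance (assumption)
    action: LinkAction


def highest_node_above(word_node: Node) -> Node:
    """Step iii: climb while the current node is the first child of its parent."""
    node = word_node
    while node.parent is not None and node.parent.children[0] is node:
        node = node.parent
    if node is word_node:
        # The claim does not cover this case; raising is an assumption.
        raise ValueError("target word is not the first child of any node")
    return node


def build_tree(words: List[str], links: List[Link]) -> Node:
    if not links:
        raise ValueError("at least one link is required")
    nodes: Dict[int, Node] = {}
    for i, link in enumerate(links):
        if i == 0:
            # Step i: node for the first word, plus a new parent above it.
            nodes[0] = Node(words[0])
            first_parent = Node()
            first_parent.add_child(nodes[0])
        # Step ii: create a node for the source word.
        source = Node(words[link.source])
        nodes[link.source] = source
        # Step iii: locate the highest node above the target word.
        top = highest_node_above(nodes[link.target])
        # Step iv: add a node only if the link action calls for it (assumed).
        attach_to = top
        if link.action is LinkAction.INSERT_BELOW:
            attach_to = Node()
            top.add_child(attach_to)
        # Step v: attach the source word node relative to the highest node.
        attach_to.add_child(source)
    root = nodes[0]
    while root.parent is not None:
        root = root.parent
    return root


def to_brackets(node: Node) -> str:
    """Render the tree as an unlabeled bracketing for inspection."""
    if not node.children:
        return node.label
    return "(" + " ".join(to_brackets(c) for c in node.children) + ")"


words = ["the", "dog", "barks"]
links = [Link(1, 0, LinkAction.APPEND), Link(2, 0, LinkAction.INSERT_BELOW)]
tree = build_tree(words, links)
print(to_brackets(tree))
```

With these assumed semantics, the two example links produce the bracketing `(the dog (barks))`. The bracketing only shows how the steps interact and carries no linguistic claim about the sentence.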
Abstract
A parser for natural language text is provided. The parser is trained by accessing a corpus of labeled utterances. The parser extracts details of the syntactic tree structures and part of speech tags from the labeled utterances. The details extracted from the tree structures include Simple Links, which are the key to the improved efficiency of this new approach. The parser creates a language model using the details that were extracted from the corpus. The parser then uses the language model to parse utterances.
63 Citations
10 Claims
Specification