Unsupervised learning of deep patterns for semantic parsing
First Claim
Patent Images
1. A method of unsupervised learning of deep patterns for semantic parsing, using an algorithm that operates over a VerbNet corpus, the method comprising:
- receiving a set of exemplary sentences, including a first exemplary sentence, from a language net;
extracting, from VerbNet, a plurality of markup language files corresponding to a usage pattern P;
parsing each of the exemplary sentences to yield a set of training constituency trees respectively corresponding to the exemplary sentences;
performing maximal frequent subtree analysis on the set of training constituency trees to yield an unfiltered set of deep pattern trees; and
filtering at least one irrelevant tree(s) from the set of deep pattern trees to obtain a filtered set of deep pattern trees, with the at least one irrelevant tree(s) do not contain a part of speech that is included in the usage pattern P;
deleting redundant leaves from at least one of the following;
the filtered set of deep pattern, and/or the unfiltered set of deep pattern trees; and
matching nodes of each tree pattern, of the filtered set of deep tree patterns, with items of the usage pattern P using machine logic based rules to facilitate translation of the filtered set of deep pattern trees into annotation query language rules;
At least the parsing step is performed by computer software running on computer hardware.
1 Assignment
0 Petitions
Accused Products
Abstract
Using exemplary sentences, usage patterns and thematic roles ascribed in VerbNet to generate “deep pattern trees” for the exemplary sentences. Then, when an arbitrary natural language subject sentence is input, these deep pattern trees can be matched to the natural language subject sentence in order to assign thematic roles to at least some of the “grammatical portions” of the natural language subject sentence.
58 Citations
9 Claims
-
1. A method of unsupervised learning of deep patterns for semantic parsing, using an algorithm that operates over a VerbNet corpus, the method comprising:
-
receiving a set of exemplary sentences, including a first exemplary sentence, from a language net; extracting, from VerbNet, a plurality of markup language files corresponding to a usage pattern P; parsing each of the exemplary sentences to yield a set of training constituency trees respectively corresponding to the exemplary sentences; performing maximal frequent subtree analysis on the set of training constituency trees to yield an unfiltered set of deep pattern trees; and filtering at least one irrelevant tree(s) from the set of deep pattern trees to obtain a filtered set of deep pattern trees, with the at least one irrelevant tree(s) do not contain a part of speech that is included in the usage pattern P; deleting redundant leaves from at least one of the following;
the filtered set of deep pattern, and/or the unfiltered set of deep pattern trees; andmatching nodes of each tree pattern, of the filtered set of deep tree patterns, with items of the usage pattern P using machine logic based rules to facilitate translation of the filtered set of deep pattern trees into annotation query language rules; At least the parsing step is performed by computer software running on computer hardware. - View Dependent Claims (2, 3)
-
-
4. A computer program product, for unsupervised learning of deep patterns for semantic parsing, using an algorithm that operates over a VerbNet corpus, the computer program product comprising software stored on a software storage device, the software comprising:
-
first program instructions programmed to extract, from VerbNet, a plurality of markup language files corresponding to a usage pattern P; second program instructions programmed to parse each of the exemplary sentences to yield a set of training constituency trees respectively corresponding to the exemplary sentences; third program instructions programmed to perform maximal frequent subtree analysis on the set of training constituency trees to yield an unfiltered set of deep pattern trees; and fourth program instructions programmed to filter at least one irrelevant tree(s) from the set of deep pattern trees to obtain a filtered set of deep pattern trees, with the at least one irrelevant tree(s) do not contain a part of speech that is included in the usage pattern P; fifth program instructions programmed to delete redundant leaves from at least one of the following;
the filtered set of deep pattern, and/or the unfiltered set of deep pattern trees; andsixth program instructions programmed to match nodes of each tree pattern, of the filtered set of deep tree patterns, with items of the usage pattern P using machine logic based rules to facilitate translation of the filtered set of deep pattern trees into annotation query language (AQL) rules. - View Dependent Claims (5, 6)
-
-
7. A computer system for unsupervised learning of deep patterns for semantic parsing, using an algorithm that operates over a VerbNet corpus, the computer system comprising:
-
a processor(s) set; and a software storage device; wherein; the processor set is structured, located, connected and/or programmed to run software stored on the software storage device; and the software comprises; first program instructions programmed to extract, from VerbNet, a plurality of markup language files corresponding to a usage pattern P; second program instructions programmed to parse each of the exemplary sentences to yield a set of training constituency trees respectively corresponding to the exemplary sentences; third program instructions programmed to perform maximal frequent subtree analysis on the set of training constituency trees to yield an unfiltered set of deep pattern trees; and fourth program instructions programmed to filter at least one irrelevant tree(s) from the set of deep pattern trees to obtain a filtered set of deep pattern trees, with the at least one irrelevant tree(s) do not contain a part of speech that is included in the usage pattern P; fifth program instructions programmed to delete redundant leaves from at least one of the following;
the filtered set of deep pattern, and/or the unfiltered set of deep pattern trees; andsixth program instructions programmed to match the nodes of each tree pattern, of the filtered set of deep tree patterns, with items of the usage pattern P using machine logic based rules to facilitate translation of the filtered set of deep pattern trees into annotation query language (AQL) rules. - View Dependent Claims (8, 9)
-
Specification