Method and apparatus for embedding grammars in a natural language understanding (NLU) statistical parser
First Claim
Patent Images
1. In a natural language understanding (NLU) system, a method for including grammars in a statistical parser comprising:
- receiving a text input; and
,applying a first context free grammar (CFG) to said text input to determine substrings and corresponding parse trees, wherein said substrings and said corresponding parse trees further correspond to said first CFG; and
,examining each said possible substring and corresponding parse tree using an inventory of ordered grammar-related queries corresponding to said CFG, wherein said queries are ordered training of the NLU system to form a decision tree, the ordering of the queries being a statistical ordering based on a minimization of conditional entropy or maximization of a likelihood that a resulting model predicts data used to train the NLU system.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for use in a natural language understanding system for including grammars within a statistical parser. The method involves a series of steps. The invention receives a text input. The invention applies a first context free grammar to the text input to determine substrings and corresponding parse trees, wherein the substrings and corresponding parse trees further correspond to the first context free grammar. Additionally, the invention can examine each possible substring using an inventory of queries corresponding to the CFG.
125 Citations
51 Claims
-
1. In a natural language understanding (NLU) system, a method for including grammars in a statistical parser comprising:
-
receiving a text input; and
,applying a first context free grammar (CFG) to said text input to determine substrings and corresponding parse trees, wherein said substrings and said corresponding parse trees further correspond to said first CFG; and
,examining each said possible substring and corresponding parse tree using an inventory of ordered grammar-related queries corresponding to said CFG, wherein said queries are ordered training of the NLU system to form a decision tree, the ordering of the queries being a statistical ordering based on a minimization of conditional entropy or maximization of a likelihood that a resulting model predicts data used to train the NLU system. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A system for recognizing grammatical phrases in a text input comprising:
-
a text buffer for storing said text input; at least one context-free grammar (CFG) for recognizing a particular grammatical phrase within said text input; and
,an inventory of ordered grammar-related queries wherein each query within said inventory of queries corresponds to said at least one CFG; wherein said queries are ordered during training of a corresponding natural language understanding (NLU) system to form a decision tree, the ordering of the queries being a statistical ordering based on a minimization of conditional entropy or maximization of a likelihood that a resulting model predicts data used to train the NLU system; wherein said at least one CFG is applied to said text input in said text buffer to determine substrings and corresponding parse trees; wherein said inventory of queries is further applied to said substrings to determine said particular grammatical phrase. - View Dependent Claims (17)
-
-
18. A system for recognizing grammatical phrases in a text input comprising:
-
a text buffer for storing said text input; at least one context-free grammar (CFG) for recognizing a particular grammatical phrase within said text input; and
,one or more weighted grammar-related features wherein each said feature corresponds to said at least one CFG; wherein each said feature is weighted during training of a corresponding natural language understanding (NLU) system, each weight being determined based on a maximum entropy model; wherein said at least one CFG is applied to said text input in said text buffer to determine substrings and corresponding parse trees; wherein said features are further applied to said substrings to determine said particular grammatical phrase. - View Dependent Claims (19)
-
-
20. A machine readable storage, having stored thereon a computer program having a plurality of code sections executable by a machine for causing the machine to perform the steps of:
-
receiving a text input; applying a first context free grammar (CFG) to said text input to determine substrings and corresponding parse trees, wherein said substrings and said corresponding parse trees further correspond to said first CFG; and
,examining each said possible substring and corresponding parse tree using an inventory of ordered grammar-related queries corresponding to said CFG, wherein said queries are ordered during training of the NLU system to form a decision tree, the ordering of the queries being a statistical ordering based on a minimization of conditional entropy or maximization of a likelihood that a resulting model predicts data used to train a corresponding NLU system. - View Dependent Claims (21, 22, 23)
-
-
24. In a natural language understanding (NLU) system, a method for including grammars in a statistical parser comprising:
-
receiving a text input; and
,applying a first context free grammar (CFG) to said text input to determine substrings and corresponding parse trees, wherein said substrings and said corresponding parse trees further correspond to said first CFG; and
,examining each said possible substring and corresponding parse tree using one or more weighted grammar-related features corresponding to said CFG, wherein each said feature is weighted during training of said NLU system, each weight being determined based on a maximum entropy model. - View Dependent Claims (25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
-
-
39. A machine readable storage, having stored thereon a computer program having a plurality of code sections executable by a machine for causing the machine to perform the steps of:
-
receiving a text input; applying a first context free grammar (CFG) to said text input to determine substrings and corresponding parse trees, wherein said substrings and said corresponding parse trees further correspond to said first CFG; and
,examining each said possible substring and corresponding parse tree using one or more weighted grammar-related features corresponding to said CFG, wherein each said feature is weighted during training of a corresponding natural language understanding (NLU) system, each weight being determined based on a maximum entropy model. - View Dependent Claims (40, 41, 42)
-
-
43. In a natural language understanding (NLU) system, a direct channel method for determining a meaning for a text input comprising:
-
selectably applying a reusable context free grammar (CFG) to a text input; identifying one or more substrings within said text input, each said substring corresponding to said reusable CFG; examining each said possible substring and corresponding parse tree using an inventory of ordered grammar-related queries and a set of features corresponding to said CFG, wherein said queries are ordered during training of the NLU system to form a decision tree, the ordering of the queries being a statistical ordering based on a minimization of conditional entropy or maximization of a likelihood that a resulting model predicts data used to train the NLU system; and
,determining a meaning for said text input based upon said identified substrings from possible meanings within said reusable CFG. - View Dependent Claims (44, 45, 46, 47)
-
-
48. A machine readable storage, having stored thereon a computer program having a plurality of code sections executable by a machine for causing the machine to perform the steps of:
-
selectably applying a reusable context free grammar (CFG) to a text input; identifying one or more substrings within said text input, said substrings corresponding to said reusable CFG; examining each said substring using at least one of an inventory of grammar-related queries and a set of features corresponding to said CFG, wherein said queries are ordered during training of a corresponding natural language understanding (NLU) system to form a decision tree, the ordering of the queries being a statistical ordering based on a minimization of conditional entropy or maximization of a likelihood that a resulting model predicts data used to train the NLU system; and
,determining a meaning for said text input based upon said identified substrings from possible meanings within said reusable CFG. - View Dependent Claims (49, 50, 51)
-
Specification