Learning parsing rules and argument identification from crowdsourcing of proposed command inputs
First Claim
Patent Images
1. A computer-implemented method performed by a data processing apparatus, comprising:
- providing, by the data processing apparatus to a plurality of user devices, a user interface that includes;
a command sentence selected from a log of command sentences;
the first set of non-terminal fields, each non-terminal field in the first set listing the non-terminal type and the terminal that parses to the non-terminal type;
a task definition describing a user task to classify each of a plurality of n-grams of the command sentence as belonging to one of;
the non-terminal types in the set of non-terminal types;
ornone of the non-terminal types in the set of non-terminal types;
receiving, by the data processing apparatus and from the plurality of user devices, user task response data classifying the n-grams of the command sentence, wherein for each non-terminal type at least a respective first set of n-grams are classified as belonging to the non-terminal type and at least a second set of n-grams are classified as belonging to none of the non-terminal types;
generating, by the data processing apparatus, command grammars for the action from the user task response data, each of the command grammars defining non-terminals of each of the non-terminal types and at least one terminal defining at least one of the second set of n-grams; and
persisting the command grammars to a command model.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems, methods and apparatus for learning parsing rules and argument identification from crowdsourcing of proposed command inputs are disclosed. Crowdsourcing techniques are used to generate rules for parsing input sentences. A parse is used to determine whether the input sentence invokes a specific action, and if so, what arguments are to be passed to the invocation of the action.
-
Citations
15 Claims
-
1. A computer-implemented method performed by a data processing apparatus, comprising:
-
providing, by the data processing apparatus to a plurality of user devices, a user interface that includes; a command sentence selected from a log of command sentences; the first set of non-terminal fields, each non-terminal field in the first set listing the non-terminal type and the terminal that parses to the non-terminal type; a task definition describing a user task to classify each of a plurality of n-grams of the command sentence as belonging to one of; the non-terminal types in the set of non-terminal types;
ornone of the non-terminal types in the set of non-terminal types; receiving, by the data processing apparatus and from the plurality of user devices, user task response data classifying the n-grams of the command sentence, wherein for each non-terminal type at least a respective first set of n-grams are classified as belonging to the non-terminal type and at least a second set of n-grams are classified as belonging to none of the non-terminal types; generating, by the data processing apparatus, command grammars for the action from the user task response data, each of the command grammars defining non-terminals of each of the non-terminal types and at least one terminal defining at least one of the second set of n-grams; and persisting the command grammars to a command model. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A non-transitory computer readable storage medium storing instructions executable by a data processing apparatus and that upon such execution causes the data processing apparatus to perform operations comprising:
-
providing, by the data processing apparatus to a plurality of user devices, a user interface that includes; a command sentence selected from a log of command sentences; the first set of non-terminal fields, each non-terminal field in the first set listing the non-terminal type and the terminal that parses to the non-terminal type; a task definition describing a user task to classify each of a plurality of n-grams of the command sentence as belonging to one of; the non-terminal types in the set of non-terminal types;
ornone of the non-terminal types in the set of non-terminal types; receiving, by the data processing apparatus and from the plurality of user devices, user task response data classifying the n-grams of the command sentence, wherein for each non-terminal type at least a respective first set of n-grams are classified as belonging to the non-terminal type and at least a second set of n-grams are classified as belonging to none of the non-terminal types; generating, by the data processing apparatus, command grammars for the action from the user task response data, each of the command grammars defining non-terminals of each of the non-terminal types and at least one terminal defining at least one of the second set of n-grams; and persisting the command grammars to a command model. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A system, comprising:
-
a data processing apparatus; and a non-transitory computer readable storage medium in data communication with the data processing apparatus storing instructions executable by the data processing apparatus and that upon such execution causes the data processing apparatus to perform operations comprising;
providing, by the data processing apparatus to a plurality of user devices, a user interface that includes;a command sentence selected from a log of command sentences; the first set of non-terminal fields, each non-terminal field in the first set listing the non-terminal type and the terminal that parses to the non-terminal type; a task definition describing a user task to classify each of a plurality of n-grams of the command sentence as belonging to one of; the non-terminal types in the set of non-terminal types;
ornone of the non-terminal types in the set of non-terminal types; receiving, by the data processing apparatus and from the plurality of user devices, user task response data classifying the n-grams of the command sentence, wherein for each non-terminal type at least a respective first set of n-grams are classified as belonging to the non-terminal type and at least a second set of n-grams are classified as belonging to none of the non-terminal types; generating, by the data processing apparatus, command grammars for the action from the user task response data, each of the command grammars defining non-terminals of each of the non-terminal types and at least one terminal defining at least one of the second set of n-grams; and persisting the command grammars to a command model. - View Dependent Claims (12, 13, 14, 15)
-
Specification