TECHNIQUES FOR COMBINING HUMAN AND MACHINE LEARNING IN NATURAL LANGUAGE PROCESSING
First Claim
1. A method for generating a natural language model, the method comprising:
- receiving more than one annotation of a document;
calculating a level of agreement among the received annotations;
determining that a criterion among a first criterion, a second criterion, and a third criterion is satisfied based at least in part on the level of agreement;
determining an aggregated annotation representing an aggregation of information in the received annotations and training a natural language model using the aggregated annotation, when the first criterion is satisfied;
generating at least one human readable prompt configured to receive additional annotations of the document, when the second criterion is satisfied; and
discarding the received annotations from use in training the natural language model, when the third criterion is satisfied.
10 Assignments
0 Petitions
Accused Products
Abstract
Methods, apparatuses and computer readable medium are presented for generating a natural language model. A method for generating a natural language model comprises: receiving more than one annotation of a document; calculating a level of agreement among the received annotations; determining that a criterion among a first criterion, a second criterion, and a third criterion is satisfied based at least in part on the level of agreement; determining an aggregated annotation representing an aggregation of information in the received annotations and training a natural language model using the aggregated annotation, when the first criterion is satisfied; generating at least one human readable prompt configured to receive additional annotations of the document, when the second criterion is satisfied; and discarding the received annotations from use in training the natural language model, when the third criterion is satisfied.
5 Citations
20 Claims
-
1. A method for generating a natural language model, the method comprising:
-
receiving more than one annotation of a document; calculating a level of agreement among the received annotations; determining that a criterion among a first criterion, a second criterion, and a third criterion is satisfied based at least in part on the level of agreement; determining an aggregated annotation representing an aggregation of information in the received annotations and training a natural language model using the aggregated annotation, when the first criterion is satisfied; generating at least one human readable prompt configured to receive additional annotations of the document, when the second criterion is satisfied; and discarding the received annotations from use in training the natural language model, when the third criterion is satisfied. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. An apparatus for generating a natural language model, the apparatus comprising one or more processors configured to:
-
receive more than one annotation of a document; calculate a level of agreement among the received annotations; determine that a criterion among a first criterion, a second criterion, and a third criterion is satisfied based at least in part on the level of agreement; determine an aggregated annotation representing an aggregation of information in the received annotations and train a natural language model using the aggregated annotation, when the first criterion is satisfied; generate at least one human readable prompt configured to receive additional annotations of the document, when a second criterion is satisfied; and discard the received annotations from use in training the natural language model, when the third criterion is satisfied. - View Dependent Claims (17, 18, 19)
the level of agreement comprises a difference between the highest numerical value and the lowest numerical value among the selected categories; the first criterion is satisfied when the difference is no more than a threshold value; and the third criterion is satisfied when the difference is more than a threshold value.
-
-
20. A non-transitory computer readable medium comprising instructions that, when executed by a processor, cause the processor to:
-
receive more than one annotation of a document; calculate a level of agreement among the received annotations; determine that a criterion among a first criterion, a second criterion, and a third criterion is satisfied based at least in part on the level of agreement; determine an aggregated annotation representing an aggregation of information in the received annotations and train a natural language model using the aggregated annotation, when the first criterion is satisfied; generate at least one human readable prompt configured to receive additional annotations of the document, when a second criterion is satisfied; and discard the received annotations from use in training the natural language model, when the third criterion is satisfied.
-
Specification