SYSTEM AND METHODS FOR GENERATING TREEBANKS FOR NATURAL LANGUAGE PROCESSING BY MODIFYING PARSER OPERATION THROUGH INTRODUCTION OF CONSTRAINTS ON PARSE TREE STRUCTURE
Abstract
Systems, apparatuses, and methods for generating a parser training set, and ultimately a correct treebank, for a corpus of text by using an existing parser that was trained on a different corpus. Also disclosed are systems, apparatuses, and methods for improving the operation of a parser when the available training data is less familiar than the data typically used to train conventional parsers. These techniques can be used to generate a more effective and accurate parser for a new corpus (and hence more accurate parse trees) with significantly less effort than would be required if it were necessary to generate a standard-size training set.
22 Citations
19 Claims
1. A method for modifying the operation of a parser, comprising:
receiving data representing an input sentence;
generating a display of a structure representing the input sentence based on a specific parsing process;
receiving one or more inputs representing changes to the displayed structure;
generating a corrected structure representing the input sentence based on the specific parsing process as modified by the received inputs; and
training a parser to reliably learn a parsing process based on the specific parsing process as modified by the one or more received inputs.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. An apparatus, comprising:
an electronic data processing element;
a set of instructions stored on a non-transient medium and executable by the electronic data processing element, which when executed cause the apparatus to
receive data representing an input sentence;
generate a display of a structure representing the input sentence based on a specific parsing process;
receive one or more inputs representing changes to the displayed structure;
generate a corrected structure representing the input sentence based on the specific parsing process as modified by the received inputs; and
train a parser to reliably learn a parsing process based on the specific parsing process as modified by the one or more received inputs.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. A system comprising:
a data storage element containing data representing one or more sentences or strings of characters;
an electronic data processing element;
a set of instructions stored on a non-transient medium and executable by the electronic data processing element, which when executed cause the system to
generate a visual display of a structure representing the result of parsing one of the sentences or strings of characters using a first parsing process;
receive one or more inputs representing changes to the displayed structure;
generate a visual display of a corrected structure, the corrected structure representing the result of parsing the sentence using the first parsing process as modified by the received inputs; and
train a parser to execute a second parsing process, the second parsing process being based on the first parsing process as modified by the one or more received inputs.
Dependent claims: 18, 19
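As an illustration only, the sequence of steps recited in claim 1 (parse, display, accept a correction, re-parse under that correction, retrain) can be sketched in a few lines of Python. Everything in this sketch is an assumption made for the example and is not taken from the patent: the `ToyParser` class, the `show` helper, the head-selection scoring, the single-root rule used to propagate a constraint, and the perceptron-style update are all hypothetical stand-ins for whatever parsing process an actual embodiment uses. The toy parser also does not guarantee a well-formed tree.

```python
from collections import defaultdict

ROOT = -1  # head index that marks the root of the sentence


class ToyParser:
    """Hypothetical head-selection parser; not the patent's parsing process."""

    def __init__(self):
        # weights[(head_word, dependent_word)] -> score learned from corrected trees
        self.weights = defaultdict(float)

    def _head_word(self, words, head):
        return "<ROOT>" if head == ROOT else words[head]

    def _score(self, words, head, dep):
        return self.weights[(self._head_word(words, head), words[dep])]

    def parse(self, words, constraints=None):
        """Return heads[i], the head index of words[i].

        constraints maps a dependent index to the head the annotator chose.
        If a constraint already names the root, no other word may attach to
        ROOT, so a single correction can change the rest of the structure.
        """
        constraints = constraints or {}
        root_fixed = ROOT in constraints.values()
        heads = []
        for dep in range(len(words)):
            if dep in constraints:
                heads.append(constraints[dep])
                continue
            candidates = [h for h in range(len(words)) if h != dep]
            if not root_fixed:
                candidates.insert(0, ROOT)
            heads.append(max(candidates, key=lambda h: self._score(words, h, dep)))
        return heads

    def train(self, treebank, epochs=5):
        """Perceptron-style update: reward corrected arcs, penalise wrong guesses."""
        for _ in range(epochs):
            for words, gold in treebank:
                predicted = self.parse(words)
                for dep, (p, g) in enumerate(zip(predicted, gold)):
                    if p != g:
                        self.weights[(self._head_word(words, g), words[dep])] += 1.0
                        self.weights[(self._head_word(words, p), words[dep])] -= 1.0


def show(words, heads):
    """Plain-text stand-in for the displayed structure: one 'head -> word' line per word."""
    return "\n".join(
        f"{'ROOT' if h == ROOT else words[h]} -> {w}" for w, h in zip(words, heads)
    )


parser = ToyParser()
words = ["dogs", "bark"]
draft = parser.parse(words)              # untrained guess shown to the annotator
fixed = parser.parse(words, {1: ROOT})   # annotator marks "bark" as the root
parser.train([(words, fixed)])           # corrected structure becomes training data
relearned = parser.parse(words)          # parsed again with no constraints supplied
print(show(words, relearned))
```

In this sketch the annotator supplies one correction (the root), the constrained re-parse fills in the remaining arc, and the retrained parser then reproduces the corrected structure without any constraint. That is the labour-saving effect the abstract describes, shown here on a two-word sentence only.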
Specification