SYSTEMS AND METHODS FOR AUTOMATIC SEMANTIC ROLE LABELING OF HIGH MORPHOLOGICAL TEXT FOR NATURAL LANGUAGE PROCESSING APPLICATIONS
First Claim
1. A method for processing natural language text, comprising:
- receiving as input a natural language text sentence comprising a sequence of white-space delimited words including inflicted words that are formed of morphemes including a stem and one or more affixes;
identifying a target verb as a stem of an inflicted word in the text sentence;
grouping morphemes from one or more inflicted words with the same syntactic role into constituents; and
predicting a semantic role of a constituent for the target verb.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods are provided for automated semantic role labeling for languages having complex morphology. In one aspect, a method for processing natural language text includes receiving as input a natural language text sentence comprising a sequence of white-space delimited words including inflicted words that are formed of morphemes including a stem and one or more affixes, identifying a target verb as a stem of an inflicted word in the text sentence, grouping morphemes from one or more inflicted words with the same syntactic role into constituents, and predicting a semantic role of a constituent for the target verb.
208 Citations
22 Claims
-
1. A method for processing natural language text, comprising:
-
receiving as input a natural language text sentence comprising a sequence of white-space delimited words including inflicted words that are formed of morphemes including a stem and one or more affixes; identifying a target verb as a stem of an inflicted word in the text sentence; grouping morphemes from one or more inflicted words with the same syntactic role into constituents; and predicting a semantic role of a constituent for the target verb. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for processing natural language text, the method steps comprising:
-
receiving as input a natural language text sentence comprising a sequence of white-space delimited words including inflicted words that are formed of morphemes including a stem and one or more affixes; identifying a target verb as a stem of an inflicted word in the text sentence; grouping morphemes from one or more inflicted words with the same syntactic role into constituents; and predicting a semantic role of a constituent for the target verb. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method for processing natural language text, comprising:
-
receiving as input a natural language text sentence comprising a sequence of white-space delimited words including at least one inflicted word comprising a stem and one or more affixes; automatically segmenting the white-space delimited words into separate morphemes including prefixes, stems and suffixe; automatically grouping morphemes into constituents and identifying morphemes that are target verbs; and automatically predicting a semantic role of a constituent for a target verb using a trained statistical model.
-
-
22. A method for processing natural language text, comprising:
-
receiving as input a natural language text sentence comprising a sequence of white-space delimited words including at least one inflicted word comprising a stem and one or more affixes; automatically performing a morphological analysis on the text sentence as a sequence of characters to extract morphological information; automatically detecing stems of inflicted words that are target verbs and grouping stems and affixes of different words into constituent, using the extracted morphological information; and automatically predicting a semantic role of each constituent for a target verb using a trained statistical model using a plurality of feature data including morphological features extracted during morphological analysis.
-
Specification