Anaphora resolution for medical text with machine learning and relevance feedback
First Claim
1. A method for directing a computer processor to resolve an anaphor in electronic natural language text, comprising:
- detecting a plurality of entities and an anaphor in a span of parsed natural language text comprising one or more sentences;
extracting pairs of related entities among the plurality of entities, based on domain knowledge;
constructing a set of tuples, wherein each tuple is a data type comprising an anaphor, an antecedent entity (AE) appearing before the anaphor in the span of parsed natural language text, and an entity (E) appearing after the anaphor in the span of parsed natural language text, wherein the anaphor refers to the AE and relates the AE to the E, and wherein the constructing is based on initial training data and the extracted pairs of related entities; and
resolving the anaphor by determining which entity in the plurality of entities the anaphor references, using the constructed set of tuples, and selecting an AE among one or more candidate AEs.
2 Assignments
0 Petitions
Accused Products
Abstract
The program directs a computer processor to resolve an anaphor in electronic natural language text. The program detects a plurality of entities and an anaphor in a span of parsed natural language text comprising one or more sentences, and extracts pairs of related entities based on domain knowledge. The program constructs a set of tuples, wherein each tuple is a data type comprising an anaphor, an antecedent entity (AE) appearing before the anaphor in the span of parsed natural language text, and an entity (E) appearing after the anaphor in the span of parsed natural language text, wherein the anaphor refers to the AE and relates the AE to the E. The program resolves the anaphor by determining which entity in the plurality of entities the anaphor references, using the constructed set of tuples, and selecting an AE among one or more candidate AEs.
-
Citations
20 Claims
-
1. A method for directing a computer processor to resolve an anaphor in electronic natural language text, comprising:
-
detecting a plurality of entities and an anaphor in a span of parsed natural language text comprising one or more sentences; extracting pairs of related entities among the plurality of entities, based on domain knowledge; constructing a set of tuples, wherein each tuple is a data type comprising an anaphor, an antecedent entity (AE) appearing before the anaphor in the span of parsed natural language text, and an entity (E) appearing after the anaphor in the span of parsed natural language text, wherein the anaphor refers to the AE and relates the AE to the E, and wherein the constructing is based on initial training data and the extracted pairs of related entities; and resolving the anaphor by determining which entity in the plurality of entities the anaphor references, using the constructed set of tuples, and selecting an AE among one or more candidate AEs. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A computer program product for directing a computer processor to resolve an anaphor in electronic natural language text, comprising a non-transitory tangible storage device having program code embodied therewith, the program code executable by a processor of a computer to perform a method, the method comprising:
-
detecting, by the processor, a plurality of entities and an anaphor in a span of parsed natural language text comprising one or more sentences; extracting, by the processor, pairs of related entities among the plurality of entities, based on domain knowledge; constructing, by the processor, a set of tuples, wherein each tuple is a data type comprising an anaphor, an antecedent entity (AE) appearing before the anaphor in the span of parsed natural language text, and an entity (E) appearing after the anaphor in the span of parsed natural language text, wherein the anaphor refers to the AE and relates the AE to the E, and wherein the constructing is based on initial training data and the extracted pairs of related entities; and resolving, by the processor, the anaphor by determining which entity in the plurality of entities the anaphor references, using the constructed set of tuples, and selecting an AE among one or more candidate AEs. - View Dependent Claims (13, 14, 15, 16)
-
-
17. A computer system for performing electronic natural language processing on unstructured data, comprising:
-
one or more computer devices each having one or more processors and one or more tangible storage devices; and a program embodied on at least one of the one or more storage devices, the program having a plurality of program instructions for execution by the one or more processors, the program instructions comprising instructions for; detecting, by the computer, a plurality of entities and an anaphor in a span of parsed natural language text comprising one or more sentences; extracting, by the computer, pairs of related entities among the plurality of entities, based on domain knowledge; constructing, by the computer, a set of tuples, wherein each tuple is a data type comprising an anaphor, an antecedent entity (AE) appearing before the anaphor in the span of parsed natural language text, and an entity (E) appearing after the anaphor in the span of parsed natural language text, wherein the anaphor refers to the AE and relates the AE to the E, and wherein the constructing is based on initial training data and the extracted pairs of related entities; and resolving, by the computer, the anaphor by determining which entity in the plurality of entities the anaphor references, using the constructed set of tuples, and selecting an AE among one or more candidate AEs. - View Dependent Claims (18, 19, 20)
-
Specification