SEMANTICALLY-DRIVEN EXTRACTION OF RELATIONS BETWEEN NAMED ENTITIES
First Claim
1. A method of developing rules for text processing comprising:
identifying a semantic relation involving at least two named entities, each named entity being represented in the semantic relation by its class;
providing a set of example instances of combinations of named entities which satisfy the semantic relation;
retrieving text strings from text, each of the retrieved strings including at least two of the named entities of one of the example instances in the set;
extracting, from the retrieved text strings, syntactic patterns, each syntactic pattern involving at least two of the named entities of one of the example instances in the respective retrieved text string; and
generating generalized rules for text processing based on the extracted syntactic patterns, the rules being of a form for identifying candidate instances of the semantic relation in text.
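The retrieving and extracting steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the claim recites syntactic patterns, which would normally come from a parser, whereas this sketch substitutes plain surface strings with class placeholders. The function names `retrieve_strings` and `extract_pattern`, the seed instance, and the two-sentence corpus are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List

# An example instance maps each named-entity class to an attested entity.
# The classes (EVENT, DATE, PLACE) follow the relation named in the abstract;
# the concrete values are illustrative assumptions.
Instance = Dict[str, str]


def retrieve_strings(sentences: List[str], instance: Instance) -> List[str]:
    """Keep sentences that mention at least two entities of the instance."""
    return [
        s for s in sentences
        if sum(entity in s for entity in instance.values()) >= 2
    ]


def extract_pattern(sentence: str, instance: Instance) -> str:
    """Replace each attested entity with its class label.

    This is a surface-string stand-in for a syntactic pattern.
    """
    pattern = sentence
    # Replace longer entities first so a short one cannot clobber a longer one.
    for cls, entity in sorted(instance.items(), key=lambda kv: -len(kv[1])):
        pattern = pattern.replace(entity, f"<{cls}>")
    return pattern


seed = {"EVENT": "Olympic Games", "DATE": "2004", "PLACE": "Athens"}
corpus = [
    "The Olympic Games were held in Athens in 2004.",
    "Athens is the capital of Greece.",  # only one seed entity: not retrieved
]

strings = retrieve_strings(corpus, seed)
patterns = [extract_pattern(s, seed) for s in strings]
print(patterns)  # ['The <EVENT> were held in <PLACE> in <DATE>.']
```

Only the first sentence is retrieved, because the second mentions a single entity of the seed instance and the claim requires at least two.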
Abstract
A system and method of developing rules for text processing enable retrieval of instances of named entities in a predetermined semantic relation (such as the DATE and PLACE of an EVENT) by extracting patterns from text strings in which attested examples of named entities satisfying the semantic relation occur. The patterns are generalized to form rules which can be added to the existing rules of a syntactic parser and subsequently applied to text to find candidate instances of other named entities in the predetermined semantic relation.
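The generalization and application steps described in the abstract can be sketched as follows. This is a simplified illustration under stated assumptions, not the method of the patent: the abstract describes rules added to a syntactic parser, while the sketch swaps in regular expressions. The `CLASS_REGEX` recognizers, the `generalize` function, and the sample pattern and sentence are hypothetical.

```python
import re

# Hypothetical recognizers for each named-entity class (assumptions, not
# taken from the patent): capitalized word sequences and four-digit years.
CLASS_REGEX = {
    "EVENT": r"[A-Z]\w*(?: [A-Z]\w*)*",
    "PLACE": r"[A-Z]\w*(?: [A-Z]\w*)*",
    "DATE": r"\d{4}",
}


def generalize(pattern: str) -> re.Pattern:
    """Turn a pattern with <CLASS> placeholders into a compiled rule.

    Each class may appear at most once, because it becomes a named group.
    """
    # re.split with one capture group alternates literal text and class names.
    parts = re.split(r"<(\w+)>", pattern)
    regex = "".join(
        f"(?P<{part}>{CLASS_REGEX[part]})" if i % 2 else re.escape(part)
        for i, part in enumerate(parts)
    )
    return re.compile(regex)


# A pattern as it might be extracted from an attested example.
rule = generalize("The <EVENT> took place in <PLACE> in <DATE>.")

# Apply the rule to unseen text to find a candidate instance of the relation.
match = rule.search("The World Cup took place in Germany in 2006.")
candidate = match.groupdict() if match else None
print(candidate)  # {'EVENT': 'World Cup', 'PLACE': 'Germany', 'DATE': '2006'}
```

The result is only a candidate instance: a rule of this kind matches on form alone, so its output would still need to be checked.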
110 Citations
20 Claims
1. A method of developing rules for text processing comprising:
identifying a semantic relation involving at least two named entities, each named entity being represented in the semantic relation by its class;
providing a set of example instances of combinations of named entities which satisfy the semantic relation;
retrieving text strings from text, each of the retrieved strings including at least two of the named entities of one of the example instances in the set;
extracting, from the retrieved text strings, syntactic patterns, each syntactic pattern involving at least two of the named entities of one of the example instances in the respective retrieved text string; and
generating generalized rules for text processing based on the extracted syntactic patterns, the rules being of a form for identifying candidate instances of the semantic relation in text.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 19
18. A system for developing rules for text processing comprising:
memory for storing a set of example instances of combinations of named entities which satisfy a semantic relation involving at least two named entities, wherein each named entity is represented in the semantic relation by its class;
a text retriever for retrieving text strings from text, each of the retrieved strings including at least two of the named entities of one of the example instances in the set;
a pattern extractor which extracts, from the retrieved text strings, syntactic patterns, each syntactic pattern involving at least two of the named entities of one of the example instances in the respective retrieved text string; and
a rules generator which generates generalized rules for text processing based on the extracted syntactic patterns.
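One way to picture how the four components of the system claim fit together is the Python sketch below. It is a hypothetical wiring only: the class `RuleDevelopmentSystem`, its field names, and the toy component functions are assumptions, and the "rules generator" here merely deduplicates patterns rather than performing the generalization the claim describes.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Instance = Dict[str, str]  # class label -> attested entity string


@dataclass
class RuleDevelopmentSystem:
    """Hypothetical composition of the claimed components."""
    text_retriever: Callable[[List[str], Instance], List[str]]
    pattern_extractor: Callable[[str, Instance], str]
    rules_generator: Callable[[List[str]], List[str]]
    memory: List[Instance] = field(default_factory=list)  # example instances

    def develop_rules(self, text: List[str]) -> List[str]:
        patterns: List[str] = []
        for instance in self.memory:
            for string in self.text_retriever(text, instance):
                patterns.append(self.pattern_extractor(string, instance))
        return self.rules_generator(patterns)


# Toy components (assumptions for illustration only).
def retrieve(text: List[str], instance: Instance) -> List[str]:
    # Keep strings containing at least two entities of the instance.
    return [s for s in text if sum(e in s for e in instance.values()) >= 2]


def extract(string: str, instance: Instance) -> str:
    # Replace each entity with its class label.
    for cls, entity in instance.items():
        string = string.replace(entity, cls)
    return string


def generate(patterns: List[str]) -> List[str]:
    # Trivial stand-in for generalization: drop duplicate patterns.
    return sorted(set(patterns))


system = RuleDevelopmentSystem(
    retrieve, extract, generate,
    memory=[
        {"DATE": "1815", "PLACE": "Waterloo"},
        {"DATE": "1066", "PLACE": "Hastings"},
    ],
)
rules = system.develop_rules([
    "The battle was fought at Waterloo in 1815.",
    "The battle was fought at Hastings in 1066.",
])
print(rules)  # ['The battle was fought at PLACE in DATE.']
```

Passing the components in as callables keeps each one replaceable, for example by a parser-based pattern extractor, without changing the surrounding loop.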
20. A method of identifying candidate instances of at least one of a date of an event and a place of an event, comprising:
identifying attested instances of combinations of named entities which are the date and place of an event;
retrieving text strings from text, each of the retrieved strings including at least two of the named entities of one of the attested instances;
extracting, from the retrieved text strings, syntactic patterns, each syntactic pattern involving at least two of the named entities of one of the attested instances in the respective retrieved text string;
generating generalized rules for text processing based on the extracted syntactic patterns; and
applying the rules to text to identify candidate instances of at least one of the date and the place of an event.
Specification