DETECTING RELATIONSHIPS IN UNSTRUCTURED TEXT
First Claim
1. A computer implemented method of detecting a relationship between a first entity and a second entity, said method comprising:
- creating a text pattern that represents a type of relationship, wherein said text pattern comprises a first slot for said first entity and a second slot for said second entity;
analyzing a text-based document so as to locate said text pattern within said document;
determining a location for each proper name occurring within said document; and
extracting proper names located within said first slot and said second slot of said text pattern within said document, wherein said proper names located within said first slot and said second slot identify said first entity and said second entity.
0 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are embodiments of a system and a method for detecting relationships described in unstructured text-based electronic documents. The system and method incorporate the use of an input file that contains one or more text patterns that represent particular relationships. The text patterns each include regular text expressions that describe the particular relationship and slots for the location of each entity in that relationship. Document(s) are selected by a user and scanned by a proper noun tagger that identifies and tags every occurrence of proper names within the document(s). Then, a pattern matcher scans the document(s) to match text patterns. If a text pattern is matched within a document a relationship detector extracts all pairs of proper names found in the slots for each matched text pattern. The output from the relationship detector includes the names for each entity in the relationship, the type of relationship, and the identity of the document and the location of the sentence describing the relationship in the document.
-
Citations
20 Claims
-
1. A computer implemented method of detecting a relationship between a first entity and a second entity, said method comprising:
-
creating a text pattern that represents a type of relationship, wherein said text pattern comprises a first slot for said first entity and a second slot for said second entity; analyzing a text-based document so as to locate said text pattern within said document; determining a location for each proper name occurring within said document; and extracting proper names located within said first slot and said second slot of said text pattern within said document, wherein said proper names located within said first slot and said second slot identify said first entity and said second entity. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for detecting a relationship between a first entity and a second entity, said system comprising:
-
an input file adapted to store a text pattern that describes a type of relationship, wherein said text pattern comprises a first slot for said first entity and a second slot for said second entity; a pattern matcher in communication with said input file and adapted to analyze a text-based document so as to locate said text pattern within said document; a proper noun tagger adapted to locate and record occurrences of proper names within said document; and a relationship detector in communication with said pattern matcher and said proper noun tagger and adapted to extract said proper names located within said first slot and said second slot of said text pattern within said document so as to identify said first entity and said second entity and, thereby, detect said relationship. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A program storage device readable by computer and tangibly embodying a program of instructions executable by said computer to perform a method of detecting a relationship between a first entity and a second entity, said method comprising:
-
creating a text pattern that represents a type of relationship, wherein said text pattern comprises a first slot for said first entity and a second slot for said second entity; analyzing a text-based document so as to locate said text pattern within said document; determining a location for each proper name occurring within said document; and extracting proper names located within said first slot and said second slot of said text pattern within said document, wherein said proper names located within said first slot and said second slot identify said first entity and said second entity - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification