METHOD AND SYSTEM FOR EXTRACTING AND VISUALIZING GRAPH-STRUCTURED RELATIONS FROM UNSTRUCTURED TEXT
First Claim
1. A method for automatically extracting and mining relations and related entities from unstructured text, comprising:
- receiving a query specifying a main entity; and
extracting from unstructured text relations and related entities related to the main entity specified in the query, the extracting further comprising;
searching and selecting in the unstructured text, documents containing the main entity;
attaching to each word of the selected documents, at least one tag, each tag being of a different type;
extracting relations and related entities by applying patterns to the tagged documents;
extracting from the selected documents features characterizing each entity and relation; and
building a graph based on the extracted features, whose nodes represent the entities related to the specified main entity and whose edges represent the relations between the entities.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention is directed to a system, method and computer program for automatically extracting and mining relations and related entities from unstructured text. A method in accordance with an embodiment of the invention includes: extracting relations and related entities from unstructured text data, representing the extracted information into a graph, and manipulating the resulting graph to gain more insight into the information it contains. The extraction of relations and related entities is performed first by automatically inducting pattern and second by applying these induced patterns to unstructured text data. For each relation and entity, several features are extracted in order to build a graph whose nodes are entities and edges are relations.
204 Citations
19 Claims
-
1. A method for automatically extracting and mining relations and related entities from unstructured text, comprising:
-
receiving a query specifying a main entity; and
extracting from unstructured text relations and related entities related to the main entity specified in the query, the extracting further comprising;
searching and selecting in the unstructured text, documents containing the main entity;
attaching to each word of the selected documents, at least one tag, each tag being of a different type;
extracting relations and related entities by applying patterns to the tagged documents;
extracting from the selected documents features characterizing each entity and relation; and
building a graph based on the extracted features, whose nodes represent the entities related to the specified main entity and whose edges represent the relations between the entities. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A system for automatically extracting and mining relations and related entities from unstructured text, comprising:
-
a system for receiving a query specifying a main entity; and
a system for extracting from unstructured text relations and related entities related to the main entity specified in the query, the system for extracting further comprising;
a system for searching and selecting in the unstructured text, documents containing the main entity;
a system for attaching to each word of the selected documents, at least one tag, each tag being of a different type;
a system for extracting relations and related entities by applying patterns to the tagged documents;
a system for extracting from the selected documents features characterizing each entity and relation; and
a system for building a graph based on the extracted features, whose nodes represent the entities related to the specified main entity and whose edges represent the relations between the entities.
-
-
19. A computer program stored on a computer readable medium for automatically extracting and mining relations and related entities from unstructured text, when the computer program is executed on a computer, the computer program comprising program code for:
-
receiving a query specifying a main entity; and
extracting from unstructured text relations and related entities related to the main entity specified in the query, the extracting further comprising;
searching and selecting in the unstructured text, documents containing the main entity;
attaching to each word of the selected documents, at least one tag, each tag being of a different type;
extracting relations and related entities by applying patterns to the tagged documents;
extracting from the selected documents features characterizing each entity and relation; and
building a graph based on the extracted features, whose nodes represent the entities related to the specified main entity and whose edges represent the relations between the entities.
-
Specification