Method and system for discovering knowledge from text documents
First Claim
1. A method for discovering knowledge from text documents, the method comprising the steps of:
- extracting from text documents semi-structured meta-data, wherein the semi-structured meta-data includes a plurality of entities and a plurality of relations between the entities;
identifying from the semi-structured meta-data a plurality of key entities and a corresponding plurality of key relations;
deriving from a domain knowledge base a plurality of attributes relating to each of the plurality of entities relating to one of the plurality of key entities for forming a plurality of pairs of key entity and a plurality of attributes related thereto;
formulating a plurality of patterns, each of the plurality of patterns relating to one of the plurality of pairs of key entity and a plurality of attributes related thereto;
analyzing the plurality of patterns using an associative discoverer; and
interpreting the output of the associative discoverer for discovering knowledge.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and a system for discovering knowledge from text documents are disclosed, which involve extracting from text documents semi-structured meta-data, wherein the semi-structured meta-data includes a plurality of entities and a plurality of relations between the entities; identifying from the semi-structured meta-data a plurality of key entities and a corresponding plurality of key relations; deriving from a domain knowledge base a plurality of attributes relating to each of the plurality of entities relating to one of the plurality of key entities for forming a plurality of pairs of key entity and a plurality of attributes related thereto; formulating a plurality of patterns, each of the plurality of patterns relating to one of the plurality of pairs of key entity and a plurality of attributes related thereto; analyzing the plurality of patterns using an associative discoverer; and interpreting the output of the associative discoverer for discovering knowledge.
-
Citations
38 Claims
-
1. A method for discovering knowledge from text documents, the method comprising the steps of:
-
extracting from text documents semi-structured meta-data, wherein the semi-structured meta-data includes a plurality of entities and a plurality of relations between the entities;
identifying from the semi-structured meta-data a plurality of key entities and a corresponding plurality of key relations;
deriving from a domain knowledge base a plurality of attributes relating to each of the plurality of entities relating to one of the plurality of key entities for forming a plurality of pairs of key entity and a plurality of attributes related thereto;
formulating a plurality of patterns, each of the plurality of patterns relating to one of the plurality of pairs of key entity and a plurality of attributes related thereto;
analyzing the plurality of patterns using an associative discoverer; and
interpreting the output of the associative discoverer for discovering knowledge. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A computer program product comprising a computer usable medium having computer readable program code means embodied in the medium for discovering knowledge from text documents, the computer program product comprising:
-
computer readable program code means for extracting from text documents semi-structured meta-data, wherein the semi-structured meta-data includes a plurality of entities and a plurality of relations between the entities;
computer readable program code means for identifying from the semi-structured meta-data a plurality of key entities and a corresponding plurality of key relations;
computer readable program code means for deriving from a domain knowledge base a plurality of attributes relating to each of the plurality of entities relating to one of the plurality of key entities for forming a plurality of pairs of key entity and a plurality of attributes related thereto;
computer readable program code means for formulating a plurality of patterns, each of the plurality of patterns relating to one of the plurality of pairs of key entity and a plurality of attributes related thereto;
computer readable program code means for analyzing the plurality of patterns using an associative discoverer; and
computer readable program code means for interpreting the output of the associative discoverer for discovering knowledge. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A system for knowledge discovery from free-text documents, comprising:
-
means for extracting semi-structured meta-data from the free-text documents;
means for identifying key entities and key relations from the semi-structured meta-data;
a knowledge base that defines the attributes of entities;
means for formulating patterns based on the key entities and the attributes of entities related to the key entities; and
means for analyzing the patterns for knowledge. - View Dependent Claims (26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
-
Specification