System and method for data extraction and management in multi-relational ontology creation
Abstract
The invention relates to a system and method for data extraction and management in multi-relational ontology creation. The system of the invention includes selecting a corpus of documents containing information relevant to a targeted knowledge domain, extracting assertions and their constituent concepts and relationships from the corpus, and storing the assertions, wherein the extraction processes may utilize rules and natural language processing.
26 Claims
1. A computer-implemented system for extracting data from one or more data sources for the creation of one or more multi-relational ontologies, comprising:
an upper ontology that specifies, for a specific domain, a set of concept types and relationship types, a hierarchy of concept types and relationship types, a set of specific pairs of concept types, and a set of permissible relationship types that may be used to connect specific pairs of concept types;
a plurality of data sources;
means for selecting a corpus of documents from the plurality of data sources, at least one of the documents being related to the specific domain;
a set of rules relating to the creation of assertions, wherein assertions comprise a first concept, a second concept, and a relationship between the first concept and the second concept;
an extraction module for:
(i) extracting from the corpus of documents, in accordance with the rules, concepts and relationships between concepts to form rules-based assertions; and
(ii) associating evidence information with each of the rules-based assertions; and
means for storing the rules-based assertions and evidence information in one or more databases.
Dependent claims: 2-25.
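For illustration only, the upper ontology recited in claim 1 might be modeled roughly as follows. This is a minimal sketch and not the patented implementation: the class name `UpperOntology`, its fields, the example concept and relationship types, and the choice to let a permitted relationship apply to subtypes through the hierarchy are all assumptions introduced here, and the hierarchy is assumed to be acyclic.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class UpperOntology:
    # Hypothetical layout: concept type -> parent concept type (None at the root).
    concept_parents: dict[str, Optional[str]]
    # Hypothetical layout: (first type, second type) -> permissible relationship types.
    permitted: dict[tuple[str, str], set[str]]

    def ancestors(self, concept_type: str):
        """Yield the type itself, then each parent up the hierarchy."""
        current: Optional[str] = concept_type
        while current is not None:
            yield current
            current = self.concept_parents.get(current)

    def allows(self, first_type: str, relationship: str, second_type: str) -> bool:
        """Check whether the relationship may connect this pair of concept types.

        Assumption: a permission granted to a parent pair also covers subtypes.
        """
        for a in self.ancestors(first_type):
            for b in self.ancestors(second_type):
                if relationship in self.permitted.get((a, b), set()):
                    return True
        return False


# Illustrative domain data (invented for this sketch).
ontology = UpperOntology(
    concept_parents={
        "compound": None,
        "drug": "compound",
        "protein": None,
        "kinase": "protein",
    },
    permitted={("compound", "protein"): {"inhibits", "binds"}},
)

drug_inhibits_kinase = ontology.allows("drug", "inhibits", "kinase")
protein_inhibits_compound = ontology.allows("protein", "inhibits", "compound")
```

Under these assumptions, an assertion whose concept types and relationship type fail `allows` could be rejected or flagged before it is stored.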
26. A computer-implemented method for extracting data from one or more data sources for the creation of one or more multi-relational ontologies, comprising:
providing an upper ontology that specifies, for a specific domain, a set of concept types and relationship types, a hierarchy of concept types and relationship types, a set of specific pairs of concept types, and a set of permissible relationship types that may be used to connect specific pairs of concept types;
providing a plurality of data sources;
selecting a corpus of documents from the plurality of data sources, at least one of the documents being related to the specific domain;
providing a set of rules relating to the creation of assertions, wherein assertions comprise a first concept, a second concept, and a relationship between the first concept and the second concept;
extracting from the corpus of documents, in accordance with one or more of the rules from the set of rules, concepts and relationships between concepts to form rules-based assertions;
associating evidence information with each of the rules-based assertions; and
storing the rules-based assertions and evidence information in one or more databases.
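The steps of claim 26 can be sketched end to end as shown here. This is a hedged illustration under stated assumptions, not the method as actually implemented: the sample documents, the keyword-based corpus selection, the regular-expression rules, and the SQLite table layout are all invented for the example, and a real system might rely on natural language processing rather than simple patterns.

```python
import re
import sqlite3

# Hypothetical data sources: document id -> text.
DOCUMENTS = {
    "doc-1": "Aspirin inhibits COX-1. The weather was mild.",
    "doc-2": "Stock prices rose sharply on Tuesday.",
    "doc-3": "Imatinib binds ABL1 in most assays.",
}

# Assumed stand-in for domain relevance: a document is kept if it
# mentions any of these terms.
DOMAIN_TERMS = ("inhibits", "binds")

# Assumed rule format: (relationship type, pattern capturing two concepts).
RULES = [
    ("inhibits", re.compile(r"\b([A-Z][\w-]*) inhibits ([A-Z][\w-]*)")),
    ("binds", re.compile(r"\b([A-Z][\w-]*) binds ([A-Z][\w-]*)")),
]


def select_corpus(documents):
    """Select the documents that appear related to the domain."""
    return {
        doc_id: text
        for doc_id, text in documents.items()
        if any(term in text for term in DOMAIN_TERMS)
    }


def extract_assertions(corpus):
    """Apply each rule and attach evidence (source id and matched text)."""
    for doc_id, text in corpus.items():
        for relationship, pattern in RULES:
            for match in pattern.finditer(text):
                first, second = match.group(1), match.group(2)
                yield (first, relationship, second, doc_id, match.group(0))


def store_assertions(assertions):
    """Store assertions and their evidence in an in-memory database."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE assertions ("
        "first_concept TEXT, relationship TEXT, second_concept TEXT, "
        "source_doc TEXT, evidence_text TEXT)"
    )
    conn.executemany("INSERT INTO assertions VALUES (?, ?, ?, ?, ?)", assertions)
    conn.commit()
    return conn


corpus = select_corpus(DOCUMENTS)
conn = store_assertions(extract_assertions(corpus))
rows = conn.execute(
    "SELECT first_concept, relationship, second_concept, source_doc, evidence_text "
    "FROM assertions ORDER BY source_doc"
).fetchall()
```

Keeping the source document and matched text alongside each stored triple is one simple way to realize the "evidence information" of the claim; richer evidence such as confidence scores or provenance chains would need additional columns.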
Specification