System and method for automating the generation of an ontology from unstructured documents
First Claim
Patent Images
1. A domain independent method of creating an ontology comprising:
- providing a corpus of documents;
extracting phrases from the documents;
extracting core noun phrases from the documents;
extracting links from the documents; and
generating an ontology in accordance with at least the extracted phrases.
3 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for the substantially automatic creation of ontologies from unstructured documents identify phrases and core noun phrases from the respective documents. Links can be extracted from the documents. Concepts can be identified from the documents. Ontologies can be automatically created for the documents. The processing is domain independent.
49 Citations
28 Claims
-
1. A domain independent method of creating an ontology comprising:
-
providing a corpus of documents; extracting phrases from the documents; extracting core noun phrases from the documents; extracting links from the documents; and generating an ontology in accordance with at least the extracted phrases. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. An apparatus comprising:
-
a document file; first software, executable by a processor that analyzes documents in the file and forms an extracted phrases file; second software, executable by a processor that analyzes documents in the file and forms a core noun phrases file; third software, executable by a processor, that analyzes documents in the file and forms a link file; and fourth software, executable by a processor, that forms an ontology in accordance with selected phrases in the extracted phrases file. - View Dependent Claims (22, 23, 24)
-
-
25. An ontology generating system comprising:
-
first software, recorded on a computer readable medium, that extracts and stores phrases from at least one text source; second software, recorded on a computer readable medium, that extracts and stores core noun phrases from the extracted phrases; third software, recorded on a computer readable medium, that extracts and stores links from the extracted phrases; and fourth software, recorded on a computer readable medium, that generates an ontology for the at least one text source. - View Dependent Claims (26, 27, 28)
-
Specification