Method and system for performing information extraction and quality control for a knowledgebase
0 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to the field of information extraction and storage and more specifically to techniques for extracting information from a plurality of articles in a distributed manner and for storing the extracted information in an information store. an embodiment of the present invention identifies a plurality of articles from which information is to be extracted and a plurality of information extractors for extracting the information from the articles. A database is provided for storing information related to the plurality of articles and the plurality of information extractors. The plurality of articles are assigned to the plurality of information extractors for information extraction. Information extracted by information extractors from the articles is stored in the information store.
-
Citations
36 Claims
-
1-3. -3. (Canceled).
-
4. A method for constructing a knowledge representation, the method comprising the steps of:
-
selecting articles to serve as an information source for the knowledge representation;
extracting and formatting information contained in the articles for storage in the knowledge representation including representing a fact expressed in an article'"'"'s natural language as at least an object and process relationship;
verifying that the information extracted from the selected articles is correct and that it has been placed in the correct format; and
storing the formatted information in the knowledge representation. - View Dependent Claims (5, 6, 7, 8, 9, 10, 11)
-
-
12. A system for extracting information from articles originating from a first database and storing the extracted information in a second database, the system comprising:
-
an information extraction unit which extracts a finding from an article'"'"'s natural language and translates this finding into a structured finding comprising at least an object, process, and a relationship between the object and process;
a database management unit in communication with the information extraction unit for determining if the structured finding has been properly formatted for storage in the second database;
an information storage unit in communication with the second database for storing the structured finding in the second database. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36)
-
Specification