Method and system for performing information extraction and quality control for a knowledgebase
First Claim
1. A system for extracting information from articles and for storing the extracted information in a knowledge representation, the system comprising:
- an article selection unit, for selecting and prioritizing articles from which information will be extracted;
an information extraction unit coupled to and in communication with the article selection unit, which receives one or more selected articles from the article selection unit and extracts information from the selected article according to pre-defined information extraction protocols, wherein the extracted information includes a fact represented by a relationship between at least an object and a process, and wherein the fact is derived from a natural language representation of the fact in the selected article;
a knowledge representation management unit, coupled to and in communication with the information extraction unit for determining if the extracted information has been both properly extracted and formatted for storage in the knowledge representation;
an information storage unit coupled to and in communication with the knowledge representation management unit for storing the information in the representation if it has been properly extracted and formatted and for responding to inquiries regarding the stored representation; and
a query management and information display unit, coupled to and in communication with the information storage unit for responding to user inquiries for information stored in the information storage unit, for retrieving information from the information storage unit in response to the queries and for displaying the retrieved information.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method of information extraction and storage extracts information from a plurality of articles in a distributed manner and stores the extracted information in an information store. In one embodiment, the system includes identification of a plurality of articles from which information is to be extracted and a plurality of information extractors for extracting the information from the articles. A database is provided for storing information related to the plurality of articles and the plurality of information extractors. The plurality of articles are assigned to the plurality of information extractors for information extraction and the extracted information is then stored in the information store.
-
Citations
11 Claims
-
1. A system for extracting information from articles and for storing the extracted information in a knowledge representation, the system comprising:
-
an article selection unit, for selecting and prioritizing articles from which information will be extracted;
an information extraction unit coupled to and in communication with the article selection unit, which receives one or more selected articles from the article selection unit and extracts information from the selected article according to pre-defined information extraction protocols, wherein the extracted information includes a fact represented by a relationship between at least an object and a process, and wherein the fact is derived from a natural language representation of the fact in the selected article;
a knowledge representation management unit, coupled to and in communication with the information extraction unit for determining if the extracted information has been both properly extracted and formatted for storage in the knowledge representation;
an information storage unit coupled to and in communication with the knowledge representation management unit for storing the information in the representation if it has been properly extracted and formatted and for responding to inquiries regarding the stored representation; and
a query management and information display unit, coupled to and in communication with the information storage unit for responding to user inquiries for information stored in the information storage unit, for retrieving information from the information storage unit in response to the queries and for displaying the retrieved information. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
Specification