Techniques for facilitating information acquisition and storage
First Claim
1. A computer-implemented method of storing information in an information store, the computer-implemented method comprising:
- identifying a plurality of articles from which information is to be extracted;
assigning each article a priority score and ranking the articles from highest to lowest priority, thereby generating a queue, wherein the priority score for each article is calculated using a user-configurable priority calculation algorithm;
identifying a plurality of information extractors for extracting information from the plurality of articles;
providing a database for storing information related to the plurality of articles and the plurality of information extractors;
assigning the plurality of articles to the plurality of information extractors for information extraction, wherein the articles are assigned based on order in the queue, wherein an article with a higher rank is presented for information extraction before an article with a lower rank;
receiving information extracted by a first information extractor from a first article; and
storing the information extracted by the first information extractor from the first article in the information store.
1 Assignment
0 Petitions
Accused Products
Abstract
A method, system, and computer program product are provided for extracting information from a plurality of articles in a distributed manner and for storing the extracted information in an information store. The invention identifies a plurality of articles from which information is to be extracted and a plurality of information extractors for extracting the information from the articles. Each article is assigned a priority score and ranking the articles from highest to lowest priority, thereby generating a queue; wherein the priority score for each article is calculated using a user-configurable priority calculation algorithm. The plurality of articles is assigned to the plurality of information extractors based on order in the queue, wherein an article with a higher rank is presented for information extraction before an article with a lower rank. Information extracted by information extractors from the articles is stored in the information store.
39 Citations
67 Claims
-
1. A computer-implemented method of storing information in an information store, the computer-implemented method comprising:
-
identifying a plurality of articles from which information is to be extracted; assigning each article a priority score and ranking the articles from highest to lowest priority, thereby generating a queue, wherein the priority score for each article is calculated using a user-configurable priority calculation algorithm; identifying a plurality of information extractors for extracting information from the plurality of articles; providing a database for storing information related to the plurality of articles and the plurality of information extractors; assigning the plurality of articles to the plurality of information extractors for information extraction, wherein the articles are assigned based on order in the queue, wherein an article with a higher rank is presented for information extraction before an article with a lower rank; receiving information extracted by a first information extractor from a first article; and storing the information extracted by the first information extractor from the first article in the information store. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 58, 65)
-
-
18. A computer-implemented method of storing information in an information store, the information store configured to store the extracted information according to an information model, the computer-implemented method comprising:
-
identifying a plurality of articles from which the information is to be extracted; assigning each article a priority score and ranking the articles from highest to lowest priority, thereby generating a queue, wherein the priority score for each article is calculated using a user-configurable priority calculation algorithm; identifying information extractors for extracting the information from the plurality of articles; storing information related to the plurality of articles and the information extractors in a database; assigning the plurality of articles to the information extractors wherein the articles are assigned based on order in the queue, wherein an article with a higher rank is presented for information extraction before an article with a lower rank; and for each article from the plurality of articles; receiving information extracted from the article by the information extractor to whom the article is assigned; storing the extracted information in the database; enabling content reviewers to identify and correct errors associated with the extracted information; enabling model reviewers to identify and make changes to the information model of the information store based on the information extracted from the article; and storing the information extracted from the article in the information store. - View Dependent Claims (19, 59, 61)
-
-
20. A computer system for storing information comprising:
-
a processor; a memory coupled to the processor, the memory configured to store a plurality of code modules for execution by the processor, the plurality of code modules comprising; a code module for identifying a plurality of articles from which information is to be extracted; a code module for identifying a plurality of information extractors for extracting information from the plurality of articles; a code module for storing information related to the plurality of articles and the plurality of information extractors in a database; code for storing a priority score for each article and ranking the articles from highest to lowest priority, thereby generating a queue, wherein the priority score for each article is calculated using a user-configurable priority calculation algorithm; a code module for assigning the plurality of articles to the plurality of information extractors for information extraction, wherein the articles are assigned based on order in the queue, wherein an article with a higher rank is presented for information extraction before an article with a lower rank; a code module for receiving information extracted by a first information extractor from a first article; and a code module for storing the information extracted by the first information extractor from the first article in an information store. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 66)
-
-
37. A networked system for storing information comprising:
-
a communication network; a computer system coupled to the communication network; an information store coupled to the computer system, the information store configured to store the information according to an information model; and a database coupled to the communication network; wherein the computer system is configured to; identify a plurality of articles from which the information is to be extracted; assign each article a priority score and ranking the articles from highest to lowest priority, thereby generating a queue, wherein the priority score for each article is calculated using a user-configurable priority calculation algorithm; identify information extractors for extracting the information from the plurality of articles; store information related to the plurality of articles and the information extractors in a database; assign the plurality of articles to the information extractors wherein the articles are assigned based on order in the queue, wherein an article with a higher rank is presented for information extraction before an article with a lower rank; and for each article from the plurality of articles; receive information extracted from the article by the information extractor to whom the article is assigned; store the extracted information in the database; enable content reviewers to identify and correct errors associated with the extracted information; enable model reviewers to identify and make changes to the information model of the information store based on the information extracted from the article; and store the information extracted from the article in the information store. - View Dependent Claims (38, 62, 67)
-
-
39. A computer program product, stored on a computer-readable storage medium, for storing information in an information store, the computer program product comprising:
-
code for identifying a plurality of articles from which information is to be extracted; code for assigning each article a priority score and ranking the articles from highest to lowest priority, thereby generating a queue, wherein the priority score for each article is calculated using a user-configurable priority calculation algorithm; code for identifying a plurality of information extractors for extracting information from the plurality of articles; code for providing a database for storing information related to the plurality of articles and the plurality of information extractors; code for assigning the plurality of articles to the plurality of information extractors for information extraction, wherein the articles are assigned based on order in the queue, wherein an article with a higher rank is presented for information extraction before an article with a lower rank; code for receiving information extracted by a first information extractor from a first article; and code for storing the information extracted by the first information extractor from the first article in the information store. - View Dependent Claims (40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 60, 63, 64)
-
-
56. A computer program product stored on a computer-readable storage medium, for storing information in an information store, the information store configured to store the extracted information according to an information model, the computer program product comprising:
-
code for identifying a plurality of articles from which the information is to be extracted; code for assigning each article a priority score and ranking the articles from highest to lowest priority, thereby generating a queue, wherein the priority score for each article is calculated using a user-configurable priority calculation algorithm; code for identifying information extractors for extracting the information from the plurality of articles; code for storing information related to the plurality of articles and the information extractors in a database; code for assigning the plurality of articles to the information extractors, wherein the articles are assigned based on order in the queue, wherein an article with a higher rank is presented for information extraction before an article with a lower rank; and for each article from the plurality of articles; code for receiving information extracted from the article by the information extractor to whom the article is assigned; code for storing the extracted information in the database; code for enabling content reviewers to identify and correct errors associated with the extracted information; code for enabling model reviewers to identify and make changes to the information model of the information store based on the information extracted from the article; and code for storing the information extracted from the article in the information store. - View Dependent Claims (57)
-
Specification