Device for retrieving data from a knowledge-based text
Abstract
The invention relates to a device and a method for extracting information from an unstructured text, the information comprising relevant instances of classes/entities sought by the user and the relations between those classes/entities. Within a given domain, the device and method improve semi-automatically. They also greatly ease the transition from one domain to a new one.
18 Claims
- 1. A device for extracting information from a text (10), comprising an extraction module (20) and a learning module (30) cooperating with each other, comprising means (212) for automatically selecting in the text (10) the contexts of instances of classes/entities of information to be extracted, for automatically selecting from these contexts those which are relevant for a domain, and for enabling the user to modify this latter selection in such a manner that the learning module (30) will improve the next output (70, 80) of the extraction module (20), characterized in that the extraction module (20) additionally comprises means (213) for identifying relations existing in the text (10) between the relevant entities at the output of the means (212).
- 11. A method for extracting information from a text (10), comprising a learning process (2000) and a selection process (1000), said selection process comprising a step (1100) of automatic selection in the text of contexts of instances of classes/entities of the information to be extracted, a step (1110) of automatic selection from these contexts of those which are relevant for a domain, and a step (1130) of modification by the user of outputs of the previous step, the modified outputs being taken into account in the learning process (2000) to improve the next result of the selection process (1000), characterized in that the selection process (1000) additionally comprises steps (1310, 1320, 1330) to identify the relations existing in the text (10) between the relevant entities at the output of the steps (1120, 1130) of the selection process (1000).
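The flow recited in claims 1 and 11 (select contexts of instances, keep those relevant to a domain, let the user correct that selection, feed the corrections back, then identify relations between the relevant entities) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the class and function names (`Context`, `LearningModule`, `ExtractionModule`), the lexicon lookup used to find instances, the keyword test used for domain relevance, and the same-sentence co-occurrence rule used as a stand-in for relation identification.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Context:
    entity: str    # matched instance text
    cls: str       # class the instance belongs to
    sentence: str  # sentence in which the instance occurs (its "context")


class LearningModule:
    """Hypothetical learner: remembers user corrections per (class, entity)."""

    def __init__(self):
        self.overrides = {}

    def record(self, ctx, relevant):
        # Store the user's decision so that later runs reuse it.
        self.overrides[(ctx.cls, ctx.entity.lower())] = relevant

    def adjust(self, ctx, default):
        # Fall back to the automatic decision when no correction exists.
        return self.overrides.get((ctx.cls, ctx.entity.lower()), default)


class ExtractionModule:
    """Hypothetical extractor; the lexicon and keyword test are stand-ins."""

    def __init__(self, lexicon, domain_terms, learner):
        self.lexicon = lexicon            # class name -> list of instance strings
        self.domain_terms = domain_terms  # lowercase words marking the domain
        self.learner = learner

    def select_contexts(self, text):
        # Split into sentences, then record every lexicon hit with its sentence.
        sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
        found = []
        for sentence in sentences:
            for cls, names in self.lexicon.items():
                for name in names:
                    if re.search(r"\b" + re.escape(name) + r"\b", sentence, re.IGNORECASE):
                        found.append(Context(name, cls, sentence))
        return found

    def select_relevant(self, contexts):
        # Automatic relevance decision, overridden by any stored correction.
        kept = []
        for ctx in contexts:
            words = set(re.findall(r"\w+", ctx.sentence.lower()))
            default = bool(words & self.domain_terms)
            if self.learner.adjust(ctx, default):
                kept.append(ctx)
        return kept

    def identify_relations(self, relevant):
        # Stand-in rule: two relevant entities in one sentence are related.
        by_sentence = {}
        for ctx in relevant:
            by_sentence.setdefault(ctx.sentence, []).append(ctx)
        relations = []
        for group in by_sentence.values():
            for i in range(len(group)):
                for j in range(i + 1, len(group)):
                    relations.append((group[i].entity, group[j].entity))
        return relations


text = "Acme acquired Globex in 2020. Acme sponsors a football team. Initech hired Bob."
learner = LearningModule()
extractor = ExtractionModule({"Company": ["Acme", "Globex", "Initech"]},
                             {"acquired", "hired"}, learner)

relevant = extractor.select_relevant(extractor.select_contexts(text))
relations_before = extractor.identify_relations(relevant)

# The user rejects the Globex context; the learner applies this on the next run.
learner.record(next(c for c in relevant if c.entity == "Globex"), False)
relevant_after = extractor.select_relevant(extractor.select_contexts(text))
relations_after = extractor.identify_relations(relevant_after)
```

In this toy run the first pass relates Acme to Globex, and the second pass, after the user's correction, no longer does. A real learning module would generalize from corrections rather than memorize them; the override table is only the simplest way to show the feedback loop.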
Specification