INFORMATION EXTRACTION DEVICE AND INFORMATION EXTRACTION SYSTEM
First Claim
1. An information extraction device for extracting specific information using information extraction rules, comprising:
a case candidate extraction unit for extracting new specific information that is not extracted by said information extraction rules as novel case candidates based on extraction results obtained from extraction target text data;
a rule candidate generation unit for generating multiple extraction rule candidates based on said novel case candidates;
a relation analysis unit for analyzing the derivational relation between said novel case candidates and said extraction rule candidates and the overlapping relation between said multiple extraction rule candidates to generate relation analysis results; and
a case candidate selection unit for calculating the priorities of said novel case candidates based on said relation analysis results and previously prepared case information and selecting said novel case candidates according to the priority.
1 Assignment
Abstract
The information extraction device for extracting specific information using information extraction rules comprises a case candidate extraction means for extracting new specific information that is not extracted by the information extraction rules as novel case candidates based on extraction results obtained from extraction target text data; a rule candidate generation means for generating multiple extraction rule candidates based on the novel case candidates; a relation analysis means for analyzing the derivational relation between the novel case candidates and the extraction rule candidates and the overlapping relation between the multiple extraction rule candidates to generate relation analysis results; and a case candidate selection means for calculating the priorities of the novel case candidates based on the relation analysis results and previously prepared case information and selecting the novel case candidates according to the priority.
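The flow summarized in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the text above does not state how priorities are calculated, so the scoring formula (one point per derived rule, plus the number of overlapping rules, plus known-correct extractions, minus known-incorrect ones) is an assumption, as are the names `RuleCandidate`, `analyze_relations`, and `prioritize` and the sample strings.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class RuleCandidate:
    """Hypothetical container for one extraction rule candidate."""
    name: str
    derived_from: frozenset  # novel case candidates the rule was generated from
    extracts: frozenset      # strings the rule would extract from the target text


def analyze_relations(rules):
    """Build the derivational relation (case -> rules) and the overlapping
    relation (rule -> other rules sharing at least one extracted string)."""
    derivation = {}
    for rule in rules:
        for case in rule.derived_from:
            derivation.setdefault(case, set()).add(rule.name)
    overlap = {rule.name: set() for rule in rules}
    for a, b in combinations(rules, 2):
        if a.extracts & b.extracts:
            overlap[a.name].add(b.name)
            overlap[b.name].add(a.name)
    return derivation, overlap


def prioritize(candidates, rules, known_cases):
    """Score each novel case candidate and return (ranked list, scores).

    known_cases maps a string to True (known correct) or False (known
    incorrect); it stands in for the previously prepared case information.
    The scoring formula is an assumption made for illustration only.
    """
    derivation, overlap = analyze_relations(rules)
    by_name = {rule.name: rule for rule in rules}
    scores = {}
    for case in candidates:
        score = 0
        for rule_name in derivation.get(case, ()):
            rule = by_name[rule_name]
            correct = sum(1 for s in rule.extracts if known_cases.get(s) is True)
            wrong = sum(1 for s in rule.extracts if known_cases.get(s) is False)
            score += 1 + len(overlap[rule_name]) + correct - wrong
        scores[case] = score
    ranked = sorted(candidates, key=lambda c: (-scores[c], c))
    return ranked, scores


rules = [
    RuleCandidate("r1", frozenset({"Globex Inc."}),
                  frozenset({"Globex Inc.", "Acme Corp."})),
    RuleCandidate("r2", frozenset({"Globex Inc.", "Initech Ltd."}),
                  frozenset({"Globex Inc.", "Initech Ltd."})),
    RuleCandidate("r3", frozenset({"last week"}),
                  frozenset({"last week", "yesterday"})),
]
known = {"Acme Corp.": True, "yesterday": False}
ranked, scores = prioritize(["Globex Inc.", "Initech Ltd.", "last week"], rules, known)
print(ranked)
```

Under this assumed scoring, a candidate ranks higher when the rules derived from it overlap with other rule candidates and also recover cases already known to be correct, while rules that recover known-incorrect cases pull their source candidates down.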
12 Citations
19 Claims
1. An information extraction device for extracting specific information using information extraction rules, comprising:
a case candidate extraction unit for extracting new specific information that is not extracted by said information extraction rules as novel case candidates based on extraction results obtained from extraction target text data;
a rule candidate generation unit for generating multiple extraction rule candidates based on said novel case candidates;
a relation analysis unit for analyzing the derivational relation between said novel case candidates and said extraction rule candidates and the overlapping relation between said multiple extraction rule candidates to generate relation analysis results; and
a case candidate selection unit for calculating the priorities of said novel case candidates based on said relation analysis results and previously prepared case information and selecting said novel case candidates according to the priority.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. An information extraction system comprising an information extraction device connected to a user terminal via communication lines for extracting specific information using information extraction rules, wherein
said information extraction device comprises:
a case candidate extraction unit for extracting new specific information that is not extracted by said information extraction rules as novel case candidates based on extraction results obtained from extraction target text data;
a rule candidate generation unit for generating multiple extraction rule candidates based on said novel case candidates;
a relation analysis unit for analyzing the derivational relation between said novel case candidates and said extraction rule candidates and the overlapping relation between said multiple extraction rule candidates to generate relation analysis results;
a case candidate selection unit for calculating the priorities of said novel case candidates based on said relation analysis results and previously prepared case information and selecting said novel case candidates according to the priority; and
a case candidate inquiry unit for inquiring of said user terminal whether the novel case candidates selected by said case candidate selection unit are correct or incorrect and giving the determination results from said user terminal to said case candidate selection unit; and wherein
said case candidate selection unit determines whether said selected novel case candidates are correct or incorrect based on said determination results given by said case candidate inquiry unit.
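The inquiry step of claim 11 might look like the sketch below, in which the highest-priority candidates are put to the user and the answers are recorded as case information. The claim does not say how many candidates are asked about or how answers are stored, so `confirm_candidates`, the `ask_user` callback (standing in for the user terminal), and the `top_k` cutoff are all assumptions made for illustration.

```python
from typing import Callable, Dict, List


def confirm_candidates(
    ranked: List[str],
    ask_user: Callable[[str], bool],
    known_cases: Dict[str, bool],
    top_k: int = 2,
) -> Dict[str, bool]:
    """Ask the user about the top_k ranked candidates and return a copy of
    known_cases extended with each answer (True = correct, False = incorrect)."""
    updated = dict(known_cases)
    for case in ranked[:top_k]:
        updated[case] = ask_user(case)
    return updated


def scripted_user(case: str) -> bool:
    # Stand-in for a real user terminal: rejects one hard-coded string.
    return case != "last week"


result = confirm_candidates(
    ["Globex Inc.", "last week", "Initech Ltd."],
    scripted_user,
    {"Acme Corp.": True},
)
print(result)
```

Returning a new dictionary rather than mutating the input keeps the previously prepared case information unchanged until the caller decides to adopt the user's answers.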
12. An information extraction method for extracting specific information using information extraction rules, comprising the following steps:
extracting new specific information that is not extracted by said information extraction rules as novel case candidates based on extraction results obtained from extraction target text data;
generating multiple extraction rule candidates based on said novel case candidates;
analyzing the derivational relation between said novel case candidates and said extraction rule candidates and the overlapping relation between said multiple extraction rule candidates to generate relation analysis results; and
calculating the priorities of said novel case candidates based on said relation analysis results and previously prepared case information and selecting said novel case candidates according to the priority.
Dependent claims: 13, 14, 15
16. A recording medium storing an information extraction program for an information extraction device provided with a computer and extracting specific information using information extraction rules, wherein said program allows said computer to perform the following procedures:
extracting new specific information that is not extracted by said information extraction rules as novel case candidates based on extraction results obtained from extraction target text data;
generating multiple extraction rule candidates based on said novel case candidates;
analyzing the derivational relation between said novel case candidates and said extraction rule candidates and the overlapping relation between said multiple extraction rule candidates to generate relation analysis results; and
calculating the priorities of said novel case candidates based on said relation analysis results and previously prepared case information and selecting said novel case candidates according to the priority.
Dependent claims: 17, 18, 19
Specification