Method and system for information extraction
Abstract
The present invention provides a method and a system for extracting information related to a pre-defined context from data sets written in semi-structured or unstructured form, such as natural language text. The information related to the pre-defined context is stored in an information store in accordance with a pre-defined structural arrangement. Further, the individual data values in the extracted information are assigned weights depending on their relevance to the attributes of the pre-defined context. These weights provide a measure for comparing how relevant different sets of structurally arranged information are to the attributes of the pre-defined context.
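The weighting idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how weights are computed, so everything here is an assumption made for illustration: the attribute names, the `ATTRIBUTE_IMPORTANCE` table, the per-value confidence, and the choice of "importance times confidence" as the weight and their sum as the record's relevance.

```python
# Hypothetical importance of each attribute within the pre-defined context.
# These names and numbers are illustrative assumptions, not from the patent.
ATTRIBUTE_IMPORTANCE = {"title": 0.5, "location": 0.3, "salary": 0.2}


def value_weights(record):
    """Assign a weight to each extracted value.

    `record` maps an attribute name to a (value, confidence) pair, where
    confidence in [0, 1] is an assumed measure of how well the value matched.
    """
    return {
        name: ATTRIBUTE_IMPORTANCE.get(name, 0.0) * confidence
        for name, (_value, confidence) in record.items()
    }


def relevance(record):
    """Overall relevance of one structured record: the sum of its weights."""
    return sum(value_weights(record).values())


def rank(records):
    """Order structured records from most to least relevant to the context."""
    return sorted(records, key=relevance, reverse=True)


record_a = {"title": ("Data Engineer", 1.0), "salary": ("90000", 0.5)}
record_b = {"location": ("Pune", 1.0)}
ranked = rank([record_b, record_a])
```

Under these assumed numbers, `record_a` outranks `record_b` because it carries a confident value for the attribute treated as most important. Any other monotone scoring rule would serve the same comparative purpose.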
31 Citations
16 Claims
1. A method for extracting information related to a pre-defined context from a data set, the pre-defined context having a set of attributes that define the context, the method comprising the steps of:
a. identifying a relevant data set from a group of data sets, the identification of the relevant data set being based on the occurrence of attributes corresponding to the pre-defined context;
b. identifying pertinent information from the relevant data set, the pertinent information being the information that contains values of the attributes corresponding to the pre-defined context;
c. extracting values of the attributes from the pertinent information; and
d. arranging the extracted values in the form of a pre-defined data structure which logically links the attributes to each other, in accordance with their inter-relationships as per the pre-defined context.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A system for extracting information related to a pre-defined context from a data set, the pre-defined context having a set of attributes that define the context, the system comprising:
a. a data set classifier, to identify a relevant data set from a group of data sets, the identification of the relevant data set being based on the occurrence of attributes corresponding to the pre-defined context;
b. an information identifier, to identify the pertinent information in the relevant data set, the pertinent information being the information that contains the values of the attributes corresponding to the pre-defined context; and
c. an entity extractor, to extract values of the attributes from the pertinent information.
Dependent claims: 9, 10, 11, 12, 13, 14, 15
16. A computer program product for use with a computer, the computer program product comprising a computer usable medium having a computer readable program code embodied therein for extracting information related to a pre-defined context from a data set, the pre-defined context comprising a set of attributes that define the context, the data set comprising structural and textual information, the computer program code performing the steps of:
a. identifying a relevant data set from a group of data sets, the identification of the relevant data set being based on the occurrence of attributes corresponding to the pre-defined context;
b. identifying pertinent information from the relevant data set, the pertinent information being information that contains the values of attributes corresponding to the pre-defined context;
c. extracting values of the attributes from the pertinent information; and
d. arranging the extracted values in the form of a pre-defined data structure which logically links the attributes to each other, in accordance with their inter-relationships in the pre-defined context.
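Steps a through d recited in the claims can be sketched as a small pipeline. This is only an illustrative reading, not the claimed implementation: the job-posting context, the `Attribute` class, the cue words, the regular expressions, the `min_hits` threshold, and the nested output layout are all hypothetical choices made for the example.

```python
import re
from dataclasses import dataclass


@dataclass
class Attribute:
    name: str     # attribute of the pre-defined context
    cue: str      # word whose occurrence signals the attribute (assumption)
    pattern: str  # regex with one group capturing the attribute's value


# Hypothetical pre-defined context: a job posting with three attributes.
JOB_CONTEXT = [
    Attribute("title", "position", r"position:\s*([A-Za-z ]+)"),
    Attribute("location", "location", r"location:\s*([A-Za-z ]+)"),
    Attribute("salary", "salary", r"salary:\s*(\d+)"),
]


def is_relevant(data_set, context, min_hits=2):
    """Step a: a data set is relevant if enough context attributes occur in it."""
    text = data_set.lower()
    return sum(attr.cue in text for attr in context) >= min_hits


def pertinent_sentences(data_set, context):
    """Step b: keep only sentences that contain a value of some attribute."""
    sentences = re.split(r"(?<=[.!?])\s+", data_set)
    return [
        s for s in sentences
        if any(re.search(a.pattern, s, re.IGNORECASE) for a in context)
    ]


def extract_values(sentences, context):
    """Step c: pull the first matching value of each attribute."""
    values = {}
    for sentence in sentences:
        for attr in context:
            match = re.search(attr.pattern, sentence, re.IGNORECASE)
            if match and attr.name not in values:
                values[attr.name] = match.group(1).strip()
    return values


def arrange(values):
    """Step d: place values in a fixed structure that links the attributes."""
    return {
        "job": {
            "title": values.get("title"),
            "offer": {
                "location": values.get("location"),
                "salary": values.get("salary"),
            },
        }
    }


def extract(data_sets, context):
    """Run steps a-d over a group of data sets."""
    return [
        arrange(extract_values(pertinent_sentences(d, context), context))
        for d in data_sets
        if is_relevant(d, context)
    ]


documents = [
    "We are hiring. Position: Data Engineer. Location: Pune. Salary: 90000.",
    "The weather was pleasant today. Nothing else happened.",
]
results = extract(documents, JOB_CONTEXT)
```

Here only the first document passes step a, and its three values end up in one nested record. Keyword cues and regular expressions stand in for whatever classification and extraction techniques an actual embodiment would use.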
Specification