QUERYING AND INTEGRATING STRUCTURED AND INSTRUCTURED DATA
First Claim
1. A computer-implemented method of querying and integrating structured and unstructured data, the method comprising:
- receiving entity information that is extracted from a first set of unstructured data using an open domain information extraction system, wherein the entity information comprises relationship information between a first entity and a second entity of the first set of unstructured data;
recognizing a pattern based on the relationship information and creating a schema for the first set of unstructured data based on the pattern; and
associating an element of the created schema with (i) an entity of a second set of unstructured data or (ii) a schema element of an existing set of structured data if there is sufficient overall similarity between the created schema element and either the second unstructured data entity or the schema element of the existing structured data, thereby creating a link between the created schema element and either the second unstructured data entity or the schema element of the existing set of structured data.
1 Assignment
0 Petitions
Accused Products
Abstract
A computer-implemented method, system, and article of manufacture for querying and integrating structured and unstructured data. The method includes: receiving entity information that is extracted from a first set of unstructured data using an open domain information extraction system, wherein the entity information comprises relationship information between a first entity and a second entity of the first set of unstructured data; recognizing a pattern based on the relationship information and creating a schema for the first set of unstructured data based on the pattern; and associating an element of the created schema with (i) an entity of a second set of unstructured data or (ii) a schema element of an existing set of structured data if there is sufficient overall similarity between the created schema element and either the second unstructured data entity or the schema element of the existing structured data.
44 Citations
19 Claims
-
1. A computer-implemented method of querying and integrating structured and unstructured data, the method comprising:
-
receiving entity information that is extracted from a first set of unstructured data using an open domain information extraction system, wherein the entity information comprises relationship information between a first entity and a second entity of the first set of unstructured data; recognizing a pattern based on the relationship information and creating a schema for the first set of unstructured data based on the pattern; and associating an element of the created schema with (i) an entity of a second set of unstructured data or (ii) a schema element of an existing set of structured data if there is sufficient overall similarity between the created schema element and either the second unstructured data entity or the schema element of the existing structured data, thereby creating a link between the created schema element and either the second unstructured data entity or the schema element of the existing set of structured data. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer-implemented system for querying and integrating structured and unstructured data, system comprising:
-
a receiving device configured to receive entity information that is extracted from a first set of unstructured data using an open domain information extraction system, wherein the entity information comprises relationship information between a first entity and a second entity of the first set of unstructured data; a pattern recognition device configured to recognize a pattern based on the relationship information and to create a schema for the first set of unstructured data based on the pattern; and an element association device configured to associate an element of the created schema with (i) an entity of a second set of unstructured data or (ii) a schema element of an existing set of structured data if there is sufficient overall similarity between the created schema element and either the second unstructured data entity or the schema element of the existing structured data, thereby creating a link between the created schema element and either the second unstructured data entity or the schema element of the existing set of structured data. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
-
14. A non-transitory article of manufacture tangibly embodying computer readable instructions which, when implemented, causes a computer to carry out the steps of a computer-implemented method of querying and integrating structured and unstructured data, the method comprising:
-
receiving entity information that is extracted from a first set of unstructured data using an open domain information extraction system, wherein the entity information comprises relationship information between a first entity and a second entity; recognizing a pattern based on the relationship information and creating a schema for the first set of unstructured data based on the pattern; and associating an element of the created schema with (i) an entity of a second set of unstructured data or (ii) a schema element of an existing set of structured data if there is sufficient overall similarity between the created schema element and either the second unstructured data entity or the schema element of the existing structured data, thereby creating a link between the created schema element and either the second unstructured data entity or the schema element of the existing set of structured data. - View Dependent Claims (15, 16, 17, 18, 19)
-
Specification