Methods and products for integrating mixed format data including the extraction of relational facts from free text
Abstract
Disclosed herein are systems, methods and products for interpreting and structuring free text records utilizing extractions of several types including syntactic, role, thematic and domain extractions. Also disclosed herein are systems, methods and products for integrating interpretive extractions with structured data into unified structures that can be analyzed with, among other tools, data mining and data visualization tools.
32 Claims
1. A computer program product located to one or more storage media devices usable to perform integration of mixed format data, said computer program product comprising instructions executable by a computer to perform the functions of:
accessing a database of structured data, the structured data comprising a set of data tuples;
accessing a source of unstructured data, the unstructured data including free text relatable to the data tuples of the structured data;
extracting relational facts from the free text;
producing a set of construed data, each construed datum containing at least one relational fact, each construed datum being further relatable to a data tuple of the structured data; and
integrating the produced data with the data tuples of the structured data.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16
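As an illustration only, and not as the patentee's implementation, the functions recited in claim 1 might be sketched in Python as follows. The sample records, the shared `record_id` key, the `RelationalFact` and `ConstruedDatum` types, and the verb-matching heuristic are all hypothetical assumptions introduced for this sketch; the claim itself does not prescribe any particular extraction technique or data layout.

```python
from dataclasses import dataclass

# Hypothetical structured data: a set of tuples keyed by record_id.
STRUCTURED = [
    {"record_id": 1, "vehicle": "sedan", "year": 2001},
    {"record_id": 2, "vehicle": "truck", "year": 1999},
]

# Hypothetical free text, relatable to the tuples through the same key.
FREE_TEXT = {
    1: "Customer reports brake pedal vibrates. Technician replaced rotor.",
    2: "Engine stalls at idle.",
}

# Toy relation vocabulary (an assumption; a real extractor would use parsing).
VERBS = ("replaced", "reports", "stalls at")


@dataclass(frozen=True)
class RelationalFact:
    subject: str
    relation: str
    obj: str


@dataclass(frozen=True)
class ConstruedDatum:
    record_id: int  # relates the datum back to a structured tuple
    fact: RelationalFact


def extract_facts(text):
    """Extract (subject, relation, object) facts, one per matching sentence."""
    facts = []
    for sentence in text.split("."):
        sentence = sentence.strip()
        for verb in VERBS:
            marker = f" {verb} "
            if marker in sentence:
                subject, obj = sentence.split(marker, 1)
                facts.append(
                    RelationalFact(subject.strip().lower(), verb, obj.strip().lower())
                )
                break  # keep at most one fact per sentence in this sketch
    return facts


def produce_construed_data(free_text):
    """Wrap each extracted fact with the key of the tuple it relates to."""
    return [
        ConstruedDatum(record_id, fact)
        for record_id, text in free_text.items()
        for fact in extract_facts(text)
    ]


def integrate(structured, construed):
    """Attach the construed facts to the matching structured tuples."""
    by_id = {}
    for datum in construed:
        by_id.setdefault(datum.record_id, []).append(datum.fact)
    return [{**row, "facts": by_id.get(row["record_id"], [])} for row in structured]
```

Calling `integrate(STRUCTURED, produce_construed_data(FREE_TEXT))` yields one combined record per tuple, each carrying its original fields plus the facts drawn from the associated free text.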
17. A computer program product located to one or more storage media devices usable to perform integration of mixed format data, said computer program product comprising instructions executable by a computer to perform the functions of:
accessing a database of structured data, the structured data comprising a set of data tuples;
accessing a source of unstructured data, the unstructured data including free text relatable to the data tuples of the structured data;
extracting relational facts from the free text;
producing a set of construed data reflecting at least one relational fact conveyed in free text, each construed datum containing at least one relational fact, each construed datum being further relatable to a data tuple of the structured data;
integrating the produced data with the data tuples of the structured data, said integrating retaining reference information to the original free text; and
constructing a library containing extracted attributes.
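The two elements that distinguish claim 17, retaining reference information to the original free text and constructing a library of extracted attributes, might be illustrated by the following minimal Python sketch. It is offered under stated assumptions and is not taken from the specification: the `SourceRef` character-offset scheme, the term-lookup extractor, the attribute names, and the reduction of a relational fact to an attribute/value pair are all hypothetical simplifications.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class SourceRef:
    """Hypothetical reference back to the span of original free text."""
    doc_id: str
    start: int
    end: int


@dataclass(frozen=True)
class ConstruedDatum:
    record_id: int   # key of the related structured tuple
    attribute: str   # e.g. "component" (illustrative name)
    value: str
    source: SourceRef


def extract_with_refs(record_id, doc_id, text, attribute_terms):
    """Find known terms in the text, recording where each one was found.

    Assumes plain ASCII text, so lowercasing does not shift offsets.
    """
    lowered = text.lower()
    data = []
    for attribute, terms in attribute_terms.items():
        for term in terms:
            start = lowered.find(term)
            if start != -1:
                ref = SourceRef(doc_id, start, start + len(term))
                data.append(ConstruedDatum(record_id, attribute, term, ref))
    return data


def build_attribute_library(data):
    """Collect every extracted value under its attribute name."""
    library = defaultdict(set)
    for datum in data:
        library[datum.attribute].add(datum.value)
    return dict(library)


def integrate(rows, data):
    """Merge construed data into tuples, keeping the source references."""
    by_id = defaultdict(list)
    for datum in data:
        by_id[datum.record_id].append(
            {"attribute": datum.attribute, "value": datum.value, "source": datum.source}
        )
    return [{**row, "extracted": by_id.get(row["record_id"], [])} for row in rows]
```

Because each integrated entry keeps its `SourceRef`, a reviewer can slice the original document with `text[ref.start:ref.end]` to check an extracted value against the wording it came from.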
18. A method for integrating mixed format data, comprising the steps of:
accessing a database of structured data, the structured data comprising a set of data tuples;
accessing a source of unstructured data, the unstructured data including free text relatable to the data tuples of the structured data;
extracting relational facts from the free text;
producing a set of construed data reflecting at least one relational fact conveyed in free text, each construed datum containing at least one relational fact, each construed datum being further relatable to a data tuple of the structured data; and
integrating the produced data with the data tuples of the structured data.
Dependent claims: 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32
Specification