Extracting and leveraging knowledge from unstructured data
First Claim
Patent Images
1. A system, comprising:
- a processor configured to operate a machine-implemented data extractor and correlator configured to retrieve public data from at least one of a plurality of public data sources, extract information from unstructured data within the retrieved public data, and correlate the information extracted from the unstructured data with previously stored structured data to generate additional structured data by building at least one semantic relationship between the extracted information and the previously stored structured data based at least in part on semantic information, the data extractor and correlator comprising a semantic analysis module configured to determine the semantic information based on the unstructured data, wherein the data extractor and correlator is further configured to extract information from the unstructured data by performing;
breaking at least one sentence into subject, verb, and object;
extracting phrases that link a subject to an object;
extracting at least one word in close proximity to an identified feature; and
extracting at least one word in close proximity to a known quality; and
a storage device configured to store the previously stored structured data and the additional structured data.
1 Assignment
0 Petitions
Accused Products
Abstract
A system may include a machine-implemented data extractor and correlator configured to retrieve data from at least one data source. The data extractor and correlator may extract information from unstructured data within the retrieved data and correlate the extracted information with previously stored structured data to generate additional structured data. The system may also include a storage device configured to store the previously stored structured data and the additional structured data.
-
Citations
16 Claims
-
1. A system, comprising:
-
a processor configured to operate a machine-implemented data extractor and correlator configured to retrieve public data from at least one of a plurality of public data sources, extract information from unstructured data within the retrieved public data, and correlate the information extracted from the unstructured data with previously stored structured data to generate additional structured data by building at least one semantic relationship between the extracted information and the previously stored structured data based at least in part on semantic information, the data extractor and correlator comprising a semantic analysis module configured to determine the semantic information based on the unstructured data, wherein the data extractor and correlator is further configured to extract information from the unstructured data by performing; breaking at least one sentence into subject, verb, and object; extracting phrases that link a subject to an object; extracting at least one word in close proximity to an identified feature; and extracting at least one word in close proximity to a known quality; and a storage device configured to store the previously stored structured data and the additional structured data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A machine-implemented method, comprising:
-
a device retrieving public data from at least one of a plurality of public data sources; a processor extracting information from unstructured data within the retrieved public data by performing; breaking at least one sentence into subject, verb, and object; extracting phrases that link a subject to an object; extracting at least one word in close proximity to an identified feature; and extracting at least one word in close proximity to a known quality; the processor determining semantic information based on the unstructured data and correlating the information extracted from the unstructured data with previously stored structured data to generate additional structured data by building at least one semantic relationship between the information extracted and the previously stored structured data based at least in part on the semantic information; and a storage device storing the previously stored structured data and the additional structured data. - View Dependent Claims (15, 16)
-
Specification