System and method for automating data normalization using text analytics
First Claim
Patent Images
1. An automated data enhancement processing system for processing database data, comprising:
- a system for ingesting data structured in at least one predefined format; and
a set of text analytics processes that treat the ingested data as unstructured to generate normalized data represented by consistent and structured metadata.
1 Assignment
0 Petitions
Accused Products
Abstract
A system, method and program product for normalizing, sanitizing and disambiguating structured data. Structured data includes data stored in a database management system (DBMA), as well labeled files (e.g., XML data). An automated data enhancement processing system is provided, comprising: a system for ingesting data structured in at least one predefined database format; and a set of text analytics processes that treat the ingested data as unstructured, and generate normalized data represented and indexed by consistent, structured metadata.
80 Citations
22 Claims
-
1. An automated data enhancement processing system for processing database data, comprising:
-
a system for ingesting data structured in at least one predefined format; and
a set of text analytics processes that treat the ingested data as unstructured to generate normalized data represented by consistent and structured metadata. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A program product stored on a recordable medium, which when executed by a computer, normalizes database data representation and metadata, comprising:
-
program code configured for ingesting data structured in at least one predefined database format; and
program code providing a set of text analytics processes that treats the ingested data as unstructured, and generates normalized data represented by a consistent and structured metadata. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A method for normalizing database data, comprising:
-
ingesting data structured in at least one predefined database format; and
performing a set of text analytics processes on the ingested data to generate normalized data represented by a consistent and structured metadata, wherein each of the text analytics processes treat the ingested data as unstructured. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
-
22. A method for deploying an automated data enhancement processing system for normalizing database data, comprising:
providing a computer infrastructure being operable to;
ingest data structured in at least one predefined database format; and
perform a set of text analytics processes on the ingested data to generate normalized data represented by a consistent, indexed and structured metadata, wherein each of the text analytics processes treat the ingested data as unstructured.
Specification