System, method and program product to estimate cost of integrating and utilizing heterogeneous data sources
First Claim
1. A method for estimating a labor cost to reconcile semantic conflicts between data schema terms used in different data sources, the method comprising the steps of:
- a computer estimating a labor cost for mapping, to shared ontology terms, respective pairs of the data schema terms having semantic conflicts with each other, the computer system estimating the labor cost based on at least five of the following;
(a) a number of the data sources that contain the data schema terms having the semantic conflicts, (b) an approximate number of the data schema terms in each of the data sources, (c) an approximate labor cost for implementing the shared ontology terms for each of the data sources, (d) an approximate labor cost to manually map to the shared ontology terms a percent of the data schema terms in each of the data sources, (e) an approximate labor cost to validate a percent of the mappings from the data schema terms to the shared ontology terms, (f) an approximate labor cost to perform functional computation for a percent of the mappings from the data schema terms to the shared ontology terms, and (g) an approximate labor cost to perform structural heterogeneity semantic mapping between a percent of the data schema terms and the shared ontology terms; and
the computer displaying on a monitor the estimated labor cost for the mapping, to the shared ontology terms, the data schema terms having semantic conflicts with each other.
0 Assignments
0 Petitions
Accused Products
Abstract
System, method and program product for estimating a cost of reconciling heterogeneous data sources. A transition cost for integrating together a first program to identify semantic conflicts, a second program to classify semantic conflicts and a third program to reconcile semantic conflicts is estimated. A steady state cost for managing and maintaining the integrated first, second and third programs is estimated. Another system, method and program product for estimating a cost of integrating heterogeneous data sources. A steady state cost of managing and maintaining a first program which identifies semantic conflicts between a cross data source query and schema elements in a data source is estimated. A steady state cost of managing and maintaining a second program which classifies semantic conflicts between the cross data source query and schema elements in the data source is estimated. A steady state cost of managing and maintaining a third program which reconciles semantic conflicts between the cross data source query and schema elements in the data source is estimated.
-
Citations
15 Claims
-
1. A method for estimating a labor cost to reconcile semantic conflicts between data schema terms used in different data sources, the method comprising the steps of:
-
a computer estimating a labor cost for mapping, to shared ontology terms, respective pairs of the data schema terms having semantic conflicts with each other, the computer system estimating the labor cost based on at least five of the following;
(a) a number of the data sources that contain the data schema terms having the semantic conflicts, (b) an approximate number of the data schema terms in each of the data sources, (c) an approximate labor cost for implementing the shared ontology terms for each of the data sources, (d) an approximate labor cost to manually map to the shared ontology terms a percent of the data schema terms in each of the data sources, (e) an approximate labor cost to validate a percent of the mappings from the data schema terms to the shared ontology terms, (f) an approximate labor cost to perform functional computation for a percent of the mappings from the data schema terms to the shared ontology terms, and (g) an approximate labor cost to perform structural heterogeneity semantic mapping between a percent of the data schema terms and the shared ontology terms; andthe computer displaying on a monitor the estimated labor cost for the mapping, to the shared ontology terms, the data schema terms having semantic conflicts with each other. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer system for estimating a labor cost to reconcile semantic conflicts between data schema terms used in different data sources, the computer system comprising:
-
a CPU, a computer readable memory and a computer readable storage media; first program instructions to estimate a labor cost for mapping, to shared ontology terms, respective pairs of the data schema terms having semantic conflicts with each other, the first program instructions estimating the labor cost based on at least five of the following;
(a) a number of the data sources that contain the data schema terms having the semantic conflicts, (b) an approximate number of the data schema terms in each of the data sources, (c) an approximate labor cost for implementing the shared ontology terms for each of the data sources, (d) an approximate labor cost to manually map to the shared ontology terms a percent of the data schema terms in each of the data sources, (e) an approximate labor cost to validate a percent of the mappings from the data schema terms to the shared ontology terms, (f) an approximate labor cost to perform functional computation for a percent of the mappings from the data schema terms to the shared ontology terms, and (g) an approximate labor cost to perform structural heterogeneity semantic mapping between a percent of the data schema terms and the shared ontology terms; andsecond program instructions to initiate display on a monitor the estimated labor cost for the mapping, to the shared ontology terms, the data schema terms having semantic conflicts with each other; and
whereinthe first and second program instructions are stored on the computer readable storage media for execution by the CPU via the computer readable memory. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A computer program product for estimating a labor cost to reconcile semantic conflicts between data schema terms used in different data sources, the computer program product comprising:
-
a computer readable storage media; first program instructions to estimate a labor cost for mapping, to shared ontology terms, respective pairs of the data schema terms having semantic conflicts with each other, the first program instructions estimating the labor cost based on at least five of the following;
(a) a number of the data sources that contain the data schema terms having the semantic conflicts, (b) an approximate number of the data schema terms in each of the data sources, (c) an approximate labor cost for implementing the shared ontology terms for each of the data sources, (d) an approximate labor cost to manually map to the shared ontology terms a percent of the data schema terms in each of the data sources, (e) an approximate labor cost to validate a percent of the mappings from the data schema terms to the shared ontology terms, (f) an approximate labor cost to perform functional computation for a percent of the mappings from the data schema terms to the shared ontology terms, and (g) an approximate labor cost to perform structural heterogeneity semantic mapping between a percent of the data schema terms and the shared ontology terms; andsecond program instructions to initiate display on a monitor the estimated labor cost for the mapping, to the shared ontology terms, the data schema terms having semantic conflicts with each other; and
whereinthe first and second program instructions are stored on the computer readable storage media. - View Dependent Claims (12, 13, 14, 15)
-
Specification