Policy based data collection, processing, and negotiation for analytics
First Claim
1. A method for improving data collection and analytics utilizing off-line data auto-tagging and on-line fine granularity data tagging based on negotiations between a data provider and a data consumer, the method comprising:
- receiving, by one or more computer processors, a data request from the data consumer;
generating, by one or more computer processors, one or more attributes from the data request;
searching, by one or more computer processors, one or more data sources for data related to the one or more attributes;
collecting, by one or more computer processors, data related to the one or more attributes from the one or more data sources, wherein collecting the data related to the one or more attributes includes populating a data descriptor template for the data related to the one or more attributes with one or more descriptors;
determining, by one or more computer processors, whether the collected data match the one or more descriptors;
responsive to a determination that the collected data match the one or more descriptors, creating, by one or more computer processors, one or more annotated ontologies for the collected data by combining one or more ontologies defining a domain for the collected data with one or more modifiable data policy rules to describe the data and one or more handling requirements for the collected data based on the one or more modifiable data policy rules, wherein creating the one or more annotated ontologies includes annotating the one or more descriptors in the data descriptor template with at least one summary of one or more documents from the collected data, wherein annotating the one or more descriptors with the at least one summary includes associating summary information generated from searching the one or more data sources, applying data policy rules to the at least one summary, and modifying the at least one summary to conform to relevant data policy rules related to data cost, data topology and quantity of data;
providing, by one or more computer processors, a hierarchical visualization of the one or more annotated ontologies for the collected data to the data consumer;
receiving, by one or more computer processors, one or more data selections from the one or more annotated ontologies; and
updating, by one or more computer processors, the one or more annotated ontologies for the collected data based, at least in part, on a quality evaluation of the one or more data selections.
1 Assignment
0 Petitions
Accused Products
Abstract
An approach for ontological policy based data collection, processing, and negotiation for data in view of analytics is provided. The approach searches one or more data sources for data related to a data request. The approach collects data related to the data request from the one or more data sources. The approach determines whether one or more attributes generated from the data request match one or more descriptors associated with the data related to the data request. The approach creates one or more annotated ontologies for the data related to the data request. The approach displays a hierarchical visualization of the one or more annotated ontologies for the data related to the data request. The approach updates the one or more annotated ontologies for the data related to the data request based, at least in part, on an evaluation of the quality of the one or more data selections.
19 Citations
16 Claims
-
1. A method for improving data collection and analytics utilizing off-line data auto-tagging and on-line fine granularity data tagging based on negotiations between a data provider and a data consumer, the method comprising:
-
receiving, by one or more computer processors, a data request from the data consumer; generating, by one or more computer processors, one or more attributes from the data request; searching, by one or more computer processors, one or more data sources for data related to the one or more attributes; collecting, by one or more computer processors, data related to the one or more attributes from the one or more data sources, wherein collecting the data related to the one or more attributes includes populating a data descriptor template for the data related to the one or more attributes with one or more descriptors; determining, by one or more computer processors, whether the collected data match the one or more descriptors; responsive to a determination that the collected data match the one or more descriptors, creating, by one or more computer processors, one or more annotated ontologies for the collected data by combining one or more ontologies defining a domain for the collected data with one or more modifiable data policy rules to describe the data and one or more handling requirements for the collected data based on the one or more modifiable data policy rules, wherein creating the one or more annotated ontologies includes annotating the one or more descriptors in the data descriptor template with at least one summary of one or more documents from the collected data, wherein annotating the one or more descriptors with the at least one summary includes associating summary information generated from searching the one or more data sources, applying data policy rules to the at least one summary, and modifying the at least one summary to conform to relevant data policy rules related to data cost, data topology and quantity of data; providing, by one or more computer processors, a hierarchical visualization of the one or more annotated ontologies for the collected data to the data consumer; receiving, by one or more computer processors, one or more data selections from the one or more annotated ontologies; and updating, by one or more computer processors, the one or more annotated ontologies for the collected data based, at least in part, on a quality evaluation of the one or more data selections. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer program product for improving data collection and analytics utilizing off-line data auto-tagging and on-line fine granularity data tagging based on negotiations between a data provider and a data consumer, the computer program product comprising:
-
one or more computer readable storage media and program instructions stored on the one or more computer readable storage media, the program instructions comprising; program instructions to receive, by one or more computer processors, a data request from the data consumer; program instructions to generate, by one or more computer processors, one or more attributes from the data request; program instructions to search, by one or more computer processors, one or more data sources for data related to the one or more attributes; program instructions to collect, by one or more computer processors, data related to the one or more attributes from the one or more data sources, wherein collecting the data related to the one or more attributes includes populating a data descriptor template for the data related to the one or more attributes with one or more descriptors; program instructions to determine, by one or more computer processors, whether the collected data match one or more descriptors; responsive to a determination that the collected data match the one or more descriptors, program instructions to create, by one or more computer processors, one or more annotated ontologies for the collected data by combining one or more ontologies defining a domain for the collected data with one or more modifiable data policy rules to describe the data and one or more handling requirements for the collected data based on the one or more modifiable data policy rules, wherein creating the one or more annotated ontologies includes annotating the one or more descriptors in the data descriptor template with at least one summary of one or more documents from the collected data, wherein annotating the one or more descriptors with the at least one summary includes associating summary information generated from searching the one or more data sources, applying data policy rules to the at least one summary, and modifying the at least one summary to conform to relevant data policy rules related to data cost, data topology and quantity of data; program instructions to provide, by one or more computer processors, a hierarchical visualization of the one or more annotated ontologies for the collected data to the data consumer; program instructions to receive, by one or more computer processors, one or more data selections from the one or more annotated ontologies; and program instructions to update, by one or more computer processors, the one or more annotated ontologies for the collected data based, at least in part, on a quality evaluation of the one or more data selections. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A computer system for ontological policy based data collection, processing, and negotiation for data in view of analytics, the computer system comprising:
-
one or more hardware computer processors; one or more computer readable storage media; program instructions stored on at least one of the one or more computer readable storage media for execution by at least one of the one or more hardware computer processors, the program instructions comprising; program instructions to receive, by one or more hardware computer processors, a data request from the data consumer; program instructions to generate, by one or more hardware computer processors, one or more attributes from the data request; program instructions to search, by one or more hardware computer processors, one or more data sources for data related to the one or more attributes; program instructions to collect, by one or more hardware computer processors, data related to the one or more attributes from the one or more data sources, wherein collecting the data related to the one or more attributes includes populating a data descriptor template for the data related to the one or more attributes with one or more descriptors; program instructions to determine, by one or more hardware computer processors, whether the collected data match one or more descriptors; responsive to a determination that the collected data match the one or more descriptors, program instructions to create, by one or more hardware computer processors, one or more annotated ontologies for the collected data by combining one or more ontologies defining a domain for the collected data with one or more modifiable data policy rules to describe the data and one or more handling requirements for the collected data based on the one or more modifiable data policy rules, wherein creating the one or more annotated ontologies includes annotating the one or more descriptors in the data descriptor template with at least one summary of one or more documents from the collected data, wherein annotating the one or more descriptors with the at least one summary includes associating summary information generated from searching the one or more data sources, applying data policy rules to the at least one summary, and modifying the at least one summary to conform to relevant data policy rules related to data cost, data topology and quantity of data; program instructions to provide, by one or more hardware computer processors, a hierarchical visualization of the one or more annotated ontologies for the collected data to the data consumer; program instructions to receive, by one or more hardware computer processors, one or more data selections from the one or more annotated ontologies; and program instructions to update, by one or more hardware computer processors, the one or more annotated ontologies for the collected data based, at least in part, on a quality evaluation of the one or more data selections. - View Dependent Claims (14, 15, 16)
-
Specification