System and method for collection, retrieval, and distribution of data
First Claim
Patent Images
1. A system for collection, retrieval, and distribution of data, comprising:
- a computing device comprising a processor and a memory;
a data collector embodied in the computing device to collect data from at least one data source and to tag each group of collected data with an associated metadata tag, wherein the metadata tag includes information about the collected data;
an object distribution logic module associated with the data collector, the object distribution logic module comprising one or more caches to store data and a distributed caching strategy structure, the distributed caching strategy structure including a cache collapsing module to only send updated portions of data rather than an entire data set in accordance with data consumer submitted information and a store and forward feature to dynamically reduce flow of the data to data consumers;
at least one other computing device;
at least one distributed metadata catalogue embodied in the at least one other computing device for containing a plurality of metadata tags, wherein the metadata tags without any associated groups of collected data are stored on the at least one distributed metadata catalogue, wherein only metadata indicating the existence and location of a requested group of data on a network is sent to a requester in response to a data request corresponding to the requested group of data associated with the metadata tag to avoid sending the requested group of data over the network until needed; and
wherein the at least one distributed metadata catalogue comprises a metadata catalogue query services module configured to receive a query from the data consumer, wherein the query is received by a local metadata catalogue query services module of the at least one distributed metadata catalogue that is a closest distance from the data consumer, the local catalogue query services module is configured to determine if a scope of the query is set to local, the local catalogue query services module responding to the query in response to the scope of the query being set to local and the local catalogue query services module passing the query to another catalogue query services module of another distributed metadata catalogue capable of responding to the query.
1 Assignment
0 Petitions
Accused Products
Abstract
A system may include a data collector to collect data from at least one data source and to tag each group of collected data with a metadata tag. The metadata tag may include information about the collected data. The system may also include a metadata catalogue for containing a plurality of metadata tags and to allow a search for existence and a location on a network of any data corresponding to the search.
40 Citations
34 Claims
-
1. A system for collection, retrieval, and distribution of data, comprising:
-
a computing device comprising a processor and a memory; a data collector embodied in the computing device to collect data from at least one data source and to tag each group of collected data with an associated metadata tag, wherein the metadata tag includes information about the collected data; an object distribution logic module associated with the data collector, the object distribution logic module comprising one or more caches to store data and a distributed caching strategy structure, the distributed caching strategy structure including a cache collapsing module to only send updated portions of data rather than an entire data set in accordance with data consumer submitted information and a store and forward feature to dynamically reduce flow of the data to data consumers; at least one other computing device; at least one distributed metadata catalogue embodied in the at least one other computing device for containing a plurality of metadata tags, wherein the metadata tags without any associated groups of collected data are stored on the at least one distributed metadata catalogue, wherein only metadata indicating the existence and location of a requested group of data on a network is sent to a requester in response to a data request corresponding to the requested group of data associated with the metadata tag to avoid sending the requested group of data over the network until needed; and wherein the at least one distributed metadata catalogue comprises a metadata catalogue query services module configured to receive a query from the data consumer, wherein the query is received by a local metadata catalogue query services module of the at least one distributed metadata catalogue that is a closest distance from the data consumer, the local catalogue query services module is configured to determine if a scope of the query is set to local, the local catalogue query services module responding to the query in response to the scope of the query being set to local and the local catalogue query services module passing the query to another catalogue query services module of another distributed metadata catalogue capable of responding to the query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. A system for collection, retrieval, and distribution of data, comprising:
-
a plurality of computing devices; a configurable distributed data collection and caching structure including a plurality of data collection nodes, each data collection node being embodied in a respective one of the plurality of computing devices and each data collection node comprising a data collector to collect data from at least one data source and to tag each group of collected data with an associated metadata tag, wherein the metadata tag includes information about the collected data; an object distribution logic module associated with the configurable data collection and caching structure, the object distribution logic module comprising one or more caches to store data and a distributed caching strategy structure, the distributed caching strategy structure including a cache collapsing module to only send updated portions of data rather than an entire data set in accordance with data consumer submitted information and a store and forward feature to dynamically reduce flow of the data to data consumers; at least one data processing device comprising a processor and a memory; at least one distributed metadata catalogue node embodied in the at least one data processing device, the at least one metadata catalogue node comprising; a metadata catalogue data store for storing a plurality of metadata tags without any associated groups of collected data, each group of collected data being associated with one of the plurality of metadata tags; a metadata catalogue query services module to receive and respond to a query from the data consumer, wherein a response to the query includes only metadata indicating the existence and location in the system of data corresponding to the query to avoid moving the data corresponding to the query over a network unless needed; and wherein the query is received by a local metadata catalogue query services module of the at least one distributed metadata catalogue node that is a closest distance from the data consumer, the local catalogue query services module is configured to determine if a scope of the query is set to local, the local catalogue query services module responding to the query in response to the scope of the query being set to local and the local catalogue query services module passing the query to another catalogue query services module of another distributed metadata catalogue node capable of responding to the query. - View Dependent Claims (27, 28, 29)
-
-
30. A method for collection, retrieval, and distribution of data, comprising:
-
collecting data from at least one data source by a data collector embodied in a computing device, the computing device comprising a processor and a memory, wherein an object distribution logic module is associated with the data collector, the object distribution logic module comprising one or more caches to store data and a distributed caching strategy structure, the distributed caching strategy structure including a cache collapsing module to only send updated portions of data rather than an entire data set in accordance with data consumer submitted information and a store and forward feature to dynamically reduce flow of the data to data consumers; tagging each group of collected data by the data collector with an associated metadata tag that includes at least one of a location of the group of data associated with the metadata tag on a system, a description of the group of data associated with the metadata tag, and a summary of the group of data associated with the metadata tag; forming at least one distributed metadata catalogue embodied in at least one other computing device, the at least one distributed metadata catalogue containing a plurality of metadata tags without any associated groups of data, wherein only metadata indicating the existence and location of a requested group of data on the system is sent to a requester in response to a data request corresponding to the requested group of data associated with the metadata tag to avoid sending the requested group of data over a network until needed; and wherein the at least one distributed metadata catalogue comprises a metadata catalogue query services module configured to receive a query from the data consumer, wherein the query is received by a local metadata catalogue query services module of the at least one distributed metadata catalogue that is a closest distance from the data consumer, the local catalogue query services module is configured to determine if a scope of the query is set to local, the local catalogue query services module responding to the query in response to the scope of the query being set to local and the local catalogue query services module passing the query to another catalogue query services module of another distributed metadata catalogue capable of responding to the query. - View Dependent Claims (31, 32)
-
-
33. A computer program product for collection, retrieval, and distribution of data, the computer program product comprising:
-
a non-transitory computer readable storage medium having computer usable program code embodied therewith, the computer usable medium comprising; computer usable program code configured to collect data from at least one data source by a data collector, wherein an object distribution logic module is associated with the data collector, the object distribution logic module comprising one or more caches to store data and a distributed caching strategy structure, the distributed caching strategy structure including a cache collapsing module to only send updated portions of data rather than an entire data set in accordance with data consumer submitted information and a store and forward feature to dynamically reduce flow of the data to data consumers; computer usable program code configured to tag each group of collected data with an associated metadata tag that includes at least one of a location of the data associated with the metadata tag on the system, a description of the data associated with the metadata tag, and a summary of the data associated with the metadata tag; computer usable program code configured to form at least one distributed metadata catalogue containing a plurality of metadata tags without any associated groups of data, wherein only metadata indicating the existence and location of a requested group of data on the system is sent to a requester in response to a data request corresponding to the requested group of data associated with the metadata tag to avoid sending the requested group of data over a network until needed; and wherein the at least one distributed metadata catalogue comprises a metadata catalogue query services module configured to receive a query from the data consumer, wherein the query is received by a local metadata catalogue query services module of the at least one distributed metadata catalogue that is a closest distance from the data consumer, the local catalogue query services module is configured to determine if a scope of the query is set to local, the local catalogue query services module responding to the query in response to the scope of the query being set to local and the local catalogue query services module passing the query to another catalogue query services module of another distributed metadata catalogue capable of responding to the query. - View Dependent Claims (34)
-
Specification