CONFIGURING A SYSTEM TO COLLECT AND AGGREGATE DATASETS
First Claim
Patent Images
1. A method for configuring a system to collect and aggregate datasets, the method, comprising:
- identifying a data source in the system from where dataset is to be collected;
configuring a machine in the system that generates the dataset to be collected, to send the dataset to the data source;
identifying an arrival location where the dataset that is collected is to be aggregated or written;
configuring an agent node by specifying a source for the agent node as the data source in the system and specifying a sink for the agent node as the arrival location.
5 Assignments
0 Petitions
Accused Products
Abstract
Methods for configuring a system to collect and aggregate datasets are disclosed. One embodiment includes, identifying a data source in the system from where dataset is to be collected, configuring a machine in the system that generates the dataset to be collected, to send the dataset to the data source, identifying an arrival location where the dataset that is collected is to be aggregated or written, and/or configuring an agent node by specifying a source for the agent node as the data source in the system and specifying a sink for the agent node as the arrival location.
110 Citations
26 Claims
-
1. A method for configuring a system to collect and aggregate datasets, the method, comprising:
-
identifying a data source in the system from where dataset is to be collected; configuring a machine in the system that generates the dataset to be collected, to send the dataset to the data source; identifying an arrival location where the dataset that is collected is to be aggregated or written; configuring an agent node by specifying a source for the agent node as the data source in the system and specifying a sink for the agent node as the arrival location. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A method for configuring a system having multiple machines to collect datasets from the multiple machines and to perform analytics on the datasets, the method, comprising:
-
identifying data sources on the multiple machines wherein datasets are to be collected from; configuring the multiple machines in the system that generate the datasets are to be collected, to send the datasets to the data source; identifying an arrival location where dataset that is collected is to be logged; specifying configurations for the multiple machines simultaneously by accessing a master through a web page and specifying the data sources for agent nodes; wherein, each agent node is associated with each of the multiple machines; specifying a sink for each of the agent node as the arrival location; specifying the arrival location as a collector source of a collector node; specifying a distributed file system as a collector sink of the collector node; wherein, the agent node or the collector node performs analytics on the datasets. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26)
-
Specification