DYNAMIC SCHEDULING OF TASKS FOR COLLECTING AND PROCESSING DATA USING JOB CONFIGURATION DATA
Abstract
A scheduler manages execution of a plurality of data-collection jobs, assigns individual jobs to specific forwarders in a set of forwarders, and generates and transmits tokens (e.g., pairs of data-collection tasks and target sources) to assigned forwarders. The forwarder uses the tokens, along with stored information applicable across jobs, to collect data from the target source and forward it onto an indexer for processing. For example, the indexer can then break a data stream into discrete events, extract a timestamp from each event and index (e.g., store) the event based on the timestamp. The scheduler can monitor forwarders' job performance, such that it can use the performance to influence subsequent job assignments. Thus, data-collection jobs can be efficiently assigned to and executed by a group of forwarders, where the group can potentially be diverse and dynamic in size.
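The assignment loop described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the class names `Forwarder`, `Token`, and `Scheduler`, and the choice of scoring forwarders by estimated backlog (pending jobs times average reported completion time), are assumptions made only for illustration. The abstract says only that monitored performance can influence later assignments.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Forwarder:
    name: str
    # Durations (in seconds) reported for previously completed jobs.
    completion_times: List[float] = field(default_factory=list)
    # Jobs assigned to this forwarder that have not yet completed.
    pending: int = 0

    def average_time(self) -> float:
        # A forwarder with no history scores 0.0, so it is tried early.
        if not self.completion_times:
            return 0.0
        return sum(self.completion_times) / len(self.completion_times)


@dataclass(frozen=True)
class Token:
    # A pair of data-collection task and target source, as in the abstract.
    task: str
    target: str


class Scheduler:
    def __init__(self, forwarders: List[Forwarder]) -> None:
        self.forwarders = list(forwarders)

    def assign(self, task: str, target: str) -> Tuple[Forwarder, Token]:
        # Hypothetical policy: pick the forwarder with the smallest estimated
        # backlog, breaking ties by pending count and then by name.
        best = min(
            self.forwarders,
            key=lambda f: ((f.pending + 1) * f.average_time(), f.pending, f.name),
        )
        best.pending += 1
        return best, Token(task, target)

    def report_completion(self, forwarder: Forwarder, seconds: float) -> None:
        # Performance feedback that influences subsequent assignments.
        forwarder.pending -= 1
        forwarder.completion_times.append(seconds)
```

Because forwarders are scored on every call, the set can grow or shrink between assignments without any change to the policy.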
60 Claims
1-30. (canceled)
31. A computer-implemented method, comprising:
identifying, by a scheduler, a plurality of jobs, wherein each job in the plurality of jobs comprises collecting data from one or more external sources;
transmitting, at a direction of the scheduler, credentials to each forwarder in a set of multiple forwarders, wherein each forwarder stores the credentials;
generating, by the scheduler, job configuration data that includes identification of a data collection task to perform, a data filter to apply, and a target source from which the data is to be collected;
selecting, by the scheduler, for each job a particular forwarder and assigning the job to the particular forwarder selected, wherein assigning the job to the particular forwarder takes into account information received from the particular forwarder on completion of previously assigned jobs;
transmitting, by the scheduler, the job configuration data that includes the identification of the data collection task to perform, the data filter to apply, and the target source from which the data is to be collected to the particular forwarder, and
using, by the particular forwarder, the job configuration data and the stored credentials in combination as needed to execute the job.
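As an illustration of the final step of claim 31, the sketch below shows one way a forwarder might combine job configuration data with credentials it stored earlier. It is a hedged example under stated assumptions, not the claimed implementation: `JobConfig`, `Forwarder`, the `fetchers` registry, and the substring-based filter are all hypothetical names and simplifications introduced here.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Optional

# Hypothetical fetcher signature: (target, credential) -> collected lines.
Fetcher = Callable[[str, Optional[str]], Iterable[str]]


@dataclass(frozen=True)
class JobConfig:
    task: str              # identification of the data-collection task
    filter_substring: str  # simplified stand-in for "a data filter to apply"
    target: str            # target source from which data is collected


class Forwarder:
    def __init__(self, fetchers: Dict[str, Fetcher]) -> None:
        self.fetchers = fetchers
        self.credentials: Dict[str, str] = {}

    def store_credentials(self, credentials: Dict[str, str]) -> None:
        # Credentials arrive once, ahead of any job, and persist across jobs.
        self.credentials.update(credentials)

    def execute(self, config: JobConfig) -> List[str]:
        # "As needed": a target without a stored credential gets None.
        credential = self.credentials.get(config.target)
        fetch = self.fetchers[config.task]
        lines = fetch(config.target, credential)
        # Apply the data filter before forwarding the result.
        return [line for line in lines if config.filter_substring in line]
```

Keeping credentials out of `JobConfig` means the per-job message carries only the task, filter, and target, while secrets are transmitted once and reused.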
48. A system, comprising:
a plurality of data processors associated with a scheduler and a set of forwarders; and
a non-transitory computer-readable storage medium containing instructions which when executed on the plurality of data processors, cause the processors to perform operations including:
identifying, by a scheduler, a plurality of jobs, wherein each job in the plurality of jobs comprises collecting data from one or more external sources;
transmitting, at a direction of the scheduler, credentials to each forwarder in a set of multiple forwarders, wherein each forwarder stores the credentials;
generating, by the scheduler, job configuration data that includes identification of a data collection task to perform, a data filter to apply, a target source from which the data is to be collected, and a particular indexer in a plurality of indexers to receive the data;
selecting, by the scheduler, for each job a particular forwarder and assigning the job to the particular forwarder selected, wherein assigning the job to the particular forwarder takes into account information received from the particular forwarder on completion of previously assigned jobs;
transmitting, by the scheduler, the job configuration data that includes the identification of a data collection task to perform, a data filter to apply, and a target source from which the data is to be collected to the particular forwarder; and
using, by the particular forwarder, the job configuration data and the stored credentials in combination as needed to execute the job.
58. The system of claim 58, wherein, for a first job in the plurality of jobs, the data to be collected includes unstructured data.
60. A computer-program product tangibly embodied in a non-transitory machine-readable storage medium, including instructions configured to cause a plurality of data processors associated with a scheduler and a set of forwarders to:
identify, by a scheduler, a plurality of jobs, wherein each job in the plurality of jobs comprises collecting data from one or more external sources;
transmit, at a direction of the scheduler, credentials to each forwarder in a set of multiple forwarders, wherein each forwarder stores the credentials;
generate, by the scheduler, job configuration data that includes identification of a data collection task to perform, a data filter to apply, a target source from which the data is to be collected, and a particular indexer in a plurality of indexers to receive the data;
select, by the scheduler, for each job a particular forwarder and assign the job to the particular forwarder selected, wherein assigning the job to the particular forwarder takes into account information received from the particular forwarder on completion of previously assigned jobs;
transmit, by the scheduler, the job configuration data that includes the identification of a data collection task to perform, a data filter to apply, and a target source from which the data is to be collected to the particular forwarder, and
use, by the particular forwarder, the job configuration data and the stored credentials in combination as needed to execute the job.
Specification