Collecting and aggregating log data with fault tolerance
First Claim
1. A method for collecting and aggregating datasets for storage in a file system with fault tolerance, the method comprising:
- collecting datasets from a data source on a machine where the datasets are generated;
generating a batch comprising multiple messages from the datasets;
assigning a tag to the batch and computing a checksum for the batch;
writing the tag, the batch of multiple messages, and the checksum to an entry in a write-ahead-log (WAL) in storage;
sending the datasets to a receiving location;
in response to verifying the checksum of the batch of multiple messages at the receiving location, adding the tag to a map;
writing a file to destination location; and
identifying, in the file, tags associated with the batches in the file that have been written to the destination location.
5 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods of collecting and aggregating log data with fault tolerance are disclosed. One embodiment includes, one or more devices that generate log data, the one or more machines each associated with an agent node to collect the log data, wherein, the agent node generates a batch comprising multiple messages from the log data and assigns a tag to the batch. In one embodiment, the agent node further computes a checksum for the batch of multiple messages. The system may further include a collector device, the collector device being associated with a collector tier having a collector node to which the agent sends the log data; wherein, the collector determines the checksum for the batch of multiple messages received from the agent node.
115 Citations
20 Claims
-
1. A method for collecting and aggregating datasets for storage in a file system with fault tolerance, the method comprising:
-
collecting datasets from a data source on a machine where the datasets are generated; generating a batch comprising multiple messages from the datasets; assigning a tag to the batch and computing a checksum for the batch; writing the tag, the batch of multiple messages, and the checksum to an entry in a write-ahead-log (WAL) in storage; sending the datasets to a receiving location; in response to verifying the checksum of the batch of multiple messages at the receiving location, adding the tag to a map; writing a file to destination location; and identifying, in the file, tags associated with the batches in the file that have been written to the destination location. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. An apparatus for collecting and aggregating datasets for storage in a file system with fault tolerance, the apparatus including a memory storing instructions that, when executed by a processor of the apparatus, cause the apparatus to perform a method comprising:
-
collecting datasets from a data source on a machine where the datasets are generated; generating a batch comprising multiple messages from the datasets; assigning a tag to the batch and computing a checksum for the batch; writing the tag, the batch of multiple messages, and the checksum to an entry in a write-ahead-log (WAL) in storage; sending the datasets to a receiving location; in response to verifying the checksum of the batch of multiple messages at he receiving location, adding the tag to a map; writing a file to destination location; and identifying, in the file, tags associated with the batches in the file that have been written to the destination location. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable medium for collecting and aggregating datasets for storage in a file system with fault tolerance, the medium storing instructions that, when executed by a processor of an apparatus, cause the apparatus to perform a method comprising:
-
collecting datasets from a data source on a machine where the datasets are generated; generating a batch comprising multiple messages from the datasets; assigning a tag to the batch and computing a checksum for the batch; writing the tag, the batch of multiple messages, and the checksum to an entry in a write-ahead-log (WAL) in storage; sending the datasets to a receiving location; in response to verifying the checksum of the batch of multiple messages at the receiving location, adding the tag to a map; writing a file to destination location; and identifying, in the file, tags associated with the batches in the file that have been written to the destination location. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification