SYSTEMS AND METHODS FOR TRIE-BASED AUTOMATED DISCOVERY OF PATTERNS IN COMPUTER LOGS
First Claim
Patent Images
1. A computer system including a processor and a datastore, the system comprising:
- a log processing engine connected to the datastore and configured to;
collect logs from a plurality of applications;
tokenize the logs;
match each record of the logs, from their tokens, to a pattern in a stored trie, each pattern having a unique pattern ID;
extract free parameters and metadata from the logs; and
store the logs to the datastore as combinations of the pattern IDs, the free parameters, and the metadata.
3 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for tokenization of log records for efficient data storage, log querying, and log data analytics can utilize a trie pattern conversion of the log files, storing trie data pattern IDs, free parameters, and metadata instead of the entire log record. New trie patterns can be discovered automatically by counting the occurrences of tokens matching wildcards for existing patterns.
38 Citations
10 Claims
-
1. A computer system including a processor and a datastore, the system comprising:
a log processing engine connected to the datastore and configured to; collect logs from a plurality of applications; tokenize the logs; match each record of the logs, from their tokens, to a pattern in a stored trie, each pattern having a unique pattern ID; extract free parameters and metadata from the logs; and store the logs to the datastore as combinations of the pattern IDs, the free parameters, and the metadata. - View Dependent Claims (2, 3, 4, 5)
-
6. A computer-based method for storing computer logs, the method comprising:
-
collecting logs from a plurality of applications; tokenizing, by a processor, the logs; matching, by the processor, each record of the logs, from their tokens, to a pattern in a stored trie, each pattern having a unique pattern ID; extracting, by the processor, free parameters and metadata from the logs; and storing the logs to the datastore as combinations of the pattern IDs, the free parameters, and the metadata. - View Dependent Claims (7, 8, 9, 10)
-
Specification