Log file compression
First Claim
1. A method for compressing a log file, the method comprising:
- identifying by one or more processors, two or more fields in a log file based on two or more field rules from among a plurality of field rules specified in a log file framework;
extracting, by the one or more processors, contents of the log file associated with the two or more fields;
passing, by the one or more processors, the contents associated with the two or more fields to two or more corresponding compression engines from among a plurality of compression engines each specified for performing a separate type of compression from among a plurality of types of compression for each of the two or more fields, wherein each of the two or more fields corresponds to one or more compression engines from among the plurality of compression engines;
receiving, by the one or more processors, a plurality of outputs from each of the one or more compression engines, wherein the plurality of outputs comprise compressed contents and uncompressed contents, wherein the compressed contents comprise at least one content element replaced by one or more of the two or more compression engines by a dictionary entry index into at least one entry in a dictionary shared among the two or more compression engines for replacing redundant elements among the two or more compression engines;
identifying, by the one or more processors, the at least one dictionary entry index in the compressed contents, wherein the at least one dictionary entry index is an index value assigned by the one or more compression engines to a text string in the dictionary;
arranging, by the one or more processors, the compressed contents, uncompressed contents, and each text string from the dictionary associated with each identified at least one dictionary entry index in a predefined format for a compressed log file; and
outputting, by the one or more processors, the compressed log file.
1 Assignment
0 Petitions
Accused Products
Abstract
A compression system identifies one or more fields in a log file based on at least one field rule from among multiple field rules specified in a log file framework. The compression system extracts contents of the log file associated with the one or more fields. The compression system passes the contents associated with the one or more fields to corresponding compression engines from among a multiple compression engines each specified for performing a separate type of compression from among multiple types of compression for each of the one or more fields, wherein each of the one or more fields corresponds to one or more compression engines from among the multiple compression engines.
11 Citations
19 Claims
-
1. A method for compressing a log file, the method comprising:
-
identifying by one or more processors, two or more fields in a log file based on two or more field rules from among a plurality of field rules specified in a log file framework; extracting, by the one or more processors, contents of the log file associated with the two or more fields; passing, by the one or more processors, the contents associated with the two or more fields to two or more corresponding compression engines from among a plurality of compression engines each specified for performing a separate type of compression from among a plurality of types of compression for each of the two or more fields, wherein each of the two or more fields corresponds to one or more compression engines from among the plurality of compression engines; receiving, by the one or more processors, a plurality of outputs from each of the one or more compression engines, wherein the plurality of outputs comprise compressed contents and uncompressed contents, wherein the compressed contents comprise at least one content element replaced by one or more of the two or more compression engines by a dictionary entry index into at least one entry in a dictionary shared among the two or more compression engines for replacing redundant elements among the two or more compression engines; identifying, by the one or more processors, the at least one dictionary entry index in the compressed contents, wherein the at least one dictionary entry index is an index value assigned by the one or more compression engines to a text string in the dictionary; arranging, by the one or more processors, the compressed contents, uncompressed contents, and each text string from the dictionary associated with each identified at least one dictionary entry index in a predefined format for a compressed log file; and outputting, by the one or more processors, the compressed log file. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system for compressing a log file, comprising:
-
one or more processors; a compression system, for execution by at least one of the one or more processors and operative to identify two or more fields in a log file based on two or more field rules from among a plurality of field rules specified in a log file framework; the compression system operative to extract contents of the log file associated with the two or more fields; the compression system operative to pass the contents associated with the two or more fields to two or more corresponding compression engines from among a plurality of compression engines each specified for performing a separate type of compression from among a plurality of types of compression for each of the two or more fields, wherein each of the two or more fields corresponds to one or more compression engines from among the plurality of compression engines; the compression system operative to receive a plurality of outputs from each of the two or more compression engines, wherein the plurality of outputs comprise compressed contents and uncompressed contents, wherein the compressed contents comprise at least one content element replaced by one or more of the two or more compression engines by a dictionary entry index into at least one entry in a dictionary shared among the two or more compression engines for replacing redundant elements among the two or more compression engines; the compression system operative to identify the at least one dictionary entry index in the compressed contents, wherein the at least one dictionary entry index is an index value assigned by the one or more compression engines to a text string in the dictionary; the compression system operative to arrange the compressed contents, uncompressed contents, and each text string from the dictionary associated with each identified at least one dictionary entry index in a predefined format for a compressed log file; and the compression system operative to output the compressed log file. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A computer program product for compressing a log file, the computer program product comprising:
-
one or more computer-readable tangible storage devices; program instructions, stored on at least one of the one or more storage devices, to identify two or more fields in a log file based on two or more field rules from among a plurality of field rules specified in a log file framework; program instructions, stored on at least one of the one or more storage devices, to extract contents of the log file associated with the two or more fields; program instructions, stored on at least one of the one or more storage devices, to pass the contents associated with the two or more fields to corresponding compression engines from among a plurality of compression engines each specified for performing a separate type of compression from among a plurality of types of compression for each of the two or more fields, wherein each of the two or more fields corresponds to one or more compression engines from among the plurality of compression engines program instructions, stored on at least one of the one or more storage devices, to receive a plurality of outputs from each of the two or more compression engines, wherein the plurality of outputs comprise compressed contents and uncompressed contents, wherein the compressed contents comprise at least one content element replaced by one or more of the two or more compression engines by a dictionary entry index into at least one entry in a dictionary shared among the two or more compression engines for replacing redundant elements among the two or more compression engines; program instructions, stored on at least one of the one or more storage devices, to identify the at least one dictionary entry index in the compressed contents, wherein the at least one dictionary entry index is an index value assigned by the one or more compression engines to a text string in the dictionary; program instructions, stored on at least one of the one or more storage devices, to arrange the compressed contents, uncompressed contents and each text string from the dictionary associated with each identified at least one dictionary entry index in a predefined format for a compressed log file; and program instructions, stored on at least one of the one or more storage devices, to output the compressed log file. - View Dependent Claims (16, 17, 18, 19)
-
Specification