Generating important values from a variety of server log files
First Claim
1. A method, in a data processing system comprising a process and a memory coupled to the processor, for identifying suggestions as to which log files associated with data in a data warehouse to search for particular data, the method comprising:
- utilizing an identified data structure of a plurality of log files from a set of log files, selecting, by feature selection logic specifically configured by a log file evaluation mechanism within the data processing system, features from the contents of the log file;
grouping, by the feature selection logic, log files in the set of log files together based on the selected features;
from structured data of the grouped log files, extracting, by extraction logic specifically configured by the log file evaluation mechanism within the data processing system, log event sequences;
calculating, by correlation logic specifically configured by the log file evaluation mechanism within the data processing system, a correlation between the log event sequences and a plurality of data transaction tables from a set of data transaction tables in the data warehouse;
utilizing a highest valued correlate log sequence for the plurality of data transaction tables, determining, by the correlation logic, a business relevance value between the plurality of log files and a business analysis objective, wherein determining the business relevance value between each log file in the set of log files and the business analysis objective utilizes the following business relevance function;
1 Assignment
0 Petitions
Accused Products
Abstract
A mechanism is provided for identifying suggestions as to which log files associated with data in a data warehouse to search for particular data. Features from the contents of a plurality of log files from a set of log files are selected. The plurality of log files are grouped based on the selected features. Using extracted log event sequences, a correlation between the log event sequences and a plurality of data transaction tables from a set of data transaction tables in the data warehouse is calculated. Suggestions as to which log files in the set of log files should be searched is then identified for particular data based on a business relevance value and a utilized data ratio.
15 Citations
17 Claims
-
1. A method, in a data processing system comprising a process and a memory coupled to the processor, for identifying suggestions as to which log files associated with data in a data warehouse to search for particular data, the method comprising:
-
utilizing an identified data structure of a plurality of log files from a set of log files, selecting, by feature selection logic specifically configured by a log file evaluation mechanism within the data processing system, features from the contents of the log file; grouping, by the feature selection logic, log files in the set of log files together based on the selected features; from structured data of the grouped log files, extracting, by extraction logic specifically configured by the log file evaluation mechanism within the data processing system, log event sequences; calculating, by correlation logic specifically configured by the log file evaluation mechanism within the data processing system, a correlation between the log event sequences and a plurality of data transaction tables from a set of data transaction tables in the data warehouse; utilizing a highest valued correlate log sequence for the plurality of data transaction tables, determining, by the correlation logic, a business relevance value between the plurality of log files and a business analysis objective, wherein determining the business relevance value between each log file in the set of log files and the business analysis objective utilizes the following business relevance function; - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A non-transitory computer readable storage medium having a computer readable program stored therein, wherein the computer readable program, when executed on a computing device, causes the computing device to:
-
utilizing an identified data structure of a plurality of log file from a set of log files, select, utilizing feature selection logic of a log file evaluation mechanism specifically configured by readable program, features from the contents of the log file; group, utilizing the feature selection logic, log files in the set of log files together based on the selected features; from structured data of the grouped log files, extract, utilizing extraction logic of the log file evaluation mechanism specifically configured by the computer readable program, log event sequences; calculate, utilizing correlation logic of the log file evaluation mechanism specifically configured by the computer readable program, a correlation between the log event sequences and a plurality of data transaction tables from a set of data transaction tables in the data warehouse; utilizing a highest valued correlate log sequence for the plurality of data transaction tables, determine, utilizing the correlation logic, a business relevance value between the plurality of log files and a business analysis objective, wherein the computer readable program to determine the business relevance value between each log file in the set of log files and the business analysis objective utilizes the following business relevance function; - View Dependent Claims (9, 10, 11, 12)
-
-
13. An apparatus comprising:
-
a processor; and a memory coupled to the processor, wherein the memory comprises instructions which, when executed by the processor, cause the processor to; utilizing an identified data structure of a plurality of log files from a set of log files, select, utilizing feature selection logic of a log file evaluation mechanism specifically configured by the instructions, features from the contents of the log file; group, utilizing the feature selection logic, log files in the set of log files together based on the selected features; from structured data of the grouped log files, extract, utilizing extraction logic of the log file evaluation mechanism specifically configured by the instructions, log event sequences; calculate, utilizing correlation logic of the log file evaluation mechanism specifically configured by instructions, a correlation between the log event sequences and a plurality of data transaction tables from a set of data transaction tables in the data warehouse; utilizing a highest valued correlate log sequence for the plurality of data transaction tables, determine, utilizing the correlation logic, a business relevance value between the plurality of log files and a business analysis objective, wherein the instructions to determine the business relevance value between each log file in the set of log files and the business analysis objective utilizes the following business relevance function; - View Dependent Claims (14, 15, 16, 17)
-
Specification