METHOD FOR ESTIMATING FORMAT OF LOG MESSAGE AND COMPUTER AND COMPUTER PROGRAM THEREFOR
First Claim
1. A method for use in a computer to estimate a format of a log message, the method comprising:
a creating step of creating a first directed graph structure by dividing a first log message by predetermined characters to define divided portions as nodes and arranging the nodes in order from the beginning of the first log message;
a creating step of creating a second directed graph structure by dividing a second log message by the predetermined characters to define divided portions as nodes and arranging the nodes in order from the beginning of the second log message;
a detecting step of comparing nodes in the first directed graph structure with nodes in the second directed graph structure to detect a node in the first directed graph structure and a node in the second directed graph structure that are nodes other than nodes including a corresponding character string;
an adding step of adding to the first directed graph structure the node detected in the second directed graph structure among the detected nodes as a first branch node; and
an estimating step of estimating the format, based on the first directed graph structure including the first branch node added thereto, wherein the format includes a first portion associated with a node including a corresponding character string, a second portion associated with a node whose appearance tendency of character string is similar between the node detected in the first directed graph structure and the node detected in the second directed graph structure, and, optionally, a third portion associated with a node other than nodes having a similar appearance tendency of character string.
Abstract
A technique for estimating a format of a log message (LM) according to the present invention includes creating a first directed graph structure by dividing a first LM by predetermined characters to define divided portions as nodes and arranging the nodes in order from the beginning of the first LM; creating a second directed graph structure by performing on a second LM the same processing as that performed on the first LM; comparing nodes in the first directed graph structure with nodes in the second directed graph structure to detect nodes other than nodes including a corresponding character string; adding to the first directed graph structure the node detected in the second directed graph structure among the detected nodes as a first branch node; and estimating the format, based on the first directed graph structure including the first branch node added thereto.
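The steps named in the abstract (divide, arrange, compare, add a branch, estimate) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the delimiter set, the function names, the `<*>` placeholder, and the list-of-sets stand-in for the directed graph (position i implicitly links to position i+1) are all hypothetical, and the sketch only handles two messages that divide into the same number of nodes.

```python
import re

# Assumption: the "predetermined characters" are whitespace and a few
# punctuation marks. The source does not say which characters are used.
DELIMITERS = re.compile(r"[ \t,;=:\[\]]+")


def tokenize(message):
    """Divide a log message into portions at the delimiter characters."""
    return [tok for tok in DELIMITERS.split(message) if tok]


def build_graph(message):
    """Arrange the portions in order from the beginning of the message.

    Simplification: the directed graph is a list of sets, where entry i
    holds every node string seen at position i and edges run from
    position i to position i + 1.
    """
    return [{tok} for tok in tokenize(message)]


def merge(graph, message):
    """Compare a second message with the graph and add branch nodes.

    A node of the second message whose string does not correspond to the
    node at the same position is added there as a branch. Returns the
    list of (position, string) pairs that were added.
    """
    tokens = tokenize(message)
    if len(tokens) != len(graph):
        raise ValueError("this sketch only handles equal node counts")
    added = []
    for position, tok in enumerate(tokens):
        if tok not in graph[position]:
            graph[position].add(tok)
            added.append((position, tok))
    return added


def estimate_format(graph, placeholder="<*>"):
    """Emit fixed text where one node exists and a placeholder at branches."""
    return " ".join(
        next(iter(alts)) if len(alts) == 1 else placeholder for alts in graph
    )


graph = build_graph("user=alice login from 10.0.0.1")
branches = merge(graph, "user=bob login from 10.0.0.7")
print(branches)                # [(1, 'bob'), (4, '10.0.0.7')]
print(estimate_format(graph))  # user <*> login from <*>
```

Positional alignment keeps the sketch short. Messages of different lengths would need a real alignment step, which the abstract does not describe and the sketch does not attempt.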
25 Citations
20 Claims
1. A method for use in a computer to estimate a format of a log message, the method comprising:
a creating step of creating a first directed graph structure by dividing a first log message by predetermined characters to define divided portions as nodes and arranging the nodes in order from the beginning of the first log message;
a creating step of creating a second directed graph structure by dividing a second log message by the predetermined characters to define divided portions as nodes and arranging the nodes in order from the beginning of the second log message;
a detecting step of comparing nodes in the first directed graph structure with nodes in the second directed graph structure to detect a node in the first directed graph structure and a node in the second directed graph structure that are nodes other than nodes including a corresponding character string;
an adding step of adding to the first directed graph structure the node detected in the second directed graph structure among the detected nodes as a first branch node; and
an estimating step of estimating the format, based on the first directed graph structure including the first branch node added thereto, wherein the format includes a first portion associated with a node including a corresponding character string, a second portion associated with a node whose appearance tendency of character string is similar between the node detected in the first directed graph structure and the node detected in the second directed graph structure, and, optionally, a third portion associated with a node other than nodes having a similar appearance tendency of character string.
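The three kinds of format portion in the estimating step can be illustrated with a small classifier. The claim does not define "appearance tendency of character string" at this point, so the sketch below substitutes a simple, hypothetical measure: a character-class shape in which each run of digits becomes `9` and each run of letters becomes `a`. The function names and the labels `fixed`, `variable:` and `free` are likewise assumptions made for illustration.

```python
import re


def shape(text):
    """Hypothetical stand-in for 'appearance tendency'.

    Each run of digits collapses to '9' and each run of ASCII letters
    collapses to 'a'; every other character is kept as-is.
    """
    text = re.sub(r"[0-9]+", "9", text)
    return re.sub(r"[A-Za-z]+", "a", text)


def classify(first_node, second_node):
    """Label one pair of aligned nodes from the two graphs."""
    if first_node == second_node:
        # First portion: both nodes include a corresponding character string.
        return "fixed"
    if shape(first_node) == shape(second_node):
        # Second portion: the strings differ but share the same shape.
        return "variable:" + shape(first_node)
    # Third portion: the strings differ and so do their shapes.
    return "free"


def classify_portions(first_nodes, second_nodes):
    """Classify aligned node lists pairwise (assumes equal length)."""
    return [classify(a, b) for a, b in zip(first_nodes, second_nodes)]


print(classify_portions(
    ["login", "alice", "10.0.0.1", "ERR42"],
    ["login", "bob", "10.0.0.7", "timeout"],
))
# ['fixed', 'variable:a', 'variable:9.9.9.9', 'free']
```

With only two messages a shape comparison is the most this sketch can do; a measure built on more observations per node would be a different design, and the claim text here does not say which is intended.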
19. A computer for estimating a format of a log message, the computer comprising:
directed graph structure creation means for creating a first directed graph structure by dividing a first log message by predetermined characters to define divided portions as nodes and arranging the nodes in order from the beginning of the first log message, and creating a second directed graph structure by dividing a second log message by the predetermined characters to define divided portions as nodes and arranging the nodes in order from the beginning of the second log message;
node detection means for comparing nodes in the first directed graph structure with nodes in the second directed graph structure to detect a node in the first directed graph structure and a node in the second directed graph structure that are nodes other than nodes including a corresponding character string;
directed graph structure change means for adding to the first directed graph structure the node detected in the second directed graph structure among the detected nodes as a first branch node; and
format estimation means for estimating the format, based on the first directed graph structure including the first branch node added thereto, wherein the format includes a first portion associated with a node including a corresponding character string, a second portion associated with a node whose appearance tendency of character string is similar between the node detected in the first directed graph structure and the node detected in the second directed graph structure, and, optionally, a third portion associated with a node other than nodes having a similar appearance tendency of character string.
20. A computer program product for estimating a format of a log message, the computer program product comprising:
one or more computer-readable tangible storage medium and program instructions stored on at least one of the one or more tangible storage medium, the program instructions executable by a processor, the program instructions comprising:
program instructions to create a first directed graph structure by dividing a first log message by predetermined characters to define divided portions as nodes and arranging the nodes in order from the beginning of the first log message;
program instructions to create a second directed graph structure by dividing a second log message by the predetermined characters to define divided portions as nodes and arranging the nodes in order from the beginning of the second log message;
program instructions to compare nodes in the first directed graph structure with nodes in the second directed graph structure to detect a node in the first directed graph structure and a node in the second directed graph structure that are nodes other than nodes including a corresponding character string;
program instructions to add to the first directed graph structure the node detected in the second directed graph structure among the detected nodes as a first branch node; and
program instructions to estimate the format, based on the first directed graph structure including the first branch node added thereto, wherein the format includes a first portion associated with a node including a corresponding character string, a second portion associated with a node whose appearance tendency of character string is similar between the node detected in the first directed graph structure and the node detected in the second directed graph structure, and, optionally, a third portion associated with a node other than nodes having a similar appearance tendency of character string.
Specification