Scalable log analytics
First Claim
1. A method for providing real-time analysis of log messages for a computer infrastructure, the method comprising:
- receiving a plurality of log messages including a first log message;
generating a sketch associated with the first log message, wherein the the sketch includes a tuple of fingerprint values generated by processing a subset of words of the first log message through a fingerprint function;
determining a message type for the first log message based on a comparison of the generated sketch to a plurality of sketches stored in an index, wherein log messages of a same message type have similar sketches;
determining a first log event associated with one or more of the plurality of log messages occurring with a time interval, wherein the first log event comprises a first composition of message types corresponding to the one or more of the plurality of log messages associated with the first log event;
determining an event type for the first log event based on a comparison of the first composition of message types to a plurality of compositions of message types stored in the index; and
determining an anomalous log event within the plurality of log messages based on the event type for the first log event.
2 Assignments
0 Petitions
Accused Products
Abstract
Large amounts of unstructured log data generated by software and infrastructure components of a computing system are processed and analyzed in real time to identify anomalies and potential problems within the computing system. A log analytics module reduces both the volume and level of detail of log data by first classifying log messages into message types based on their content similarity. The log analytics module may then further reduce data by grouping bursts of log messages into log events. Patterns within these log events, such as the collection and number of different message types that comprise the event, can be used to identify anomalous events.
-
Citations
20 Claims
-
1. A method for providing real-time analysis of log messages for a computer infrastructure, the method comprising:
-
receiving a plurality of log messages including a first log message; generating a sketch associated with the first log message, wherein the the sketch includes a tuple of fingerprint values generated by processing a subset of words of the first log message through a fingerprint function; determining a message type for the first log message based on a comparison of the generated sketch to a plurality of sketches stored in an index, wherein log messages of a same message type have similar sketches; determining a first log event associated with one or more of the plurality of log messages occurring with a time interval, wherein the first log event comprises a first composition of message types corresponding to the one or more of the plurality of log messages associated with the first log event; determining an event type for the first log event based on a comparison of the first composition of message types to a plurality of compositions of message types stored in the index; and determining an anomalous log event within the plurality of log messages based on the event type for the first log event. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A non-transitory computer-readable storage medium comprising instructions that, when executed in a computing device, provide real-time analysis of log messages for a computer infrastructure, by performing the steps of:
-
receiving a plurality of log messages including a first log message; generating a sketch associated with the first log message, wherein the sketch includes a tuple of fingerprint values generated by processing a subset of words of the first log message through a fingerprint function; determining a message type for the first log message based on a comparison of the generated sketch to a plurality of sketches stored in an index, wherein log messages of a same message type have similar sketches; determining a first log event associated with one or more of the plurality of log messages occurring with a first time interval, wherein the first log event comprises a first composition of message types corresponding to the one or more of the plurality of log messages associated with the first log event; determining an event type for the first log event based on a comparison of the first composition of message types to a plurality of compositions of message types stored in the index; and determining an anomalous log event within the plurality of log messages based on the event type for the first log event. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer system for providing real-time analysis of log messages for a computer infrastructure, the computer system comprising:
- a system memory;
a storage device having (i) a plurality of log messages including a first log message and (ii) an index having a plurality of sketches and compositions of message types; and
a processor programmed to carry out the steps of;generating a sketch associated with the first log message, wherein the sketch includes a tuple of fingerprint values generated by processing a subset of words of the first log message through a fingerprint function; determining a message type for the first log message based on a comparison of the generated sketch to a plurality of sketches stored in the index, wherein log messages of a same message type have similar sketches; determining a first log event associated with one or more of the plurality of log messages occurring with a first time interval, wherein the first log event comprises a first composition of message types corresponding to the one or more of the plurality of log messages associated with the first log event; determining an event type for the first log event based on a comparison of the first composition of message types to a plurality of compositions of message types stored in the index; and determining an anomalous log event within the plurality of log messages based on the event type for the first log event. - View Dependent Claims (20)
- a system memory;
Specification