Method and System for Log File Analysis Based on Distributed Computing Network
First Claim
1. A method for log file analysis based on a distributed computing network, characterized in that the method comprises:
storing user identifiers and related log information into a log file;
dividing the log file into a plurality of target files, wherein log information having a common user identifier is included in the same target file;
separately analyzing the plurality of target files to obtain analysis results using two or more nodes; and
combining the analysis results of the nodes.
Abstract
The present invention discloses a method and a system for log file analysis based on a distributed computing network. The method includes: storing user identifiers and related log information into a log file; dividing the log file into target files, each including the log information having the same user identifier; separately analyzing the target files using at least two nodes to obtain analysis results; and combining the analysis results of the nodes. The method thereby establishes relationships among various log files through user identifiers, and further analyzes the relationships among a user's accesses to various contents of a website.
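The divide, analyze, and combine steps summarized above can be illustrated with a minimal sketch. It is not the implementation disclosed in the specification: the record layout `(user_id, timestamp, url)`, the hash-based assignment of users to target files, the per-user page-path analysis, and the function names `divide`, `analyze`, `combine`, and `run` are all assumptions made for illustration, and worker threads stand in for the nodes of the distributed computing network.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
import zlib

# Hypothetical log records: (user_id, timestamp, url). This layout is assumed.
LOG = [
    ("u1", 3, "/home"),
    ("u2", 1, "/home"),
    ("u1", 5, "/cart"),
    ("u3", 2, "/search"),
    ("u2", 4, "/search"),
]


def divide(log, num_targets):
    """Split the log so that all records of one user land in the same target."""
    targets = [[] for _ in range(num_targets)]
    for record in log:
        user_id = record[0]
        # A stable hash of the user identifier picks the target (assumed scheme).
        index = zlib.crc32(user_id.encode("utf-8")) % num_targets
        targets[index].append(record)
    return targets


def analyze(target):
    """Illustrative per-node analysis: each user's pages in time order."""
    paths = defaultdict(list)
    for user_id, _timestamp, url in sorted(target, key=lambda r: r[1]):
        paths[user_id].append(url)
    return dict(paths)


def combine(results):
    """Merge per-node results; users never overlap between targets."""
    merged = {}
    for partial in results:
        merged.update(partial)
    return merged


def run(log, num_targets=2):
    targets = divide(log, num_targets)
    # Threads stand in for separate nodes in this sketch.
    with ThreadPoolExecutor(max_workers=num_targets) as pool:
        results = list(pool.map(analyze, targets))
    return combine(results)


if __name__ == "__main__":
    print(run(LOG))
```

Because every record of a given user is routed to a single target, each node can reconstruct that user's full access sequence without consulting other nodes, and the combining step reduces to a merge of disjoint per-user results.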
55 Citations
12 Claims
1. A method for log file analysis based on a distributed computing network, characterized in that the method comprises:
storing user identifiers and related log information into a log file;
dividing the log file into a plurality of target files, wherein log information having a common user identifier is included in the same target file;
separately analyzing the plurality of target files to obtain analysis results using two or more nodes; and
combining the analysis results of the nodes.
9. A system for log file analysis based on a distributed computing network, characterized in that the system comprises a log analysis server and a plurality of nodes, wherein the log analysis server comprises:
a collection unit used for collecting a log file of a web server, the log file including user identifiers and log information related to the user identifiers;
a storage unit used for storing the log file collected by the collection unit;
a first interface unit used for receiving and sending data; and
a division unit used for dividing the log file in the storage unit into multiple target files, wherein the log information having a common user identifier is included in the same target file;
the nodes comprise:
a second interface unit used for receiving and sending data; and
a processing unit used for analyzing the target files; and
the log analysis server further comprises:
a combination unit used for combining analysis results of the plurality of nodes.
and the division unit further comprises: a target file generation unit used for combining the identifier files sent from the nodes which have the same user identifier into a single file to form the respective target file.
12. The system for log file analysis as recited in claim 11, characterized in that the nodes further comprise:
an ordering unit used for ordering the log information in the log file according to times of creation of the log information.
Specification