METHOD AND SYSTEM FOR AUTOMATED TRANSACTION ANALYSIS
First Claim
1. A method for automated transaction identification analysis, the method comprising the steps of:
- receiving a plurality of raw log files from at least one transaction unit into one or more log collector unit and storing the plurality of raw log files to at least one log storage unit from the log collector unit, wherein the log storage unit comprises at least one distributed file system;
processing the raw log file using at least one log parser unit to remove one or more log fields and one or more log entries from the raw log file that are not relevant for the transaction identification analysis to acquire one or more processed entries;
identifying at least one session of at least one user by clustering the processed one or more entries from a single user session using at least one session identifier unit, wherein the similar processed entries are clustered together to define a single user session;
identifying at least one user path that is followed by at least one user during a single user session from the clustered similar user entries using at least one path analysis unit and determine the length of each user path during a user session;
wherein one or more independent frequency and one or more total frequency are processed from the identified user paths;
identifying at least one entry point from the identified single user path, wherein the entry point of the user path with the highest independent frequencies is defined as the entry point to the user path; and
determining at least one transaction and at least one sub transaction from the identified entry point using at least one transaction identifier to get a plurality of individual transactions, a plurality of complete transactions and a plurality of sub transactions with their corresponding key metrics.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention relates to a method and a system to automate identification of transactions. The method comprises receiving raw log files from a transaction device into a log collector, storing the raw log files to a log storage, processing the raw log files by a log parser unit to remove the fields and the entries of the raw log files that are not relevant for the transaction identification analysis, identifying of at least one session of at least one user by clustering the processed one or more entries from a single user session using at least one session identifier unit, identifying at least one user path that is followed by at least one user during a single user session, identifying at least one entry point from the identified single user path and determining at least one transaction and at least one sub transaction from the identified entry point.
-
Citations
25 Claims
-
1. A method for automated transaction identification analysis, the method comprising the steps of:
-
receiving a plurality of raw log files from at least one transaction unit into one or more log collector unit and storing the plurality of raw log files to at least one log storage unit from the log collector unit, wherein the log storage unit comprises at least one distributed file system; processing the raw log file using at least one log parser unit to remove one or more log fields and one or more log entries from the raw log file that are not relevant for the transaction identification analysis to acquire one or more processed entries; identifying at least one session of at least one user by clustering the processed one or more entries from a single user session using at least one session identifier unit, wherein the similar processed entries are clustered together to define a single user session; identifying at least one user path that is followed by at least one user during a single user session from the clustered similar user entries using at least one path analysis unit and determine the length of each user path during a user session;
wherein one or more independent frequency and one or more total frequency are processed from the identified user paths;identifying at least one entry point from the identified single user path, wherein the entry point of the user path with the highest independent frequencies is defined as the entry point to the user path; and determining at least one transaction and at least one sub transaction from the identified entry point using at least one transaction identifier to get a plurality of individual transactions, a plurality of complete transactions and a plurality of sub transactions with their corresponding key metrics. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A system for automated transaction analysis, the system comprising:
-
at least one transaction device adapted to store a plurality of raw log files;
one or more log collector operatively configured with the transaction device to collect the raw logs files from the transaction device;at least one log storage operatively configured with the log collector, wherein the log storage comprises a distributed file system to store the raw log files; at least one log parser operatively configured with the log storage to purge the raw logs files and get a plurality of processed entries; at least one session identifier operatively configured with the log parser to identify a plurality of single user sessions and its associated metrics;
wherein the similar processed entries are clustered together to define a single user session;at least one path analyzer operatively configured with the session identifier to determine at least one path followed by the user during a single user session and its associated path metrics; at least one transaction analyzer operatively connected with the path analyzer to identifying at least one entry point from the determined single user path; and at least one transaction identifier operatively connected with the transaction analyzer to identify transactions and sub transactions performed by the users from the identified entry point. - View Dependent Claims (23, 24, 25)
-
-
20. The system of 19 wherein the transaction device comprises a web server.
- 21. The system of 19 wherein the session identifier comprises an URL analyzer to identify URL metrics.
Specification