×

Identifying and reporting on frequent sequences of events in usage data

  • US 7,051,029 B1
  • Filed: 01/05/2001
  • Issued: 05/23/2006
  • Est. Priority Date: 01/05/2001
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for identifying sequences of Web pages that are frequently visited in order non-consecutively during user browsing sessions, the Web pages of a sequence visited in order during a user browsing session when each Web page of the sequence is visited before a next Web page of the sequence is visited, the Web pages of a sequence visited non-consecutively during a user browsing session when at least one intervening Web page that is not part of the sequence is visited between the visits to the Web pages of the sequence, the method comprising:

  • receiving a Web server log generated by a Web server serving a Website having Web pages, the Web server log reflecting usage of the Website by users;

    identifying multiple user browsing sessions from the received Web server log, each identified user browsing session indicating a series of Web pages from the Website that were consecutively visited in order by a user; and

    identifying sequences of the Web pages of the Website that were frequently visited in order non-consecutively in the identified user browsing sessions bydetermining a minimum threshold number of user browsing sessions;

    determining multiple Web pages that are each visited during more of the identified user browsing sessions than the minimum threshold;

    creating a tree data structure having a root node and having a child node of the root node for each of the determined Web pages, each of the determined Web pages represented by one of the children nodes, the children nodes forming a current lowest level of the tree data structure;

    repeatedly expanding the tree data structure by adding a new lowest level of nodes that are children nodes to the nodes of a previous lowest level of the tree data structure, the added children nodes such that each of the determined Web pages has a node that represents that determined Web page that is added as a child node to each of the nodes of the previous level, each of the added children nodes having an associated sequence of Web pages consisting of the determined Web pages that are represented by the nodes in a path from the root node to that node;

    determining the nodes of the tree data structure whose associated sequence of Web pages is visited in order non-consecutively during more of the identified user browsing sessions than the minimum threshold; and

    removing the nodes of the tree data structure that are not among the determined nodes,such that after creation of the tree data structure is completed, the sequences of Web pages that are associated with the nodes remaining in the tree data structure are the identified sequences of the Web pages that were frequently visited in order non-consecutively during the identified user browsing sessions.

View all claims
  • 10 Assignments
Timeline View
Assignment View
    ×
    ×