Method and system for site path evaluation using web session clustering
First Claim
1. A method for identifying properties of a plurality of web page traversal paths, comprising:
- acquiring data from a plurality of sessions corresponding to said plurality of web page traversal paths;
grouping, by a computer, said web page traversal paths into web page categories, wherein grouping said web page traversal paths into said web page categories includes correlating traffic amount relationships between specific ones of a plurality of web pages in said web page traversal paths to identify said web page categories, wherein correlating the traffic amount relationships between the specific web pages comprises correlating the specific web pages based on an amount of traffic between the specific web pages;
using, by the computer, the web page categories to map the plurality of sessions to new sessions;
clustering, by the computer, said new sessions according to a similarity measure into a plurality of web session clusters, wherein clustering said new sessions comprises defining the similarity measure between sessions, and partitioning said new sessions into said plurality of web session clusters according to said similarity measure; and
selecting, by the computer, one of said plurality of web session clusters most closely exhibiting at least one predefined metric from said plurality of web session clusters for analysis of properties of a web page traversal path contained therein.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and system for site path evaluation using web session clustering is provided. The method identifies properties of a plurality of web site traversal paths. Data is acquired from a plurality of sessions corresponding to at least a portion of the plurality of web page traversal paths. Portions of the web site traversal paths are grouped into a unified web page category. The plurality of sessions is clustered into a plurality of web session clusters according to a similarity measure. One of the plurality of web session clusters most closely exhibiting at least one predefined metric is selected for analysis of the propertied of a web page traversal path contained therein. A system includes a plurality of web pages, a monitoring program and a computational process configured to receive data and identify properties of the plurality of traversal paths.
21 Citations
14 Claims
-
1. A method for identifying properties of a plurality of web page traversal paths, comprising:
-
acquiring data from a plurality of sessions corresponding to said plurality of web page traversal paths; grouping, by a computer, said web page traversal paths into web page categories, wherein grouping said web page traversal paths into said web page categories includes correlating traffic amount relationships between specific ones of a plurality of web pages in said web page traversal paths to identify said web page categories, wherein correlating the traffic amount relationships between the specific web pages comprises correlating the specific web pages based on an amount of traffic between the specific web pages; using, by the computer, the web page categories to map the plurality of sessions to new sessions; clustering, by the computer, said new sessions according to a similarity measure into a plurality of web session clusters, wherein clustering said new sessions comprises defining the similarity measure between sessions, and partitioning said new sessions into said plurality of web session clusters according to said similarity measure; and selecting, by the computer, one of said plurality of web session clusters most closely exhibiting at least one predefined metric from said plurality of web session clusters for analysis of properties of a web page traversal path contained therein. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A non-transitory computer-readable medium having computer-executable instructions for identifying properties of a plurality of web page traversal paths, said computer-executable instructions executable for:
-
acquiring data from a plurality of sessions corresponding to said plurality of web page traversal paths; grouping said web page traversal paths into web page categories, wherein grouping said web page traversal paths into said web page categories includes correlating traffic amount relationships between specific ones of a plurality of web pages in said web page traversal paths to identify said web page categories, wherein correlating the traffic amount relationships between the specific web pages comprises correlating the specific web pages based on an amount of traffic between the specific web pages; using the web page categories to map the plurality of sessions to new sessions; clustering said new sessions according to a similarity measure into a plurality of web session clusters, wherein clustering said new sessions comprises defining the similarity measure between sessions, and partitioning said new sessions into said plurality of web session clusters according to said similarity measure; and selecting one of said plurality of web session clusters most closely exhibiting at least one predefined metric from said plurality of web session clusters for analysis of properties of a web page traversal path contained therein. - View Dependent Claims (11, 12)
-
-
13. A method comprising:
-
in plural sessions, sending requests from a browser to a web site for web pages of the web site, wherein the web pages include corresponding tags; in response to receiving a particular one of the web pages, the browser using the tag of the particular web page to request a monitoring service by a monitoring program separate from the web site; grouping, by a computer, URLs of the web pages into page categories, wherein grouping the URLs includes correlating traffic amount relationships between specific ones of the web pages to identify the page categories, wherein correlating the traffic amount relationships between the specific web pages comprises correlating the specific web pages based on an amount of traffic between the specific web pages; using, by the computer, the page categories to map the plural sessions to new sessions having reduced attribute information; and clustering, by the computer, the new sessions into clusters, wherein clustering the new sessions includes defining a similarity measure between sessions, and partitioning the new sessions into the clusters according to the similarity measure. - View Dependent Claims (14)
-
Specification