Parsing navigation information to identify occurrences corresponding to defined categories
DCFirst Claim
1. A computer-implemented method for using predefined parsing information to analyze web site navigation data in order to identify occurrences corresponding to defined category types, the method comprising:
- for each of multiple distinct web sites each having multiple web pages, receiving web site navigation data associated with the web site that has multiple entries each containing information about a request for a web page of the web site and a response to the request, the request including an indicated URL and sent to a web site server for the web site, each of the indicated URLs optionally including a URL path portion and optionally including a query string portion, the contained information about each request including any URL path portion that is included in the URL indicated for the request and including any query string portion that is included in the URL indicated for the request;
retrieving predefined parsing information associated with the web site that includes multiple distinct definitions of logical sites and multiple distinct definitions of category types, each logical site definition specifying an IP address and port number used by a web site server to provide at least some of the web pages of the web site, and each category type definition specifying one of the logical sites and indicating multiple web pages of the web site of that category type by including a URL pattern that is capable of matching the URL paths of the URLs corresponding to the multiple web pages and a query string pattern capable of matching the query strings of those corresponding URLs;
for each entry of the received web site navigation data, analyzing the information contained in the entry to determine if the web site server to which the request was sent matches any of the defined logical sites by using the IP address and the port number specified by that logical site; and
if a logical site is determined to match the web site server, further analyzing the information contained in the entry by storing, for at least one of the category types whose definition specifies the matching logical site, an indication of an occurrence of a request for a web page of that one category type if the information contained in the entry includes a URL path portion that matches the URL pattern included in that one category type definition and includes a query string portion that matches the query string pattern included in that one category type definition;
receiving a request from an operator of the web site to provide information for the web site about occurrences of requests for web pages of specified category types;
retrieving in response the stored indications of the occurrences of the requests for web pages of the specified category types; and
providing the retrieved information to the operator, so that the operators of the multiple web sites can receive information about occurrences of requests for web pages of category types of interest for their web sites.
9 Assignments
Litigations
0 Petitions
Accused Products
Abstract
A method, system and computer-readable medium for analyzing interaction or usage data, such as for customers, is described. The interaction or usage data may be stored in log files and supplemented with data from other sources. Various data parsing information may be defined and used as part of the analysis, such as by using customer-specific information to identify various occurrences of interest. For example, when analyzing a customer'"'"'s web site interaction data, the parser component can use data defining customer-specific categories of web pages. Such high-level types of occurrences can be specified in a variety of ways, such as by using a combination of a logical web site, one or more URIs corresponding to web pages, and/or one or more query strings. The data parsing information may also specify a mapping of actual web sites to one or more logical sites.
114 Citations
111 Claims
-
1. A computer-implemented method for using predefined parsing information to analyze web site navigation data in order to identify occurrences corresponding to defined category types, the method comprising:
for each of multiple distinct web sites each having multiple web pages, receiving web site navigation data associated with the web site that has multiple entries each containing information about a request for a web page of the web site and a response to the request, the request including an indicated URL and sent to a web site server for the web site, each of the indicated URLs optionally including a URL path portion and optionally including a query string portion, the contained information about each request including any URL path portion that is included in the URL indicated for the request and including any query string portion that is included in the URL indicated for the request;
retrieving predefined parsing information associated with the web site that includes multiple distinct definitions of logical sites and multiple distinct definitions of category types, each logical site definition specifying an IP address and port number used by a web site server to provide at least some of the web pages of the web site, and each category type definition specifying one of the logical sites and indicating multiple web pages of the web site of that category type by including a URL pattern that is capable of matching the URL paths of the URLs corresponding to the multiple web pages and a query string pattern capable of matching the query strings of those corresponding URLs;
for each entry of the received web site navigation data, analyzing the information contained in the entry to determine if the web site server to which the request was sent matches any of the defined logical sites by using the IP address and the port number specified by that logical site; and
if a logical site is determined to match the web site server, further analyzing the information contained in the entry by storing, for at least one of the category types whose definition specifies the matching logical site, an indication of an occurrence of a request for a web page of that one category type if the information contained in the entry includes a URL path portion that matches the URL pattern included in that one category type definition and includes a query string portion that matches the query string pattern included in that one category type definition;
receiving a request from an operator of the web site to provide information for the web site about occurrences of requests for web pages of specified category types;
retrieving in response the stored indications of the occurrences of the requests for web pages of the specified category types; and
providing the retrieved information to the operator, so that the operators of the multiple web sites can receive information about occurrences of requests for web pages of category types of interest for their web sites. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
13. A computer-implemented method for analyzing interaction data to identify occurrences corresponding to defined groups of related items, the method comprising:
-
receiving an indication of interaction data that is associated with a content set having multiple items, the interaction data having one or more entries that are each related to an interaction with at least one of the items of the content set;
receiving an indication of at least one communication definition that specifies a manner of communicating content set item interactions;
receiving an indication of multiple category type definitions each specifying a group of related content set items and each associated with one of the communication definitions, and for each entry of the interaction data, determining whether the entry matches one of the category type definitions in such a manner that the related interaction for the entry is with a content set item that is a member of the group specified by that category type definition and was communicated in the manner specified by the communication definition associated with that category type definition; and
when it is determined that the entry matches one of the category type definitions, storing an indication of an occurrence of that category type. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58)
-
-
59. A computer-readable medium whose contents cause a computing device to analyze data to identify occurrences corresponding to defined groups of items, by performing a method comprising:
-
receiving an indication of data that is associated with a content set having multiple items, the data having one or more entries that are each related to an interaction with at least one of the items of the content set;
receiving an indication of multiple definitions each specifying a group of related content set items; and
for each entry of the data, determining whether the entry matches one of the definitions in such a manner that the related interaction for the entry is with a content set item that is a member of the group specified by that definition; and
when it is determined that the entry matches one of the definitions, indicating an occurrence of an interaction with the group of items specified by that definition. - View Dependent Claims (60, 61, 62)
-
-
63. A computing device for analyzing interaction data to identify occurrences corresponding to defined category types, comprising:
-
an interaction data receiver component capable of receiving an indication of interaction data that is associated with a content set having multiple items, the interaction data having one or more entries that are each related to an interaction with at least one of the items of the content set;
a definition receiver component capable of receiving an indication of at least one communication definition that specifies a manner of communicating content set item interactions and of receiving an indication of multiple category type definitions each specifying a group of related content set items and each associated with one of the communication definitions; and
an interaction data parsing component capable of, for each entry of the interaction data, determining whether the entry matches one of the category type definitions in such a manner that the related interaction for the entry is with a content set item that is a member of the group specified by that category type definition and was communicated in the manner specified by the communication definition associated with that category type definition and of storing an indication of an occurrence of a category type when it is determined that an entry matches the definition for that category type. - View Dependent Claims (64)
-
-
65. A computer system for analyzing interaction data to identify occurrences corresponding to defined category types, comprising:
-
means for receiving an indication of interaction data that is associated with a content set having multiple items, the interaction data having one or more entries that are each related to an interaction with at least one of the items of the content set;
means for receiving an indication of at least one communication definition that specifies a manner of communicating content set item interactions and for receiving an indication of multiple category type definitions each specifying a group of related content set items and each associated with one of the communication definitions; and
means for, for each entry of the interaction data, determining whether the entry matches one of the category type definitions in such a manner that the related interaction for the entry is with a content set item that is a member of the group specified by that category type definition and was communicated in the manner specified by the communication definition associated with that category type definition, and for storing an indication of an occurrence of a category type when it is determined that an entry matches the definition for that category type.
-
-
66. A computer-implemented method for analyzing interaction data for a web site to identify occurrences corresponding to defined category types, the method comprising:
-
receiving an indication of multiple interaction data entries each containing information about an interaction with a web site that includes a specified URL corresponding to one of multiple web pages of the web site, each of the specified URLs optionally including a URL path portion and optionally including a query string portion, the contained information for each entry including any URL path portion that is included in the specified URL for the entry and including any query string portion that is included in the specified URL for the entry;
receiving an indication of multiple category type definitions that each specify a group of web pages related to a category by using a URL path pattern capable of matching at least one URL path related to the category and using a query string pattern capable of matching at least one query string related to the category; and
for each entry, analyzing the entry to determine whether the entry matches one of the category type definitions by containing information about a specified URL corresponding to a web page that is related to the category for that category type definition, the matching such that the contained information includes a URL path portion that matches the URL path pattern for that one category type definition and includes a query string portion that matches the query string pattern for that one category type definition; and
when it is determined that the entry matches one of the category type definitions, storing an indication of an occurrence of that category type for the web site. - View Dependent Claims (67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80)
-
-
81. A computer-readable medium containing instructions that when executed cause a computer system to analyze data to identify occurrences corresponding to defined groups of web pages, by performing a method comprising:
-
receiving an indication of multiple data entries each containing information about an interaction with a web site that includes a specified URL corresponding to one of multiple web pages of the web site, each of the specified URLs optionally including a URL path portion and optionally including a query string portion, the contained information for each entry including any URL path portion that is included in the specified URL for the entry and including any query string portion that is included in the specified URL for the entry;
receiving an indication of multiple definitions that each specify a group of web pages related to a category by using a URL path pattern capable of matching at least one URL path related to the category and using a query string pattern capable of matching at least one query string related to the category; and
for each entry, analyzing the entry to determine whether the entry matches one of the definitions by containing information about a specified URL corresponding to a web page that is related to the category for that definition, the matching such that the contained information includes a URL path portion that matches the URL path pattern for that one definition and includes a query string portion that matches the query string pattern for that one definition; and
when it is determined that the entry matches one of the definitions, indicating an occurrence of an interaction with the group of web pages specified by that definition.
-
-
82. A computer-implemented method for analyzing interaction data for a web site to identify occurrences corresponding to defined category types, the method comprising:
-
receiving an indication of multiple interaction data entries each containing information about a request that specifies a URL corresponding to a web page of a web site, each of the specified URLs optionally including a URL path portion and optionally including a query string portion, each of the query string portions including one or more combinations each having a query parameter name and corresponding query value, the contained information about each request including any URL path portion that is included in the specified URL for the request and including any query string portion that is included in the specified URL for the request;
receiving an indication of a category type definition corresponding to multiple categories, the category type definition specifying a URL path pattern capable of matching at least one URL path related to the multiple categories and a query string pattern capable of matching at least one query string related to the multiple categories, each query string pattern indicating one or more query parameter names, the category type definition further specifying a name definition for providing a unique name for each of the multiple categories, the name definition including at least one of the indicated query parameter names and indicating how values for each of the included query parameter names are to be combined to form the names of the multiple categories, each unique combination of values for the indicated query parameter names corresponding to one of the multiple categories; and
for each entry, analyzing the entry to determine whether the entry matches one of the category type definitions by containing information about a request corresponding to a web page that is related to the category for that category type definition, the matching such that the information contained in the entry includes a URL path portion that matches the URL path pattern specified in that one category type definition and includes a query string portion that matches the query string pattern specified in that one category type definition; and
when it is determined that the entry matches one of the category type definitions, determining the name of the category to which the entry corresponds by retrieving the value from the query string portion of the contained information for the entry that corresponds to each of the query parameter names included in the name definition and by combining the retrieved values in the manner indicated in the name definition; and
storing an indication of an occurrence of the category having the name formed by the combined retrieved values. - View Dependent Claims (83)
-
-
84. A computer-implemented method for analyzing interaction data for a web site to identify occurrences corresponding to defined category types, the method comprising:
-
receiving an indication of multiple interaction data entries each containing information about a request that specifies a URL corresponding to a web page of a web site, each of the specified URLs optionally including a URL path portion and optionally including a query string portion, the contained information about each request including any URL path portion that is included in the specified URL for the request and including any query string portion that is included in the specified URL for the request;
receiving an indication of multiple category type definitions that each specify a group of web pages related to a category with multiple page type patterns that each specify a distinct combination of a URL path pattern capable of matching at least one URL path related to the category and a query string pattern capable of matching at least one query string related to the category; and
for each entry, analyzing the entry to determine whether the entry matches one of the category type definitions by containing information about a request corresponding to a web page that is related to the category for that category type definition, the matching such that, for any of the page type patterns included in that one category type, definition, the information contained in the entry includes a URL path portion and a query string portion that match the URL path pattern and the query string pattern specified in that page type pattern; and
when it is determined that the entry matches one of the category type definitions, storing an indication of an occurrence of that category type for the web site. - View Dependent Claims (85)
-
-
86. A computer-implemented method for analyzing interaction data for a web site to identify occurrences corresponding to defined category types, the method comprising:
-
receiving an indication of multiple interaction data, entries each containing information about a request that specifies a URL corresponding to a web page of the web site, each of the specified URLs optionally including a URL path portion and optionally including a query string portion, each of the query string portions including one or more combinations each having a query parameter name and corresponding query value, the contained information about each request including any URL path portion that is included in the specified URL for the request and including any query string portion that is included in the specified URL for the request;
receiving an indication of multiple category type definitions that each specify a group of web pages related to a category with a URL path pattern capable of matching at least one URL path related to the category and a query string pattern capable of matching at least one query string related to the category, each query string pattern indicating one or more query parameter names whose presence in a query string is required, allowed, or disallowed if that query string is to match the query string pattern; and
for each entry, analyzing the entry to determine whether the entry matches one of the category type definitions by containing information about a request corresponding to a web page that is related to the category for that category type definition, the matching such that the information contained in the entry (a) includes a URL path portion that matches the URL path pattern specified in that one category type definition and (b) includes a query string portion that includes each of the query parameter names whose presence is indicated in the query string pattern specified in that one category type definition to be required, and that does not include any of the query parameter names whose presence is indicated in the query string pattern specified in that one category type definition to be disallowed; and
when it is determined that the entry matches one of the category type definitions, storing an indication of an occurrence of that one category type for the web site. - View Dependent Claims (87, 88)
-
-
89. A computer-implemented method for analyzing interaction data to identify occurrences corresponding to defined hierarchies of items, the method comprising:
-
receiving an indication of multiple interaction data entries each containing information related to an interaction with one of multiple items of a content set, the content set items structured in a hierarchy having multiple hierarchy members at multiple hierarchy levels;
receiving an indication of multiple hierarchy definitions that each correspond to one or more related hierarchy members; and
for each entry, analyzing the entry to determine whether the entry matches one of the hierarchy definitions by containing information about an interaction with a content set item that is one of the hierarchy members to which that one hierarchy definition corresponds; and
when it is determined that the entry matches one of the hierarchy definitions, indicating an occurrence of an interaction with the related hierarchy members to which that one hierarchy definition corresponds. - View Dependent Claims (90, 91, 92, 93, 94)
-
-
95. A computer-implemented method for analyzing usage data to identify occurrences corresponding to defined groups of features, the method comprising:
-
receiving an indication of usage data associated with a provided service or an executing computer program that has multiple features available for use, the usage data having multiple entries each related to a distinct use of one of multiple features of the provided service or executing computer program that includes information being communicated;
receiving an indication of multiple definitions each specifying a group of features related to a category and each associated with a manner of communicating information to the provided service or to the executing computer program; and
for each entry of the usage data, determining whether the entry matches one of the definitions such that the related use for the entry is of a feature that is a member of the group of features specified by that definition and such that the information communicated for the related use is communicated in the manner associated with that definition, and when it is determined that the entry matches one of the definitions, storing an indication of an occurrence of a use of the group of features specified by that definition.
-
-
96. A computer-readable medium containing a data structure that stores multiple definitions for category types so that occurrences of those category types in interaction data for a web site can be identified, the data structure having multiple entries each corresponding to a category type definition that specifies a group of web pages related to a category, each entry storing a URL path pattern capable of matching at least one URL path related to the category and a query string pattern capable of matching at least one query string related to the category,
such that when analyzing information about an interaction with a web page of the web site having a specified URL that optionally includes a URL path portion and optionally includes a query string portion, if the web page is determined to be a member of the group of web pages specified by a category type definition then an occurrence of that category type is indicated, the web page determined to be a member of the group of web pages for a category type definition if the specified URL includes a URL path portion that matches the URL path pattern specified for that category type definition and includes a query string portion that matches the query string pattern specified for that category type definition.
-
102. A computer-readable medium containing a data structure storing multiple definitions for category types so that occurrences of those category types can be identified in interaction data or usage data, the data structure having multiple entries each corresponding to a category type definition, each entry specifying a group of related content set items for a content set and including an indication of a communication definition that specifies a manner of communicating information related to interactions or uses corresponding to the content set items,
so that when analyzing data about an interaction or use that corresponds to a content set item and that indicates a manner in which related information was communicated, if the data matches one of the category type definitions in such a manner that the interaction or use corresponds to one of the content set items in the group specified by that category type definition and had related information that was communicated in the manner specified by the communication definition indicated by that category type definition, an occurrence of that category type can be identified.
-
103. A method for analyzing customer data to identify occurrences corresponding to defined categories, the method comprising:
-
receiving a request from a customer to analyze interaction or usage data for that customer related to a content set having multiple content set items;
receiving an indication of definitions for the customer that each specify a group of content set items related to a category and are each associated with at least one manner of communicating information;
receiving a first set of data for the customer that includes information about at least one interaction or use;
analyzing the received set of data to determine whether the received data includes information about any interactions or uses that match one of the definitions in such a manner that the interaction or use is with a content set item in the group specified by that definition and had related information communicated in a manner associated with that definition; and
when it is determined that the received data matches one of the definitions, providing information to the customer about an occurrence for that category. - View Dependent Claims (104, 105, 106, 107)
-
-
108. A method for creating definitions of category types for analyzing interaction data for a web site to identify occurrences corresponding to defined category types, the method comprising:
-
receiving an indication of a log file for the web site or of other information related to the web site that indicates multiple interactions with a web site server for the web site, each indicated interaction having associated information including network address information for the web site server and a URL specified as part of the interaction;
analyzing the log file or the other information to identify distinct network addresses for the web site servers for the web site, and generating a site definition for each of the identified network addresses that includes that network address; and
analyzing the log file or the other information to identify groups of related web pages, and generating a category type definition for each of the identified groups, so that information about an interaction with the web site can be analyzed to determine whether the information matches one of the category type definitions in such a manner that the interaction is with a web page that is a member of the group specified by that category type definition and was with a web site server having a network address that matches one of the site definitions. - View Dependent Claims (109, 110, 111)
-
Specification