
Systems and methods for clickstream analysis to modify an off-line business process involving forecasting demand

  • US 7,814,139 B2
  • Filed: 10/24/2007
  • Issued: 10/12/2010
  • Est. Priority Date: 03/07/2002
  • Status: Active Grant
First Claim

1. A method of processing data at a host computer, comprising:

    receiving a data file at the host computer containing records of a plurality of HTTP (HyperText Transfer Protocol) transactions of a plurality of users, each of the HTTP transactions including at least one URL (Uniform Resource Locator);

    converting the data file into a common file format at the host computer;

    cleansing the data file in the common file format at the host computer by applying a plurality of URL rules to remove session identifiers while accounting for the use of proxies, thereby producing a cleansed data file containing at least one modified URL;

    performing a panel selection process at the host computer, comprising;

    conducting a periodic survey of a subset of Internet users to determine characteristics of an Internet community that includes the subset of Internet users, the characteristics including demographic characteristics;

    selecting a panel of users from the subset of Internet users, the panel of users having a targeted combination of the demographic characteristics;

    retaining the data file only when it contains a user identifier of a user in the panel of users;

    performing a transformation process, comprising;

    removing from the data file those URLs that do not contain the user identifier;

    determining which URLs in the data file belong to a session of the user in the panel of users;

    assigning a session identifier to the session;

    creating a plurality of session data files, each of which contain the user identifier, a time stamp, the URLs in the data file that belong to the session, and the session identifier;

    decomposing the URLs in each of the plurality of session data files so that the URLs in each of the plurality of session data files are decomposed URLs; and

    hashing the decomposed URLs so that the decomposed URLs are hashed, decomposed URLs;

    calculating a metric of user behavior at the host computer based upon the hashed, decomposed URLs;

    merging the metric of user behavior into a file containing a plurality of metrics of online user behavior at the host computer; and

    transmitting from the host computer to a remote machine the file containing the plurality of metrics of online user behavior, wherein the file containing the plurality of metrics of online user behavior enables a third party to forecast offline demand for a good or service based on the plurality of metrics of online user behavior.
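
The cleansing, panel filtering, session assignment, URL decomposition, hashing, and metric steps recited above can be illustrated with a minimal sketch. It is not the patented implementation. Everything named here is an assumption made for illustration: the session-parameter names in SESSION_PARAMS, the 30-minute inactivity gap, the (user_id, timestamp, url) record layout, the use of SHA-256, and the sessions-per-host metric. Proxy handling, the survey-based panel selection, and file format conversion are not modeled.

```python
import hashlib
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

# Hypothetical URL rule: query parameters treated as session identifiers.
SESSION_PARAMS = {"sid", "sessionid", "jsessionid", "phpsessid"}
# Assumed inactivity timeout; the claim does not specify how sessions are bounded.
SESSION_GAP_SECONDS = 30 * 60


def cleanse_url(url):
    """Remove query parameters whose names look like session identifiers."""
    parts = urlsplit(url)
    kept = [(k, v) for k, v in parse_qsl(parts.query, keep_blank_values=True)
            if k.lower() not in SESSION_PARAMS]
    return urlunsplit((parts.scheme, parts.netloc, parts.path,
                       urlencode(kept), parts.fragment))


def decompose_url(url):
    """Split a URL into host, path, and query components."""
    parts = urlsplit(url)
    return {"host": parts.netloc.lower(),
            "path": parts.path or "/",
            "query": parts.query}


def hash_components(components):
    """Hash each decomposed component (SHA-256 is an illustrative choice)."""
    return {name: hashlib.sha256(value.encode("utf-8")).hexdigest()
            for name, value in components.items()}


def sessionize(records, panel):
    """Group (user_id, timestamp, url) records of panel users into sessions."""
    sessions = []
    open_session = {}  # user_id -> that user's most recent session
    for user_id, ts, url in sorted(records):
        if user_id not in panel:
            continue  # keep only records that carry a panel member's identifier
        current = open_session.get(user_id)
        if current is None or ts - current["last_ts"] > SESSION_GAP_SECONDS:
            current = {"user_id": user_id, "session_id": len(sessions) + 1,
                       "start_ts": ts, "last_ts": ts, "urls": []}
            sessions.append(current)
            open_session[user_id] = current
        current["last_ts"] = ts
        current["urls"].append(hash_components(decompose_url(cleanse_url(url))))
    return sessions


def sessions_per_host(sessions):
    """Example metric: number of sessions that touched each hashed host."""
    counts = {}
    for session in sessions:
        for host in {u["host"] for u in session["urls"]}:
            counts[host] = counts.get(host, 0) + 1
    return counts
```

Under these assumptions, calling sessionize(records, panel) with a list of records and a set of panel user identifiers returns one dictionary per session, and sessions_per_host(sessions) produces a single metric that could then be merged with others into the file sent to the remote machine.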
