System and method for estimating prevalence of digital content on the world-wide-web
Abstract
The present invention is a system, method and computer program product for tracking and measuring digital content that is distributed on a computer network such as the Internet. The system collects online advertisement data, analyzes the data, and uses the data to calculate measurements of the prevalence of those advertisements. The system processes raw traffic data by cleansing and summarizing the traffic data prior to storing the processed data in a database. An advertisement sampling system uses site selection and definition criteria and a probe map to retrieve Web pages from the Internet, extract advertisements from those Web pages, classify each advertisement, and store the data in a database. A statistical summarization system accesses the processed raw traffic data and the advertisement data in the database to calculate advertising prevalence statistics including the advertising frequency, impressions, and spending.
17 Claims
1. A system for estimating a number of times content has been accessed via a network, the system comprising:
an estimating device to determine an estimate of a number of times that a webpage has been accessed at a web server;
a prober to repeatedly send requests to the web server for the webpage and, in response, receive content files; and
a statistical summarization system including a processor to determine a number of times that a first content object is included in the content files received in response to the requests, determine a total number of the requests, and estimate a number of times that the first content object has been accessed by visitors of the webpage by:
determining a rotation rate for the first content object by dividing the number of times that the first content object was included in the content files received in response to the requests by the total number of the requests; and
determining the number of times that the first content object has been accessed by visitors by multiplying the estimate of the number of times that the webpage has been accessed by the rotation rate.
Dependent claims: 2, 3, 4, 5, 6, 7

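The arithmetic recited in claim 1 can be illustrated with a short sketch. This is only an illustration under stated assumptions, not the patented implementation: the function name `estimate_object_views`, its parameter names, and the sample figures are all hypothetical, and the page-view estimate is simply taken as an input.

```python
def estimate_object_views(page_views: float, times_seen: int, total_requests: int) -> float:
    """Estimate how often visitors were served a content object.

    page_views     -- estimated number of times the webpage was accessed
    times_seen     -- number of probe responses that included the object
    total_requests -- total number of probe requests sent for the webpage
    (All names are hypothetical; they do not come from the patent.)
    """
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    if times_seen < 0:
        raise ValueError("times_seen must not be negative")

    # Rotation rate: share of probe responses that contained the object.
    rotation_rate = times_seen / total_requests

    # Estimated accesses: page-view estimate scaled by the rotation rate.
    return page_views * rotation_rate


# Made-up figures: the object appeared in 25 of 100 probe responses,
# and the page is estimated to have been accessed 40,000 times.
estimate = estimate_object_views(page_views=40_000, times_seen=25, total_requests=100)
print(estimate)  # 10000.0
```

With these sample numbers the rotation rate is 0.25, so the object is estimated to have been accessed 10,000 times.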
8. A method of estimating a number of times content has been accessed via a network, the method comprising:
repeatedly sending requests for a webpage and, in response, receiving content files;
determining a number of times that a first content object is included in the content files received in response to the requests; and
estimating, with a processor, a number of times that the first content object has been accessed by visitors of the webpage by:
determining a rotation rate for the first content object by dividing the number of times that the first content object was included in the content files received in response to the requests by a total number of the requests; and
determining the number of times that the first content object has been accessed by visitors by multiplying an estimate of the number of times that the webpage has been accessed by the rotation rate.
Dependent claims: 9, 10, 11, 12

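The probing and counting steps of the method can be sketched as a loop that requests the page repeatedly and tallies which content objects come back. The sketch below is a hedged illustration only: `rotation_rates` and `fetch_objects` are hypothetical names, the network request is replaced by a stand-in callable so the example runs offline, and counting an object at most once per response is an assumption rather than something the claim states.

```python
from collections import Counter
from itertools import cycle
from typing import Callable, Dict, Iterable


def rotation_rates(fetch_objects: Callable[[], Iterable[str]],
                   num_requests: int) -> Dict[str, float]:
    """Probe a page num_requests times and return a rotation rate per object.

    fetch_objects is a hypothetical callable that performs one request and
    returns identifiers of the content objects found in the response.
    """
    if num_requests <= 0:
        raise ValueError("num_requests must be positive")

    seen = Counter()
    for _ in range(num_requests):
        # Assumption: an object counts once per response, even if repeated.
        seen.update(set(fetch_objects()))

    # Rotation rate = responses containing the object / total requests.
    return {obj: count / num_requests for obj, count in seen.items()}


# Stand-in for a web server that rotates content between requests.
responses = cycle([["ad-a"], ["ad-b"], ["ad-a", "ad-c"], ["ad-a"]])
rates = rotation_rates(lambda: next(responses), num_requests=4)
print(rates["ad-a"])  # 0.75
```

Passing the request step in as a callable keeps the counting logic separate from how pages are actually retrieved and parsed, which the claim leaves open.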
13. A storage device or storage disk storing instructions that, when executed, cause a machine to at least:
repeatedly send requests for a webpage;
determine a number of times that a first content object is included in content files received in response to the requests; and
estimate a number of times that the first content object has been accessed by visitors of the webpage by:
determining a rotation rate for the first content object by dividing the number of times that the first content object was included in the content files received in response to the requests by a total number of the requests; and
determining the number of times that the first content object has been accessed by visitors by multiplying an estimate of the number of times that the webpage has been accessed by the rotation rate.
Dependent claims: 14, 15, 16, 17

Specification