Methods and apparatus to correct errors in audience measurements for media accessed using over-the-top devices
Abstract
Methods and apparatus to correct errors in measuring audiences of over-the-top media are disclosed. In some examples, the methods and apparatus identify a first set of data from a first data source, the first set of data different from a second set of data from a second data source. In some examples, the methods and apparatus generate a third set of data based on the second set of data from the second data source. In some examples, the methods and apparatus generate a model based on a difference between the first set of data and the third set of data. In some examples, the methods and apparatus apply the model to the first set of data. In some examples, the methods and apparatus assign viewership to an impression associated with the first set of data by imputing viewership associated with the second set of data to the first set of data.
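As an illustration of the steps in the abstract, the following Python sketch shows one way a correction model could be built and applied. It is not the patented implementation: the record fields (`reported`, `actual`), the demographic bucket labels, and the count-based per-bucket estimators are all assumptions made for this example. Here the "third set of data" is assumed to be panel records that carry both a reported and an actual demographic, and the "model" is assumed to be a set of independent yes/no estimators, one per bucket.

```python
from collections import Counter, defaultdict


def fit_independent_binary_models(panel_records, buckets):
    """Fit one yes/no estimator per demographic bucket (hypothetical design).

    models[b][reported] estimates P(actual == b | reported) from panel counts.
    """
    seen = Counter(r["reported"] for r in panel_records)
    hits = defaultdict(Counter)  # hits[actual][reported] = number of records
    for r in panel_records:
        hits[r["actual"]][r["reported"]] += 1
    return {
        b: {rep: hits[b][rep] / n for rep, n in seen.items()}
        for b in buckets
    }


def correct_demographics(impressions, models):
    """Apply every per-bucket estimator and keep the highest-scoring bucket."""
    corrected = []
    for imp in impressions:
        scores = {b: m.get(imp["reported"], 0.0) for b, m in models.items()}
        best = max(scores, key=scores.get)
        if scores[best] == 0.0:
            # No panel evidence for this reported value: leave it unchanged.
            best = imp["reported"]
        corrected.append({**imp, "corrected": best})
    return corrected


# Assumed "third set": panel records with reported and actual demographics.
panel = [
    {"reported": "M18-34", "actual": "M18-34"},
    {"reported": "M18-34", "actual": "M18-34"},
    {"reported": "M18-34", "actual": "F18-34"},
    {"reported": "F35-54", "actual": "M35-54"},
    {"reported": "F35-54", "actual": "M35-54"},
    {"reported": "F35-54", "actual": "F35-54"},
]
buckets = ["M18-34", "F18-34", "M35-54", "F35-54"]

# Assumed "first set": impressions with possibly wrong reported demographics.
impressions = [
    {"id": 1, "reported": "M18-34"},
    {"id": 2, "reported": "F35-54"},
    {"id": 3, "reported": "F18-34"},
]

models = fit_independent_binary_models(panel, buckets)
result = correct_demographics(impressions, models)
```

In this toy data, impressions reported as `F35-54` are reassigned to `M35-54` because two of the three matching panel records show that actual demographic. A production model would likely use more features than the reported bucket alone.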
20 Claims
1. A method comprising:
identifying, by executing an instruction via a processor, a first set of impression data received from a computer at a first data source, the first set of impression data having matched demographic data from users registered with both an over-the-top (OTT) device and a database proprietor, the first set of impression data different from a second set of data from a second data source, the computer producing a misattribution error in the first set of impression data, the misattribution error based on a demographic data error in the first set of impression data, the demographic data error based on a difference between reported demographic data in the first set of impression data and actual demographic data corresponding to the first set of impression data;
generating, by executing an instruction via the processor, a third set of data based on the second set of data from the second data source;
generating, by executing an instruction via the processor, an independent binary model based on a difference between the first set of impression data and the third set of data;
correcting the demographic data error in the first set of impression data by applying, by executing an instruction via the processor, the independent binary model to the first set of impression data to generate corrected demographic data; and
correcting the misattribution error produced by the computer by assigning, by executing an instruction via the processor, viewership to an impression associated with the first set of impression data using the corrected demographic data.
Dependent claims: 2, 3, 4, 5
6. A method comprising:
identifying, by executing an instruction via a processor, a first set of impression data received from a computer at a first data source, the first set of impression data different from a second set of data from a second data source, the computer producing a misattribution error in the first set of impression data, the misattribution error based on a demographic data error in the first set of impression data, the demographic data error based on a difference between reported demographic data in the first set of impression data and actual demographic data corresponding to the first set of impression data;
generating, via the processor, a third set of data based on the second set of data from the second data source;
generating, via the processor, an independent binary model based on a difference between the first set of impression data and the third set of data; and
correcting the demographic data error in the first set of impression data by applying, via the processor, the independent binary model to the first set of impression data to generate corrected demographic data; and
correcting the misattribution error produced by the computer by assigning, via the processor, viewership to an impression associated with the first set of impression data using the corrected demographic data, the assigning of the viewership to the impression includes:
identifying viewing history associated with the second set of data;
determining a first time associated with a first demographic viewing a media presentation in a first household associated with the second set of data;
determining a second time associated with the first demographic and a second demographic viewing the media presentation in the household;
determining a first probability that the first demographic viewed the media presentation by dividing the first time by the second time;
identifying a first person in the first household associated with the second set of data having a second probability similar to the first probability; and
imputing a viewing history of the first person to a second person in a second household associated with the first set of impression data.
Dependent claims: 7
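The viewership-assignment steps recited in claim 6 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the claimed implementation: times are assumed to be in minutes, the "second time" is assumed to be the combined viewing time of the first and second demographics, "similar" is assumed to mean the smallest absolute difference in probability, and all function and field names (`viewing_probability`, `closest_panelist`, `impute_history`, `probability`, `viewing_history`) are hypothetical.

```python
def viewing_probability(first_time, second_time):
    """Probability that the first demographic viewed: first time / second time."""
    if second_time <= 0:
        return 0.0  # assumed convention when no viewing time was recorded
    return first_time / second_time


def closest_panelist(target_probability, panelists):
    """Pick the panelist whose probability is nearest to the target (assumed metric)."""
    return min(panelists, key=lambda p: abs(p["probability"] - target_probability))


def impute_history(donor, recipient):
    """Copy the donor's viewing history onto the recipient record."""
    return {**recipient, "viewing_history": list(donor["viewing_history"])}


# First demographic watched 30 minutes; both demographics together watched 120.
first_probability = viewing_probability(30, 120)

# Hypothetical panel members from the household in the second data set.
panelists = [
    {"id": "A", "probability": 0.6, "viewing_history": ["news"]},
    {"id": "B", "probability": 0.3, "viewing_history": ["drama", "sports"]},
    {"id": "C", "probability": 0.1, "viewing_history": ["cartoons"]},
]

donor = closest_panelist(first_probability, panelists)

# Hypothetical person in a household from the first (impression) data set.
recipient = {"id": "X", "household": "H2"}
imputed = impute_history(donor, recipient)
```

With these numbers the first probability is 0.25, panelist `B` is the nearest match, and the recipient inherits the history `["drama", "sports"]`.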
8. An apparatus comprising:
a demographic corrector to:
identify a first set of impression data received from a computer at a first data source, the first set of impression data having matched demographic data from users registered with both an over-the-top (OTT) device and a database proprietor, the first set of impression data different from a second set of data from a second data source, the computer producing a misattribution error in the first set of impression data, the misattribution error based on a demographic data error in the first set of impression data, the demographic data error based on a difference between reported demographic data in the first set of impression data and actual demographic data corresponding to the first set of impression data;
generate a third set of data based on the second set of data from the second data source;
generate an independent binary model based on a difference between the first set of impression data and the third set of data; and
correct the demographic data error in the first set of impression data by applying the independent binary model to the first set of impression data to generate corrected demographic data; and
a viewership assigner to correct the misattribution error produced by the computer by assigning viewership to an impression associated with the first set of impression data using the corrected demographic data, in which at least one of the demographic corrector or the viewership assigner is a logic circuit.
Dependent claims: 9, 10, 11, 12
13. An apparatus comprising:
a demographic corrector to:
identify a first set of impression data received from a computer at a first data source, the first set of impression data different from a second set of data from a second data source, the computer producing a misattribution error in the first set of impression data, the misattribution error based on a demographic data error in the first set of impression data, the demographic data error based on a difference between reported demographic data in the first set of impression data and actual demographic data corresponding to the first set of impression data;
generate a third set of data based on the second set of data from the second data source;
generate a model based on a difference between the first set of impression data and the third set of data; and
correct the demographic data error in the first set of impression data by applying the model to the first set of impression data to generate corrected demographic data; and
a viewership assigner to correct the misattribution error produced by the computer by assigning viewership to an impression associated with the first set of impression data using the corrected demographic data, the viewership assigner is to assign viewership to the impression by:
identifying viewing history associated with the second set of data;
determining a first time associated with a first demographic viewing a media presentation in a first household associated with the second set of data;
determining a second time associated with the first demographic and a second demographic viewing the media presentation in the household;
determining a first probability that the first demographic viewed the media presentation by dividing the first time by the second time;
identifying a first person in the first household associated with the second set of data having a second probability similar to the first probability; and
imputing a viewing history of the first person to a second person in a second household associated with the first set of impression data, at least one of the demographic corrector or the viewership assigner is a logic circuit.
Dependent claims: 14
15. A tangible computer readable storage medium comprising instructions that, when executed, cause a machine to at least:
identify a first set of impression data received from a computer at a first data source, the first set of impression data having matched demographic data from users registered with both an over-the-top (OTT) device and a database proprietor, the first set of impression data different from a second set of data from a second data source, the computer producing a misattribution error in the first set of impression data, the misattribution error based on a demographic data error in the first set of impression data, the demographic data error based on a difference between reported demographic data in the first set of impression data and actual demographic data corresponding to the first set of impression data;
generate a third set of data based on the second set of data from the second data source;
generate an independent binary model based on a difference between the first set of impression data and the third set of data;
correct the demographic data error in the first set of impression data by applying the independent binary model to the first set of impression data to generate corrected demographic data; and
correct the misattribution error produced by the computer by assigning viewership to an impression associated with the first set of impression data using the corrected demographic data.
Dependent claims: 16, 17, 18
19. A tangible computer readable storage medium comprising instructions that, when executed, cause a machine to at least:
identify a first set of impression data received from a computer at a first data source, the first set of impression data different from a second set of data from a second data source, the computer producing a misattribution error in the first set of impression data, the misattribution error based on a demographic data error in the first set of impression data, the demographic data error based on a difference between reported demographic data in the first set of impression data and actual demographic data corresponding to the first set of impression data;
generate a third set of data based on the second set of data from the second data source;
generate an independent binary model based on a difference between the first set of impression data and the third set of data;
correct the demographic data error in the first set of impression data by applying the independent binary model to the first set of impression data to generate corrected demographic data;
correct the misattribution error produced by the computer by assigning viewership to an impression associated with the first set of impression data using the corrected demographic data;
identify viewing history associated with the second set of data;
determine a first time associated with a first demographic viewing a media presentation in a first household associated with the second set of data;
determine a second time associated with the first demographic and a second demographic viewing the media presentation in the household;
determine a first probability that the first demographic viewed the media presentation by dividing the first time by the second time;
identify a first person in the first household associated with the second set of data having a second probability similar to the first probability; and
impute a viewing history of the first person to a second person in a second household associated with the first set of impression data.
Dependent claims: 20
Specification