Methods and apparatus to meter content exposure using closed caption information
First Claim
Patent Images
1. A method comprising:
- developing a keyword database of terms based on a program guide descriptive of a plurality of programs for a given time period;
generating a plurality of likelihood values for respective ones of the plurality of programs based on comparison of closed caption text associated with a presented program to the keyword database, the values representing likelihoods that the respective ones of the plurality of programs is the presented program, the likelihood values being generated without comparing the collected audience measurement data to any reference audience measurement data;
collecting an audience measurement parameter for the presented program, the audience measurement parameter useable to identify the presented program;
employing the plurality of likelihood values using a processor to select a subset of the plurality of programs to form a list of most probable presented programs, wherein the selected subset includes more than one of and less than all of the plurality of programs; and
sending the list of most probable programs and the collected audience measurement data to a collection server, the collection server to compare the collected audience measurement data to reference audience measurement data for respective ones of the most probable programs in an order selected based on the likelihood values for respective ones of the most probable programs in the list.
11 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus to meter content exposure using closed caption information are disclosed. An example method comprises developing a keyword database of terms based on program guide descriptive of programs for a given time period, generating one or more values representative of likelihoods that one or more respective media content was presented based on a comparison of closed caption text and the keyword database, collecting audience measurement data, and employing the one or more likelihood values to identify a set of reference data for comparison to the audience measurement data to identify presented content.
-
Citations
22 Claims
-
1. A method comprising:
-
developing a keyword database of terms based on a program guide descriptive of a plurality of programs for a given time period; generating a plurality of likelihood values for respective ones of the plurality of programs based on comparison of closed caption text associated with a presented program to the keyword database, the values representing likelihoods that the respective ones of the plurality of programs is the presented program, the likelihood values being generated without comparing the collected audience measurement data to any reference audience measurement data; collecting an audience measurement parameter for the presented program, the audience measurement parameter useable to identify the presented program; employing the plurality of likelihood values using a processor to select a subset of the plurality of programs to form a list of most probable presented programs, wherein the selected subset includes more than one of and less than all of the plurality of programs; and sending the list of most probable programs and the collected audience measurement data to a collection server, the collection server to compare the collected audience measurement data to reference audience measurement data for respective ones of the most probable programs in an order selected based on the likelihood values for respective ones of the most probable programs in the list. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. An apparatus comprising:
-
an audience measurement engine to collect an audience measurement parameter for a presented program; an indexing engine to create a keyword database based on data descriptive of a plurality of programs; and a closed caption matcher to; generate likelihood values for respective ones of the plurality of programs based on comparison of closed caption text associated with the presented program to the keyword database, the values representing likelihoods that the respective ones of the plurality of programs is the presented programs, the likelihood values being generated without comparing the collected audience measurement data to any reference audience measurement data; select a subset of the plurality of programs based on the likelihood values to form a list of most probable presented programs, the list of most probable presented programs including more than one of and fewer than all of the plurality of programs; order the list of most probable presented programs based on respective ones of the likelihood values; and send the ordered list of most probable programs and the collected audience measurement data to a collection server, the collection server to compare the collected audience measurement data to reference audience measurement data for respective ones of the most probable programs based on the order of the most probable programs in the list to determine an audience presentation statistic, wherein at least one of the audience measurement engine, the indexing engine or the closed caption matcher is implemented in hardware. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A tangible article of manufacture excluding propagating signals, the article comprising a computer-readable storage medium storing machine readable instructions that, when executed, cause a machine to:
-
develop a keyword database of terms based on a program guide descriptive of a plurality of programs for a given time period; collect audience measurement data for a presented program, the audience measurement data useable to identify the presented program; generate likelihood values for respective ones of the plurality of programs based on comparison of closed caption information associated with the presented program and the keyword database, the values representing likelihoods that the respective ones of the plurality of programs is the presented program, the likelihood values being generated without comparing the collected audience measurement data to any reference audience measurement data; select a subset of the plurality of programs based on the likelihood values to form a list of most probable presented programs, the list of most probable presented programs including more than one of and fewer than all of the plurality of programs; order the list of most probable presented programs based on respective ones of the generated likelihood values; and send the ordered list of most probable programs and the collected audience measurement data to a collection server, the collection server to compare the collected audience measurement data to reference audience measurement data for respective ones of the most probable programs based on the order of the most probable programs in the list to identify the presented program. - View Dependent Claims (15, 16, 17, 18, 19)
-
-
20. A method comprising:
-
receiving from a content meter an audience measurement parameter for a presented program; receiving from the content meter a list of most probable presented programs, programs in the list being selected and ordered based on comparisons of closed-caption text associated with the presented programs to a keyword database, the ordered list including more than one of and fewer than all of the plurality of programs; and comparing using a processor reference audience measurement parameters for respective ones of the most probable presented programs until the presented content is identified, the reference audience measurement parameters compared in accordance with the order of the most probable presented programs in the list. - View Dependent Claims (21, 22)
-
Specification