ENHANCED MATCHING THROUGH EXPLORE/EXPLOIT SCHEMES
First Claim
1. A computer-implemented method comprising:
- simulating user behavior pertaining to a specified future time period,wherein the simulating is based on an observed user behavior, andwherein the simulating produces first simulated data;
selecting an item of a plurality of items during the specified future time period based on the first simulated data; and
presenting the item to a user.
9 Assignments
0 Petitions
Accused Products
Abstract
Content items are selected to be displayed on a portal page in such a way as to maximize a performance metric such as click-through rate. Problems relating to content selection are addressed, such as changing content pool, variable performance metric, and delay in receiving feedback on an item once the item has been displayed to a user. An adaptation of priority-based schemes for the multi-armed bandit problem are used to project future trends of data. The adaptation introduces experiments concerning a future time period into the calculation, which increases the set of data on which to solve the multi-armed bandit problem. Also, a Bayesian explore/exploit method is formulated as an optimization problem that addresses all of the issues of content item selection for a portal page. This optimization problem is modified by Lagrange relaxation and normal approximation, which allow computation of the optimization problem in real time.
45 Citations
28 Claims
-
1. A computer-implemented method comprising:
-
simulating user behavior pertaining to a specified future time period, wherein the simulating is based on an observed user behavior, and wherein the simulating produces first simulated data; selecting an item of a plurality of items during the specified future time period based on the first simulated data; and presenting the item to a user. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-implemented method comprising:
-
generating a plan for presenting an item from a plurality of items to a user; wherein the plan is based on a simulation of user behavior pertaining to a specified future time period; wherein the simulation comprises a function measuring a potential that presenting the item will maximize a performance metric; and presenting, to the user, the item selected from the plurality of items according to the plan. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A machine-readable volatile or non-volatile medium carrying one or more sequences of instructions, wherein execution of the one or more sequences of instructions by one or more processors causes the one or more processors to perform the steps of:
-
simulating user behavior pertaining to a specified future time period, wherein the simulating is based on an observed user behavior, and wherein the simulating produces first simulated data; selecting an item of a plurality of items during the specified future time period based on the first simulated data; and presenting the item to a user. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
-
22. A machine-readable volatile or non-volatile medium carrying one or more sequences of instructions, wherein execution of the one or more sequences of instructions by one or more processors causes the one or more processors to perform the steps of:
-
generating a plan for presenting an item from a plurality of items to a user; wherein the plan is based on a simulation of user behavior pertaining to a specified future time period; wherein the simulation comprises a function measuring a potential that presenting the item will maximize a performance metric; and presenting, to the user, the item selected from the plurality of items according to the plan. - View Dependent Claims (23, 24, 25, 26, 27, 28)
-
Specification