System and method for matching objects using a cluster-dependent multi-armed bandit
First Claim
1. A computer system for matching objects, comprising:
- a cluster-dependent multi-armed bandit engine for matching a set of objects clustered by dependencies to another set of objects in order to determine an overall maximal payoff; and
a storage operably coupled to the cluster-dependent multi-armed bandit engine for storing clusters of dependent objects with associated payoffs.
4 Assignments
0 Petitions
Accused Products
Abstract
An improved system and method for matching objects using a cluster-dependent multi-armed bandit is provided. The matching may be performed by using a multi-armed bandit where the arms of the bandit may be dependent. In an embodiment, a set of objects segmented into a plurality of clusters of dependent objects may be received, and then a two step policy may be employed by a multi-armed bandit by first running over clusters of arms to select a cluster, and then secondly picking a particular arm inside the selected cluster. The multi-armed bandit may exploit dependencies among the arms to efficiently support exploration of a large number of arms. Various embodiments may include policies for discounted rewards and policies for undiscounted reward. These policies may consider each cluster in isolation during processing, and consequently may dramatically reduce the size of a large state space for finding a solution.
-
Citations
20 Claims
-
1. A computer system for matching objects, comprising:
-
a cluster-dependent multi-armed bandit engine for matching a set of objects clustered by dependencies to another set of objects in order to determine an overall maximal payoff; and a storage operably coupled to the cluster-dependent multi-armed bandit engine for storing clusters of dependent objects with associated payoffs. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer-implemented method for matching objects, comprising:
-
receiving a first set of objects segmented into a plurality of clusters of dependent objects; matching a plurality of objects from the plurality of clusters of dependent objects to a plurality of objects from a second set of objects by sampling the plurality of objects from the plurality of clusters of dependent objects using a multi-armed bandit; and outputting payoffs for the plurality of objects and the plurality of clusters to which the plurality of objects belong. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A computer system for matching objects, comprising:
-
means for receiving a first set of objects segmented into a plurality of clusters of dependent objects; means for matching a plurality of objects from the plurality of clusters of dependent objects to a plurality of objects from a second set of objects by sampling the plurality of objects from the plurality of clusters of dependent objects using a multi-armed bandit; and means for outputting payoffs for the plurality of objects and the plurality of clusters to which the plurality of objects belong. - View Dependent Claims (17, 18, 19, 20)
-
Specification