
Action selection for reinforcement learning using influence diagrams

  • US 20060224535A1
  • Filed: 06/29/2005
  • Published: 10/05/2006
  • Est. Priority Date: 03/08/2005
  • Status: Active Grant
First Claim

1. An online reinforcement learning system comprising:

    a model comprising an influence diagram with at least one chance node, the model receives an input and provides a probability distribution associated with uncertainty regarding parameters of the model;

    a decision engine that selects an action based, at least in part, upon the probability distribution, the decision engine employs the Thompson strategy heuristic technique to maximize long term expected utility; and

    a reinforcement learning component that modifies at least one of the parameters of the model based upon feedback associated with the selected action.
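
The three claimed elements (model, decision engine, learning component) can be illustrated with a minimal sketch. It is not the patented implementation: it assumes the influence diagram collapses to a single Bernoulli chance node per action with a Beta distribution over its parameter, and the names `BetaChanceNode`, `select_action`, and `run` are hypothetical. The decision engine draws one parameter sample from that distribution and picks the action with the highest expected utility under the sample, which is the usual reading of the Thompson strategy.

```python
import random


class BetaChanceNode:
    """Assumed stand-in for the model: one Bernoulli chance node per action,
    with a Beta distribution expressing uncertainty about its parameter."""

    def __init__(self, actions):
        # [alpha, beta] per action; (1, 1) is a uniform prior (an assumption).
        self.counts = {a: [1.0, 1.0] for a in actions}

    def sample_parameters(self, rng):
        # Draw one plausible parameter value per action from its Beta posterior.
        return {a: rng.betavariate(al, be) for a, (al, be) in self.counts.items()}

    def update(self, action, success):
        # Learning component: adjust the parameter distribution from feedback.
        self.counts[action][0 if success else 1] += 1.0


def select_action(model, utility, rng):
    # Decision engine: act greedily with respect to the sampled parameters.
    theta = model.sample_parameters(rng)
    return max(theta, key=lambda a: theta[a] * utility[a])


def run(true_probs, utility, steps, seed=0):
    # Simulated environment; true_probs is hidden from the learner.
    rng = random.Random(seed)
    model = BetaChanceNode(list(true_probs))
    pulls = {a: 0 for a in true_probs}
    for _ in range(steps):
        action = select_action(model, utility, rng)
        success = rng.random() < true_probs[action]
        model.update(action, success)
        pulls[action] += 1
    return model, pulls


if __name__ == "__main__":
    model, pulls = run({"a": 0.2, "b": 0.8}, {"a": 1.0, "b": 1.0}, steps=500)
    print(pulls)
```

Because the sampled parameters vary more for rarely tried actions, exploration falls out of the sampling step itself and no separate exploration rate is needed in this sketch.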
