×

Online asynchronous reinforcement learning from concurrent customer histories

  • US 8,924,318 B2
  • Filed: 09/28/2012
  • Issued: 12/30/2014
  • Est. Priority Date: 09/28/2011
  • Status: Active Grant
First Claim
Patent Images

1. An apparatus, comprising;

  • one or more computing devices, each of the computing devices having one or more processors and memories configured to perform a method of asynchronous reinforcement learning (RL), including;

    obtaining an indication of a Decision Request;

    receiving, obtaining, accessing or constructing a user state pertaining to at least one user; and

    in response to the Decision Request;

    scoring a plurality of actions according to one or more value functions based, at least in part, upon the user state;

    applying a policy to identify one of the scored actions as a decision; and

    providing an indication of the decision or applying the decision to the at least one user;

    obtaining an indication of an Update Request, the Update Request being activated independent of user activity;

    receiving, obtaining, accessing or constructing a further user state pertaining to the at least one user; and

    in response to the Update Request;

    updating at least one of;

    the one or functions and the policy based, at least in part, upon the further user state,wherein the Decision Request is activated in response to an event timer and the event timer operates to periodically generate Decision Requests, wherein a frequency with which the event timer generates the Decision Requests is based at least in part, upon a period of time from a last user event pertaining to the at least one user or from a last user action, the last user action including the providing of the indication of the decision or the applying of the decision to the at least one user.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×