×

Automated Action-Selection System and Method , and Application Thereof to Training Prediction Machines and Driving the Development of Self-Developing Devices

  • US 20080319929A1
  • Filed: 07/26/2005
  • Published: 12/25/2008
  • Est. Priority Date: 07/27/2004
  • Status: Active Grant
First Claim
Patent Images

1. :

  • An automated action-selection system adapted to generate signals specifying values for a set of one or more action variables defining an action that can be taken whereby to affect a setup S, the automated action-selection system comprising;

    input means for receiving signals indicative of the value, at a time t, of a set of zero or more system-state/context parameters (SC(t)) describing the state and/or context of the setup S;

    a region definer adapted to define a set of regions in a multi-dimensional system-state/context/action space, each dimension of the system-state/context/action space being defined by a respective different parameter or variable of the sets of system-state/context parameters and action variables;

    means for determining a set of candidate actions, each candidate action consisting of a possible set of values for the action variables;

    a region identifier for identifying the region in system-state/context/action space containing the combination of a given candidate action with values of any system-state/context parameters at time t;

    a prediction unit adapted to predict the value of a set of one or more predicted variables (VAR) a predetermined interval after time t, wherein a prediction function applied by the prediction unit depends upon the region in system-state/context/action space containing the combination of this given candidate action with any system-state/context parameters at time t;

    calculator means adapted to calculate, for selected candidate actions, a respective indicator of the actual error in the prediction made by the prediction unit for said selected candidate action,memory means for storing indicators of actual prediction errors made by the prediction unit for respective candidate actions selected on one or more previous occasions;

    assessment means adapted to evaluate the expected improvement in the performance of the prediction unit if a given candidate action is performed, wherein an assessment performed by the assessment means depends upon the region R in system-state/context/action space containing the combination of this given candidate action with the values, at time t, of any system-state/context parameters, and the assessment means is further adapted to evaluate said expected improvement by comparing an indicator of the actual prediction error that existed on one or more occasions, previous to time t, when the setup S had a combination of system-state/context parameters and action variables located in the same region R of the system-state/context/action; and

    means for generating a signal indicating the desirability of selecting a given candidate action for performance, said signal being dependent on the expected improvement in the performance of the prediction unit evaluated by the assessment unit for said given candidate action.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×