Apparatus and methods for reinforcement-guided supervised learning
First Claim
1. A method of generating a predicted control output by an adaptive controller of a robotic apparatus comprising a predictor and a combiner, the method comprising:
- operating the adaptive controller in accordance with a reinforcement learning process based on a reinforcement signal, the reinforcement signal being based on a performance measure associated with the reinforcement learning process;
operating the predictor in accordance with a supervised learning process based on a teaching signal, the teaching signal conveying information related to a target output of the predictor;
generating a control output via the adaptive controller based on a sensory input and the reinforcement signal, the sensory input including information associated with an environment of the robotic apparatus;
determining a predicted control output via the predictor based on the sensory input and the teaching signal;
determining a combined output via the combiner based on the control output and the predicted control output, the combined output being characterized by a transform function; and
providing the combined output via the adaptive controller to the robotic apparatus, the combined output causing the robotic apparatus to execute a maneuver in accordance with the sensory input;
wherein the teaching signal comprises the combined control output.
2 Assignments
0 Petitions
Accused Products
Abstract
Framework may be implemented for transferring knowledge from an external agent to a robotic controller. In an obstacle avoidance/target approach application, the controller may be configured to determine a teaching signal based on a sensory input, the teaching signal conveying information associated with target action consistent with the sensory input, the sensory input being indicative of the target/obstacle. The controller may be configured to determine a control signal based on the sensory input, the control signal conveying information associated with target approach/avoidance action. The controller may determine a predicted control signal based on the sensory input and the teaching signal, the predicted control conveying information associated with the target action. The control signal may be combined with the predicted control in order to cause the robotic apparatus to execute the target action.
-
Citations
20 Claims
-
1. A method of generating a predicted control output by an adaptive controller of a robotic apparatus comprising a predictor and a combiner, the method comprising:
-
operating the adaptive controller in accordance with a reinforcement learning process based on a reinforcement signal, the reinforcement signal being based on a performance measure associated with the reinforcement learning process; operating the predictor in accordance with a supervised learning process based on a teaching signal, the teaching signal conveying information related to a target output of the predictor; generating a control output via the adaptive controller based on a sensory input and the reinforcement signal, the sensory input including information associated with an environment of the robotic apparatus; determining a predicted control output via the predictor based on the sensory input and the teaching signal; determining a combined output via the combiner based on the control output and the predicted control output, the combined output being characterized by a transform function; and providing the combined output via the adaptive controller to the robotic apparatus, the combined output causing the robotic apparatus to execute a maneuver in accordance with the sensory input; wherein the teaching signal comprises the combined control output. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A computerized controller apparatus of a robot, the apparatus comprising:
-
a controller block; a predictor block; and one or more processors configured to execute computer program modules to perform a method of transferring information related to execution of a control task associated with a sensory context by the robot from the controller block to the predictor block, the method comprising; configuring the predictor block to operate in accordance with a supervised learning process based on a teaching input, the teaching input being provided by the control block based on a reinforcement learning process configured to be adapted based on the sensory context and a reinforcement signal, the reinforcement learning process adaptation being configured to occur during one or more trials effectuated prior to the provision of the teaching input; and based on the sensory context, causing the predictor block to generate a predicted control output that causes the execution of the control task. - View Dependent Claims (16, 17)
-
-
18. A computerized robotic control apparatus, comprising:
one or more processors configured to by machine-readable instructions to; determine a teaching signal based on a sensory input, the teaching signal conveying information associated with a target action consistent with the sensory input, the sensory input being indicative of at least one object in an environment of the robotic apparatus; determine a control signal based on the sensory input, the control signal conveying information associated with the target action; determine a predicted control signal based on the sensory input and the teaching signal, the predicted control conveying information associated with the target action; and combine the control signal and the predicted control signal into a combined control output, the combined control output causing the robotic apparatus to execute a maneuver, the target action comprising the maneuver. - View Dependent Claims (19, 20)
Specification