APPARATUS AND METHODS FOR OPERATING ROBOTIC DEVICES USING SELECTIVE STATE SPACE TRAINING
First Claim
1. A method of operating a robotic controller apparatus, the method comprising:
- determining a current controller performance associated with performing a target task;
determining a difficult portion of a target trajectory associated with the target task, the difficult portion characterized by an extent of a state space; and
providing a training input for navigating the difficult portion, the training input configured to transition the current performance towards the target trajectory;
wherein;
the difficult portion of the target trajectory is determined based at least on the current performance being outside a range from the target trajectory;
the state space is associated with performing of the target task by the controller; and
performing by the controller of a portion of the target task outside the extent is configured based on autonomous controller operation.
2 Assignments
0 Petitions
Accused Products
Abstract
Apparatus and methods for training and controlling of e.g., robotic devices. In one implementation, a robot may be utilized to perform a target task characterized by a target trajectory. The robot may be trained by a user using supervised learning. The user may interface to the robot, such as via a control apparatus configured to provide a teaching signal to the robot. The robot may comprise an adaptive controller comprising a neuron network, which may be configured to generate actuator control commands based on the user input and output of the learning process. During one or more learning trials, the controller may be trained to navigate a portion of the target trajectory. Individual trajectory portions may be trained during separate training trials. Some portions may be associated with robot executing complex actions and may require additional training trials and/or more dense training input compared to simpler trajectory actions.
84 Citations
25 Claims
-
1. A method of operating a robotic controller apparatus, the method comprising:
-
determining a current controller performance associated with performing a target task; determining a difficult portion of a target trajectory associated with the target task, the difficult portion characterized by an extent of a state space; and providing a training input for navigating the difficult portion, the training input configured to transition the current performance towards the target trajectory; wherein; the difficult portion of the target trajectory is determined based at least on the current performance being outside a range from the target trajectory; the state space is associated with performing of the target task by the controller; and performing by the controller of a portion of the target task outside the extent is configured based on autonomous controller operation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. An adaptive controller apparatus comprising a plurality of computer readable instructions configured to, when executed, cause performing of a target task by at least:
-
during a first training trial, determining a predicted signal configured in accordance with a sensory input, the predicted signal configured to cause execution of an action associated with the target task, the action execution being characterized by a first performance; during a second training trial, based on a teaching input and the predicted signal, determining a combined signal configured to cause execution of the action, the action execution during the second training trial being characterized by a second performance; and adjusting a learning parameter of the controller based on the first performance and the second performance. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
-
17. A robotic apparatus comprising:
-
a platform characterized by first and second degrees of freedom; a sensor module configured to provide information related to the platform'"'"'s environment; and an adaptive controller apparatus configured to determine first and second control signals to facilitate operation of the first and the second degrees of freedom, respectively; wherein; the first and the second control signals are configured to cause the platform to perform a target action; the first control signal is determined in accordance with the information and a teaching input; the second control signal is determined in an absence of the teaching input and in accordance with the information and a configuration of the controller, and the configuration is determined based at least on an outcome of training of the controller to operate the second degree of freedom. - View Dependent Claims (18, 19, 20, 21)
-
-
22. A method of optimizing the operation of a robotic controller apparatus, the method comprising:
-
determining a current controller performance associated with performing a target task, the current performance being non-optimal for accomplishing the task; and for at least a selected first portion of a target trajectory associated with the target task, the first portion characterized by an extent of a state space, providing a training input that facilitates navigation of the first portion, the training input configured to transition the current performance towards the target trajectory; wherein the first portion of the target trajectory is selected based at least on the current performance not meeting at least one prescribed criterion with respect to the target trajectory. - View Dependent Claims (23, 24, 25)
-
Specification