Apparatus and methods for operating robotic devices using selective state space training
First Claim
1. An adaptive controller apparatus comprising a plurality of computer readable instructions configured to, when executed, cause a performance of a target task by a robot, the computer readable instructions configured to cause the adaptive controller apparatus to:
- during a first training trial comprising at least one action and performed without at least one second action, determine a predicted signal configured in accordance with a sensory input, the predicted signal being configured to cause execution of an action associated with the target task, the execution of the action associated with the target task being characterized by a first performance;
during a second training trial, based on a teaching input and the predicted signal, determine a combined signal of the at least one action and the at least one second action, the combined signal configured to cause the execution of the action associated with the target task, the execution of the action associated with the target task during the second training trial being characterized by a second performance; and
adjust a learning parameter of the adaptive controller apparatus based on the first performance and the second performance, the adjustment of the learning parameter comprising one or more iterative adjustments of at least the first performance of the first training trial and the second performance of the second training trial in alternation until the learning parameter reaches a target threshold;
wherein the performance of the target task comprises the execution of the action associated with the target task and the at least one second action contemporaneously.
2 Assignments
0 Petitions
Accused Products
Abstract
Apparatus and methods for training and controlling of e.g., robotic devices. In one implementation, a robot may be utilized to perform a target task characterized by a target trajectory. The robot may be trained by a user using supervised learning. The user may interface to the robot, such as via a control apparatus configured to provide a teaching signal to the robot. The robot may comprise an adaptive controller comprising a neuron network, which may be configured to generate actuator control commands based on the user input and output of the learning process. During one or more learning trials, the controller may be trained to navigate a portion of the target trajectory. Individual trajectory portions may be trained during separate training trials. Some portions may be associated with robot executing complex actions and may require additional training trials and/or more dense training input compared to simpler trajectory actions.
345 Citations
19 Claims
-
1. An adaptive controller apparatus comprising a plurality of computer readable instructions configured to, when executed, cause a performance of a target task by a robot, the computer readable instructions configured to cause the adaptive controller apparatus to:
-
during a first training trial comprising at least one action and performed without at least one second action, determine a predicted signal configured in accordance with a sensory input, the predicted signal being configured to cause execution of an action associated with the target task, the execution of the action associated with the target task being characterized by a first performance; during a second training trial, based on a teaching input and the predicted signal, determine a combined signal of the at least one action and the at least one second action, the combined signal configured to cause the execution of the action associated with the target task, the execution of the action associated with the target task during the second training trial being characterized by a second performance; and adjust a learning parameter of the adaptive controller apparatus based on the first performance and the second performance, the adjustment of the learning parameter comprising one or more iterative adjustments of at least the first performance of the first training trial and the second performance of the second training trial in alternation until the learning parameter reaches a target threshold; wherein the performance of the target task comprises the execution of the action associated with the target task and the at least one second action contemporaneously. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A robotic apparatus comprising:
-
a platform characterized by first and second degrees of freedom of motion; a sensor module configured to provide information related to an environment of the platform; and an adaptive controller apparatus configured to determine first and second control signals to facilitate operation of the first and the second degrees of freedom of motion of the robotic apparatus, respectively; wherein; the first and the second control signals are configured to cause the platform to perform a target action; the first control signal is determined in accordance with the information related to the environment of the platform and a teaching input; the second control signal is determined in an absence of the teaching input and in accordance with the information related to the environment of the platform and a configuration of the adaptive controller apparatus; and the configuration is determined based at least on an outcome of training of the adaptive controller apparatus to operate the second degree of freedom of motion of the robotic apparatus, the training to operate the second degree of freedom of motion being configured to occur with the first degree of freedom of motion of the robotic apparatus held static entirely during the training. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A method of operating a robotic controller apparatus, the robotic controller apparatus configured to cause a robot to perform a target task, the method comprising:
-
during a first training trial comprising at least one action and performed without at least one second action; determining a predicted signal configured in accordance with a sensory input; and executing an action associated with the target task via the predicted signal, the execution of the action associated with the target task being characterized by a first performance; during a second training trial, based on a teaching input and the predicted signal; determining a combined signal of the at least one action and the at least one second action; and executing the action associated with the target task via the combined signal, the execution of the action associated with the target task during the second training trial being characterized by a second performance; and adjusting a learning parameter of the robotic controller apparatus based on the first performance and the second performance, the adjustment of the learning parameter comprising iteratively adjusting at least the first performance of the first training trial and the second performance of the second training trial in alternation until the learning parameter reaches a target threshold; wherein the performance of the target task comprises the execution of the action associated with the target task and the at least one second action contemporaneously. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
Specification