Robotic training apparatus and methods
First Claim
1. A computerized controller apparatus configured to effectuate control of a robotic device, the controller apparatus comprising:
- one or more processors configured to execute computer program modules to cause the one or more processors to;
(1) during a first training trial of a plurality of trials;
determine a first signal based on a characteristic of an input provided to the controller apparatus by the robotic device; and
cause the robotic device to perform a first action based on a first control signal, the first control signal being determined based on the first signal and a first user input, the first action being characterized by a first performance measure; and
(2) during a second training trial of the plurality of trials, the second training trial being subsequent to the first training trial;
determine a second signal based on the characteristic of the input and a first error, the first error being determined based on a proximity measure between the first action and a target action, the first error being greater than a third error corresponding to a third action performed by the robotic device responsive to a third control signal configured based solely on the first user input; and
cause the robotic device to perform a second action based on a second control signal, the second control signal being determined based on the second signal and a second user input, the second action being characterized by a second error, the second error being determined based on a proximity measure between the second action and the target action;
wherein;
the second error is smaller than the first error.
2 Assignments
0 Petitions
Accused Products
Abstract
Apparatus and methods for training of robotic devices. Robotic devices may be trained by a user guiding the robot along target trajectory using an input signal. A robotic device may comprise an adaptive controller configured to generate control commands based on one or more of the user guidance, sensory input, and/or performance measure. Training may comprise a plurality of trials. During first trial, the user input may be sufficient to cause the robot to complete the trajectory. During subsequent trials, the user and the robot'"'"'s controller may collaborate so that user input may be reduced while the robot control may be increased. Individual contributions from the user and the robot controller during training may be may be inadequate (when used exclusively) to complete the task. Upon learning, user'"'"'s knowledge may be transferred to the robot'"'"'s controller to enable task execution in absence of subsequent inputs from the user.
-
Citations
22 Claims
-
1. A computerized controller apparatus configured to effectuate control of a robotic device, the controller apparatus comprising:
-
one or more processors configured to execute computer program modules to cause the one or more processors to; (1) during a first training trial of a plurality of trials; determine a first signal based on a characteristic of an input provided to the controller apparatus by the robotic device; and cause the robotic device to perform a first action based on a first control signal, the first control signal being determined based on the first signal and a first user input, the first action being characterized by a first performance measure; and (2) during a second training trial of the plurality of trials, the second training trial being subsequent to the first training trial; determine a second signal based on the characteristic of the input and a first error, the first error being determined based on a proximity measure between the first action and a target action, the first error being greater than a third error corresponding to a third action performed by the robotic device responsive to a third control signal configured based solely on the first user input; and cause the robotic device to perform a second action based on a second control signal, the second control signal being determined based on the second signal and a second user input, the second action being characterized by a second error, the second error being determined based on a proximity measure between the second action and the target action; wherein; the second error is smaller than the first error. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method of training a computerized robotic apparatus to perform a target action based on sensory context, the training being effectuated via a plurality of iterations, the method comprising:
-
during a first iteration; causing the apparatus to generate a first control signal based on the sensory context; causing the apparatus to execute a first action based on the first control signal and a user input received from a user, the user input being indicative of the target action; and causing the apparatus to adjust a learning parameter based on a first performance value determined based on the first action and the target action; and during a subsequent iteration; causing the apparatus to generate a second control signal based on the sensory context and the adjusted learning parameter; and causing the apparatus to execute a second action based on the second control signal, the second action being characterized by a second performance value; wherein; the plurality of iterations is configured to occur within a time interval comprising at least a duration; the first iteration and the subsequent iteration are separated by the duration; a provision of the user input during the subsequent iteration is configured to cause a controller to execute a third action characterized by a third performance value; and the second action is configured to deviate less from the target action as compared to the third action. - View Dependent Claims (9, 10)
-
-
11. A robotic apparatus, comprising:
-
a platform comprising one or more controllable elements; and a controller comprising one or more processors configured to execute computer program modules configured to operate individual ones of the one or more controllable elements, the computer program modules comprising; a first logic module configured to receive a first input from a human; a second logic module configured to analyze the first input and to cause the platform to execute a first action in accordance with the first input; and a third logic module configured to receive a second input from the human subsequent to the receipt of the first input, the second input being configured to cause a corrected action by the platform, the corrected action being characterized by a lower deviation from a target action; wherein; the execution of the first action is based on the operation of the individual ones of the one or more controllable elements; the target action corresponds to an operation of the platform based on the first input from the human; the analysis of the first input comprises a determination of a deviation between the first action and the target action; and the analysis of the first input is configured to cause a modification of a controller state in accordance with a learning process configured based on a performance measure. - View Dependent Claims (12, 13)
-
-
14. A robotic apparatus, comprising:
-
a platform comprising one or more controllable elements; and a controller comprising one or more processors configured to execute computer program modules configured to operate individual ones of the one or more controllable elements, the computer program modules comprising; a first logic module configured to receive a first input from a human; a second logic module configured to analyze the first input and to cause the platform to execute a first action in accordance with the first input; and a third logic module configured to receive a second input from the human subsequent to the receipt of the first input, the second input being configured to cause a corrected action by the platform, the corrected action being characterized by a lower deviation from a target action; wherein; the execution of the first action is based on the operation of the individual ones of the one or more controllable elements; the second logic module comprises a predictor sub-module configured to determine a predicted control output based on a characteristic of a sensory signal, the sensory signal being configured to convey information associated with one or both of an environment of the robotic apparatus and a platform state; the first input and the second input are configured based on the sensory signal; and the first action is configured based on the predicted control output. - View Dependent Claims (15, 16, 17, 18, 19)
-
-
20. A robotic apparatus, comprising:
-
a platform comprising one or more controllable elements; and a controller comprising one or more processors configured to execute computer program modules configured to operate individual ones of the one or more controllable elements, the computer program modules comprising; a first logic module configured to receive a first input from a human; a second logic module configured to analyze the first input and to cause the platform to execute a first action in accordance with the first input; and a third logic module configured to receive a second input from the human subsequent to the receipt of the first input, the second input being configured to cause a corrected action by the platform, the corrected action being characterized by a lower deviation from a target action; wherein; the execution of the first action is based on the operation of the individual ones of the one or more controllable elements; the corrected action is effectuated based on a cooperative interaction with the human, the cooperative interaction being characterized by a plurality of iterations; the first input corresponds to a first given iteration of the plurality of iterations; and the second input corresponds to a second given iteration of the plurality of iterations, the second given iteration occurring subsequent to the first given iteration. - View Dependent Claims (21, 22)
-
Specification