Discrepancy detection apparatus and methods for machine learning
First Claim
1. A robotic apparatus, comprising:
a platform comprising a controllable actuator;
a sensor module configured to provide environmental information associated with an environment of the platform; and
a controller configured to:
provide a control instruction for the controllable actuator, the control instruction configured to cause the platform to execute an action to accomplish a target task in accordance with the environmental information;
determine a predicted outcome of the action;
determine a discrepancy signal based on an actual outcome of the action and the predicted outcome; and
determine a repeat indication responsive to the discrepancy being within a range of a target value associated with the target task;
wherein the repeat indication is configured to cause the robot to execute a second action to achieve the target task.
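The decision logic recited in claim 1 can be illustrated with a minimal sketch. The claim does not say how outcomes are represented or how the discrepancy is measured, so this sketch assumes scalar outcomes and an absolute difference. The function names and the `tolerance` parameter are hypothetical and do not come from the patent.

```python
def discrepancy_signal(actual_outcome: float, predicted_outcome: float) -> float:
    """Mismatch between actual and predicted outcomes.

    Assumption: outcomes are scalars and the metric is the absolute difference.
    """
    return abs(actual_outcome - predicted_outcome)


def repeat_indication(discrepancy: float, target_value: float, tolerance: float) -> bool:
    """Return True when the discrepancy lies within +/- tolerance of the target value.

    Assumption: the claimed "range of a target value" is a symmetric interval.
    """
    return abs(discrepancy - target_value) <= tolerance


# Illustrative values only: the controller would execute a second action
# when the indication is True.
d = discrepancy_signal(actual_outcome=0.8, predicted_outcome=0.5)
print(repeat_indication(d, target_value=0.25, tolerance=0.1))
```

Treating the range as a symmetric interval is only one possible reading; an asymmetric or task-specific range would change only the comparison in `repeat_indication`.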
Abstract
A robotic device may comprise an adaptive controller configured to learn to predict consequences of the robotic device's actions. During training, the controller may receive a copy of the planned and/or executed motor command and sensory information obtained based on the robot's response to the command. The controller may predict a sensory outcome based on the command and one or more prior sensory inputs. The predicted sensory outcome may be compared to the actual outcome. Based on a determination that the prediction matches the actual outcome, the training may stop. Upon detecting a discrepancy between the prediction and the actual outcome, the controller may provide a continuation signal configured to indicate that additional training may be utilized. In some classification implementations, the discrepancy signal may be used to indicate occurrence of novel (not yet learned) objects in the sensory input and/or indicate continuation of training to recognize said objects.
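The predict, compare, then stop-or-continue loop described in the abstract can be sketched as follows. This is a toy illustration under stated assumptions, not the patented controller. The predictor is a single learned weight, the update is a simple delta rule, the prior sensory inputs mentioned in the abstract are ignored, and `environment`, `train_predictor`, `learning_rate`, and `tolerance` are hypothetical names.

```python
from typing import Callable, Sequence, Tuple


def train_predictor(
    environment: Callable[[float], float],
    commands: Sequence[float],
    learning_rate: float = 0.1,
    tolerance: float = 1e-3,
    max_trials: int = 1000,
) -> Tuple[float, int]:
    """Train a one-weight predictor until its predictions match the actual outcomes.

    Returns the learned weight and the number of trials that were run.
    """
    weight = 0.0  # assumed model: predicted_outcome = weight * command
    for trial in range(1, max_trials + 1):
        worst_discrepancy = 0.0
        for command in commands:
            predicted = weight * command      # predicted sensory outcome
            actual = environment(command)     # actual sensory outcome
            discrepancy = actual - predicted  # discrepancy signal
            weight += learning_rate * discrepancy * command  # delta-rule update
            worst_discrepancy = max(worst_discrepancy, abs(discrepancy))
        if worst_discrepancy <= tolerance:
            return weight, trial  # prediction matches the outcome: training stops
        # otherwise the discrepancy acts as the continuation signal
    return weight, max_trials


# Hypothetical environment: the sensed outcome is twice the motor command.
weight, trials = train_predictor(lambda command: 2.0 * command, [0.5, 1.0, -1.0])
print(weight, trials)
```

The stopping test uses the largest discrepancy seen in a trial, which is one conservative choice; an average over the trial would be an equally plausible reading of "the prediction matches the actual outcome".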
20 Claims
1. A robotic apparatus, comprising:
a platform comprising a controllable actuator;
a sensor module configured to provide environmental information associated with an environment of the platform; and
a controller configured to:
provide a control instruction for the controllable actuator, the control instruction configured to cause the platform to execute an action to accomplish a target task in accordance with the environmental information;
determine a predicted outcome of the action;
determine a discrepancy signal based on an actual outcome of the action and the predicted outcome; and
determine a repeat indication responsive to the discrepancy being within a range of a target value associated with the target task;
wherein the repeat indication is configured to cause the robot to execute a second action to achieve the target task.
Dependent claims: 2, 3, 4, 5
6. A method of training an adaptive robotic apparatus, the method comprising:
for a given training trial:
causing the apparatus to execute an action based on a context;
determining a current discrepancy between a target outcome of the action and a predicted outcome of the action;
comparing the current discrepancy to a prior discrepancy, the prior discrepancy being determined based on a prior observed outcome of the action and a prior predicted outcome of the action determined at a prior trial; and
providing an indication responsive to the current discrepancy being smaller than the prior discrepancy, the indication being configured to cause the apparatus to execute the action based on the context during a trial subsequent to the given trial.
Dependent claims: 7, 8, 9, 10
11. A non-transitory computer-readable storage medium having instructions embodied thereon, the instructions being executable by one or more processors to perform a method of adapting training of a learning apparatus, the method comprising:
determining a discrepancy between a predicted outcome and an observed outcome of an action of the learning apparatus;
determining an expected error associated with the determination of the discrepancy;
comparing the expected error to a target error associated with the determination of the discrepancy; and
providing a continue-training indication based on the expected error being smaller than the target error.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
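The comparisons recited in claims 6 and 11 can be sketched in a few lines. The claims do not define how the discrepancy magnitude or the expected error is computed, so this sketch assumes that discrepancies are compared by absolute value and that the expected error is the mean absolute discrepancy over recent trials. Both function names are hypothetical.

```python
from statistics import fmean
from typing import Sequence


def repeat_action_indication(current_discrepancy: float, prior_discrepancy: float) -> bool:
    """Claim 6 style check: indicate a repeat of the action on a later trial
    when the current discrepancy is smaller than the one from a prior trial.

    Assumption: "smaller" is judged on magnitudes.
    """
    return abs(current_discrepancy) < abs(prior_discrepancy)


def continue_training_indication(discrepancies: Sequence[float], target_error: float) -> bool:
    """Claim 11 style check: continue training when the expected error is
    smaller than the target error.

    Assumption: the expected error is the mean absolute discrepancy over the
    supplied (non-empty) history.
    """
    expected_error = fmean([abs(d) for d in discrepancies])
    return expected_error < target_error


# Illustrative values only.
print(repeat_action_indication(current_discrepancy=0.2, prior_discrepancy=0.5))
print(continue_training_indication([0.5, -0.25, 0.125], target_error=0.5))
```

Note that both checks follow the claim wording literally: the indication is given when the quantity is smaller than its reference, not when it is larger.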
Specification