Bayesian-centric autonomous robotic learning
First Claim
1. An autonomous robotic system for learning, the system comprising:
- a mobile platform having a central body;
a displacement module operably connected to the central body and configured to displace the mobile platform;
a payload module releasably connected to the central body and configured to perform a mission specific operation;
an input module configured to receive a user input command;
a controller disposed within the central body and operably connected to the displacement module and the input module;
a data store operably coupled to the controller, wherein the data store comprises a program of instructions that, when executed by the controller, cause the controller to perform operations to adaptively optimize a control system of the mobile platform, the operations comprising;
receive, via the input module, a predetermined target goal;
store the predetermined goal in the data store;
retrieve the predetermined target goal from the data store;
retrieve a set of parameters associated with the predetermined target goal;
retrieve a set of coefficients associated with the retrieved set of parameters;
determine a current success probability of achieving the predetermined target goal based on a Bayesian equation formed by the retrieved set of parameters and the retrieved set of coefficients;
receive a perturbation signal;
modify a selected one of the retrieved coefficients or a selected one of the retrieved parameters in response to the received perturbation signal;
determine a perturbed success probability based on the Bayesian equation using the selected one of the retrieved coefficients or the selected one of the retrieved parameters as modified by the received perturbation signal; and
,if the perturbed success probability exceeds the current success probability, then store the selected one of the retrieved coefficients or the selected one of the retrieved parameters as modified by the received perturbation signal and in association with the predetermined target goal.
1 Assignment
0 Petitions
Accused Products
Abstract
Various apparatus and methods include autonomous robot operations to perturb a current Bayesian equation and determining whether the perturbed Bayesian equation yields an improved probability of success of achieving a goal relative to the current Bayesian equation. In an illustrative example, the perturbation may modulate a coefficient of a parameter in the Bayesian equation. In some examples, the perturbation may include assessment of whether adding or removing a parameter may improve the probability of success of achieving the goal. The parameters of the Bayesian equation may include, for example, current state information, alone or in combination with sensor input values and/or historical information, for example. In some implementations, the robot may advantageously autonomously optimize its operations by perturbing a current Bayesian equation associated with, for example, a current goal, sub-goal, task, or probability of success criteria.
-
Citations
20 Claims
-
1. An autonomous robotic system for learning, the system comprising:
-
a mobile platform having a central body; a displacement module operably connected to the central body and configured to displace the mobile platform; a payload module releasably connected to the central body and configured to perform a mission specific operation; an input module configured to receive a user input command; a controller disposed within the central body and operably connected to the displacement module and the input module; a data store operably coupled to the controller, wherein the data store comprises a program of instructions that, when executed by the controller, cause the controller to perform operations to adaptively optimize a control system of the mobile platform, the operations comprising; receive, via the input module, a predetermined target goal; store the predetermined goal in the data store; retrieve the predetermined target goal from the data store; retrieve a set of parameters associated with the predetermined target goal; retrieve a set of coefficients associated with the retrieved set of parameters; determine a current success probability of achieving the predetermined target goal based on a Bayesian equation formed by the retrieved set of parameters and the retrieved set of coefficients; receive a perturbation signal; modify a selected one of the retrieved coefficients or a selected one of the retrieved parameters in response to the received perturbation signal; determine a perturbed success probability based on the Bayesian equation using the selected one of the retrieved coefficients or the selected one of the retrieved parameters as modified by the received perturbation signal; and
,if the perturbed success probability exceeds the current success probability, then store the selected one of the retrieved coefficients or the selected one of the retrieved parameters as modified by the received perturbation signal and in association with the predetermined target goal. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An autonomous robotic system for learning, the system comprising:
-
a mobile platform having a central body; a displacement module operably connected to the central body and configured to displace the mobile platform; a payload module releasably connected to the central body and configured to perform a mission specific operation; a controller disposed within the central body and operably connected to the displacement module; a data store operably coupled to the controller, wherein the data store comprises a program of instructions that, when executed by the controller, cause the controller to perform operations to adaptively optimize a control system of the mobile platform, the operations comprising; retrieve a predetermined target goal from the data store; retrieve a set of parameters associated with the predetermined target goal; retrieve a set of coefficients associated with the retrieved set of parameters; determine a current success probability of achieving the predetermined target goal based on a Bayesian equation formed by the retrieved set of parameters and the retrieved set of coefficients; receive a perturbation signal; modify a selected one of the retrieved coefficients or a selected one of the retrieved parameters in response to the received perturbation signal; determine a perturbed success probability based on the Bayesian equation using the selected one of the retrieved coefficients or the selected one of the retrieved parameters as modified by the received perturbation signal; and
,if the perturbed success probability exceeds the current success probability, then store the selected one of the retrieved coefficients or the selected one of the retrieved parameters as modified by the received perturbation signal and in association with the predetermined target goal. - View Dependent Claims (12, 13, 14, 15)
-
-
16. An autonomous robotic system for learning, the system comprising:
-
a mobile platform having a central body; means for displacing the central body; a payload module releasably connected to the central body and configured to perform a mission specific operation; a controller disposed within the central body and operably connected to the displacement module; a data store operably coupled to the controller, wherein the data store comprises a program of instructions that, when executed by the controller, cause the controller to perform operations to adaptively optimize a control system of the mobile platform, the operations comprising; retrieve a predetermined target goal from the data store; retrieve a set of parameters associated with the predetermined target goal; retrieve a set of coefficients associated with the retrieved set of parameters; determine a current success probability of achieving the predetermined target goal based on a Bayesian equation formed by the retrieved set of parameters and the retrieved set of coefficients; receive a perturbation signal; modify a selected one of the retrieved coefficients or a selected one of the retrieved parameters in response to the received perturbation signal; determine a perturbed success probability based on the Bayesian equation using the selected one of the retrieved coefficients or the selected one of the retrieved parameters as modified by the received perturbation signal; and
,if the perturbed success probability exceeds the current success probability, then store the selected one of the retrieved coefficients or the selected one of the retrieved parameters as modified by the received perturbation signal and in association with the predetermined target goal. - View Dependent Claims (17, 18, 19, 20)
-
Specification