×

Online learning and vehicle control method based on reinforcement learning without active exploration

  • US 10,065,654 B2
  • Filed: 07/08/2016
  • Issued: 09/04/2018
  • Est. Priority Date: 07/08/2016
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer-implemented method of adaptively controlling an autonomous operation of a vehicle, the method comprising:

  • a) in a critic network in a computing system configured to autonomously control the vehicle, determining, using samples of passively collected data and a state cost, an estimated average cost, and an approximated cost-to-go function that produces a minimum value for a cost-to-go of the vehicle when applied by an actor network; and

    b) in an actor network in the computing system and operatively coupled to the critic network, determining a control input to apply to the vehicle which produces the minimum value for the cost-to-go,wherein the actor network is configured to determine the control input by estimating a noise level using the estimated average cost, an estimated cost-to-go determined from the approximated cost-to-go function, a control dynamics for a current state of the vehicle, and the samples of passively collected data, andwherein the approximated cost-to-go function is determined using a linear combination of weighted radial basis functions in accordance with the following relationship;

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×