Suboptimal immediate navigational response based on long term planning

US 10,627,830 B2
Filed: 06/06/2019
Issued: 04/21/2020
Est. Priority Date: 01/05/2016
Status: Active Grant

First Claim

Patent Images

1. A navigation system for a host vehicle, the system comprising:

at least one processing device programmed to;

receive, from a camera, a plurality of images representative of an environment of the host vehicle;

analyze the plurality of images to identify a present navigational state associated with the host vehicle;

determine a first potential navigational action for the host vehicle based on the identified present navigational state;

determine a first indicator of an expected reward based on the first potential navigational action and the identified present navigational state;

predict a first future navigational state based on the first potential navigational action;

determine a second indicator of an expected reward associated with at least one future action determined to be available to the host vehicle in response to the first future navigational state;

determine a second potential navigational action for the host vehicle based on the identified present navigational state;

determine a third indicator of an expected reward based on the second potential navigational action and the identified present navigational state;

predict a second future navigational state based on the second potential navigational action;

determine a fourth indicator of an expected reward associated with at least one future action determined to be available to the host vehicle in response to the second future navigational state;

select the second potential navigational action based on a determination that, while the expected reward associated with the first indicator is greater than the expected reward associated with the third indicator, the expected reward associated with the fourth indicator is greater than the expected reward associated with the first indicator; and

cause at least one adjustment of a navigational actuator of the host vehicle in response to the selected second potential navigational action.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A system for navigating an autonomous vehicle using reinforcement learning techniques is provided. The system includes at least one processing device programmed to: receive, from a camera, a plurality of images representative of an environment of the host vehicle; analyze the plurality of images to identify a present navigational state associated with the host vehicle; select a second potential navigational action based on a determination that an expected reward associated with a fourth indicator is greater than an expected reward associated with a second indicator.

Citations

11 Claims

1. A navigation system for a host vehicle, the system comprising:
- at least one processing device programmed to;
  
  receive, from a camera, a plurality of images representative of an environment of the host vehicle;
  
  analyze the plurality of images to identify a present navigational state associated with the host vehicle;
  
  determine a first potential navigational action for the host vehicle based on the identified present navigational state;
  
  determine a first indicator of an expected reward based on the first potential navigational action and the identified present navigational state;
  
  predict a first future navigational state based on the first potential navigational action;
  
  determine a second indicator of an expected reward associated with at least one future action determined to be available to the host vehicle in response to the first future navigational state;
  
  determine a second potential navigational action for the host vehicle based on the identified present navigational state;
  
  determine a third indicator of an expected reward based on the second potential navigational action and the identified present navigational state;
  
  predict a second future navigational state based on the second potential navigational action;
  
  determine a fourth indicator of an expected reward associated with at least one future action determined to be available to the host vehicle in response to the second future navigational state;
  
  select the second potential navigational action based on a determination that, while the expected reward associated with the first indicator is greater than the expected reward associated with the third indicator, the expected reward associated with the fourth indicator is greater than the expected reward associated with the first indicator; and
  
  cause at least one adjustment of a navigational actuator of the host vehicle in response to the selected second potential navigational action.
- View Dependent Claims (2, 3, 4, 5)
- - 2. The navigation system of claim 1, wherein the navigational actuator includes at least one of a steering mechanism, a brake, or an accelerator.
  - 3. The navigation system of claim 1, wherein the first potential navigational action includes a merge in front of a target vehicle and wherein the second potential navigational action includes a merge behind the target vehicle.
  - 4. The navigation system of claim 1, wherein a difference between the expected reward associated with the fourth indicator and the expected reward associated with the first indicator is greater than a difference between the expected reward associated with the third indicator and the expected reward associated with the first indicator.
  - 5. The navigation system of claim 1, wherein a difference between the expected reward associated with the fourth indicator and the expected reward associated with the second indicator is greater than a difference between the expected reward associated with the third indicator and the expected reward associated with the first indicator.

6. An autonomous vehicle, the autonomous vehicle comprising:
- a frame;
  
  a body attached to the frame;
  
  a camera; and
  
  at least one processing device programmed to;
  
  receive, from the camera, a plurality of images representative of an environment of the autonomous vehicle;
  
  analyze the plurality of images to identify a present navigational state associated with the autonomous vehicle;
  
  determine a first potential navigational action for the autonomous vehicle based on the identified present navigational state;
  
  determine a first indicator of an expected reward based on the first potential navigational action and the identified present navigational state;
  
  predict a first future navigational state based on the first potential navigational action;
  
  determine a second indicator of an expected reward associated with at least one future action determined to be available to the autonomous vehicle in response to the first future navigational state;
  
  determine a second potential navigational action for the autonomous vehicle based on the identified present navigational state;
  
  determine a third indicator of an expected reward based on the second potential navigational action and the identified present navigational state;
  
  predict a second future navigational state based on the second potential navigational action;
  
  determine a fourth indicator of an expected reward associated with at least one future action determined to be available to the autonomous vehicle in response to the second future navigational state;
  
  select the second potential navigational action based on a determination that, while the expected reward associated with the first indicator is greater than the expected reward associated with the third indicator, the expected reward associated with the fourth indicator is greater than the expected reward associated with the first indicator; and
  
  cause at least one adjustment of a navigational actuator of the autonomous vehicle in response to the selected second potential navigational action.
- View Dependent Claims (7, 8, 9, 10)
- - 7. The autonomous vehicle of claim 6, wherein the navigational actuator includes at least one of a steering mechanism, a brake, or an accelerator.
  - 8. The autonomous vehicle of claim 6, wherein the first potential navigational action includes a merge in front of a target vehicle and wherein the second potential navigational action includes a merge behind the target vehicle.
  - 9. The autonomous vehicle of claim 6, wherein a difference between the expected reward associated with the fourth indicator and the expected reward associated with the first indicator is greater than a difference between the expected reward associated with the third indicator and the expected reward associated with the first indicator.
  - 10. The autonomous vehicle of claim 6, wherein a difference between the expected reward associated with the fourth indicator and the expected reward associated with the second indicator is greater than a difference between the expected reward associated with the third indicator and the expected reward associated with the first indicator.

11. A method for navigating an autonomous vehicle, the method comprising:
- receiving, from a camera, a plurality of images representative of an environment of the autonomous vehicle;
  
  analyzing the plurality of images to identify a present navigational state associated with the autonomous vehicle;
  
  determining a first potential navigational action for the autonomous vehicle based on the identified present navigational state;
  
  determining a first indicator of an expected reward based on the first potential navigational action and the identified present navigational state;
  
  predicting a first future navigational state based on the first potential navigational action;
  
  determining a second indicator of an expected reward associated with at least one future action determined to be available to the autonomous vehicle in response to the first future navigational state;
  
  determining a second potential navigational action for the autonomous vehicle based on the identified present navigational state;
  
  determining a third indicator of an expected reward based on the second potential navigational action and the identified present navigational state;
  
  predicting a second future navigational state based on the second potential navigational action;
  
  determining a fourth indicator of an expected reward associated with at least one future action determined to be available to the autonomous vehicle in response to the second future navigational state;
  
  selecting the second potential navigational action based on a determination that, while the expected reward associated with the first indicator is greater than the expected reward associated with the third indicator, the expected reward associated with the fourth indicator is greater than the expected reward associated with the first indicator; and
  
  causing at least one adjustment of a navigational actuator of the autonomous vehicle in response to the selected second potential navigational action.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
MobilEye Vision Technologies Ltd. (Intel Corporation)
Original Assignee
MobilEye Vision Technologies Ltd. (Intel Corporation)
Inventors
Stein, Gideon, Shalev-Shwartz, Shai, Shammah, Shaked, Shashua, Amnon
Primary Examiner(s)
Dager, Jonathan M

Application Number

US16/433,926
Publication Number

US 20190286155A1
Time in Patent Office

320 Days
Field of Search

None
US Class Current
CPC Class Codes

B60W 2050/0088   Adaptive recalibration

B60W 2420/403   Image sensing, e.g. optical...

B60W 2520/16   Pitch

B60W 2520/18   Roll

B60W 2552/15   Road slope, i.e. the inclin...

B60W 2552/30   Road curve radius

B60W 2552/53   Road markings, e.g. lane ma...

B60W 2554/00   Input parameters relating t...

B60W 2554/20   Static objects

B60W 2554/4029   Pedestrians

B60W 2554/4046   Behavior, e.g. aggressive o...

B60W 2554/802   Longitudinal distance

B60W 2554/804   Relative longitudinal speed

B60W 30/09   Taking automatic action to ...

B60W 30/0953   the prediction being respon...

B60W 30/0956   the prediction being respon...

B60W 30/18163   Lane change; Overtaking man...

B60W 50/0097   Predicting future conditions

B60W 50/045   Monitoring control system p...

B60W 60/0011   involving control alternati...

B60W 60/00276 : for two or more other traff...

G01C 21/34 : Route searching; Route guid...

G01C 21/3453 : Special cost functions, i.e...

G01C 21/3602 : Input other than that of de...

G05D 1/0055 : with safety arrangements

G05D 1/0231 : using optical position dete...

G05D 1/0246 : using a video camera in com...

G05D 1/0253 : extracting relative motion ...

G06N 20/00 : Machine learning

G06N 3/00 : Computing arrangements base...

G06N 3/006 : based on simulated virtual ...

G06N 3/044 : Recurrent networks, e.g. Ho...

G06N 3/08 : Learning methods

G06N 5/046 : Forward inferencing; Produc...

G06N 7/01 : Probabilistic graphical mod...

G06V 20/56 : exterior to a vehicle by us...

G06V 20/58 : Recognition of moving objec...

G06V 20/584 : of vehicle lights or traffi...

G06V 20/588 : Recognition of the road, e....

View All

Suboptimal immediate navigational response based on long term planning

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

11 Claims

Specification

Solutions

Use Cases

Quick Links

Suboptimal immediate navigational response based on long term planning

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

11 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links