Method and system for vision-centric deep-learning-based road situation analysis
First Claim
1. A method for vision-centric deep-learning-based road situation analysis, comprising:
- receiving real-time traffic environment visual input from at least one camera;
determining, using a recurrent you only look once (ROLO) engine, at least one initial region of interest from the real-time traffic environment visual input by using a convolutional neural networks (CNN) training method;
verifying, using the recurrent you only look once (ROLO) engine, the at least one initial region of interest to determine if a detected object in the at least one initial region of interest is a candidate object to be tracked by using the CNN training method;
in response to determining the detected object is a candidate object, tracking, using a plurality of long short-term memory units (LSTMs), the detected object based on the real-time traffic environment visual input, and predicting a future status of the detected object by using the CNN training method; and
determining if a warning signal is to be presented to a driver of a vehicle based on the predicted future status of the detected object.
1 Assignment
0 Petitions
Accused Products
Abstract
In accordance with various embodiments of the disclosed subject matter, a method and a system for vision-centric deep-learning-based road situation analysis are provided. The method can include: receiving real-time traffic environment visual input from a camera; determining, using a ROLO engine, at least one initial region of interest from the real-time traffic environment visual input by using a CNN training method; verifying the at least one initial region of interest to determine if a detected object in the at least one initial region of interest is a candidate object to be tracked; using LSTMs to track the detected object based on the real-time traffic environment visual input, and predicting a future status of the detected object by using the CNN training method; and determining if a warning signal is to be presented to a driver of a vehicle based on the predicted future status of the detected object.
-
Citations
20 Claims
-
1. A method for vision-centric deep-learning-based road situation analysis, comprising:
-
receiving real-time traffic environment visual input from at least one camera; determining, using a recurrent you only look once (ROLO) engine, at least one initial region of interest from the real-time traffic environment visual input by using a convolutional neural networks (CNN) training method; verifying, using the recurrent you only look once (ROLO) engine, the at least one initial region of interest to determine if a detected object in the at least one initial region of interest is a candidate object to be tracked by using the CNN training method; in response to determining the detected object is a candidate object, tracking, using a plurality of long short-term memory units (LSTMs), the detected object based on the real-time traffic environment visual input, and predicting a future status of the detected object by using the CNN training method; and determining if a warning signal is to be presented to a driver of a vehicle based on the predicted future status of the detected object. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for vision-centric deep-learning-based road situation analysis, comprising:
-
at least one camera for receiving real-time traffic environment visual input; a recurrent you only look once (ROLO) engine configured for; determining at least one initial region of interest from the real-time traffic environment visual input by using a convolutional neural networks (CNN) training method, and verifying the at least one initial region of interest to determine if a detected object in the at least one initial region of interest is a candidate object to be tracked by using the CNN training method; a plurality of long short-term memory units (LSTMs) configured for; in response to determining the detected object is a candidate object, tracking the detected object based on the real-time traffic environment visual input, and predicting a future status of the detected object by using the CNN training method; and a decision making agent for determining if a warning signal to be presented to a driver of a vehicle based on the predicted future status of the detected object. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification