Learning method, learning device using regression loss and testing method, testing device using the same
First Claim
1. A method for learning one or more parameters of a CNN based on one or more regression losses, comprising steps of:
- (a) learning device instructing a first convolutional layer to an n-th convolutional layer to respectively and sequentially generate a first encoded feature map to an n-th encoded feature map from at least one input image as a training image;
(b) the learning device instructing an n-th deconvolutional layer to a first deconvolutional layer to sequentially generate an n-th decoded feature map to first decoded feature map from the n-th encoded feature map;
(c) the learning device, on condition that each cell of a grid with a plurality of rows and a plurality of columns is generated by dividing at least one specific decoded feature map, among the n-th decoded feature map to the first decoded feature map, with respect to a first direction and a second direction, wherein the first direction is in a direction of the rows of the specific decoded feature map and the second direction is in a direction of the columns thereof, generating at least one obstacle segmentation result representing each of specific rows, where each of bottom lines of each of nearest obstacles is estimated as being located per each of the columns, by referring to at least one feature of at least part of the n-th decoded feature map to the first decoded feature map;
(d) the learning device generating the regression losses referring to each of respective differences of distances between (i) each location of exact rows where each of the bottom lines of each of the nearest obstacles is truly located per each of the columns on at least one GT, for each of the columns, and (ii) each location of the specific rows where each of the bottom lines of each of the nearest obstacles is estimated as being located per each of the columns on the obstacle segmentation result; and
(e) the learning device backpropagating the regression losses, to thereby learn the parameters of the CNN.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for learning parameters of a CNN based on regression losses is provided. The method includes steps of: a learning device instructing a first to an n-th convolutional layers to generate a first to an n-th encoded feature maps; instructing an n-th to a first deconvolutional layers to generate an n-th to a first decoded feature maps from the n-th encoded feature map; generating an obstacle segmentation result by referring to a feature of the decoded feature maps; generating the regression losses by referring to differences of distances between each location of the specific rows, where bottom lines of nearest obstacles are estimated as being located per each of columns of a specific decoded feature map, and each location of exact rows, where the bottom lines are truly located per each of the columns on a GT; and backpropagating the regression losses, to thereby learn the parameters.
20 Citations
28 Claims
-
1. A method for learning one or more parameters of a CNN based on one or more regression losses, comprising steps of:
-
(a) learning device instructing a first convolutional layer to an n-th convolutional layer to respectively and sequentially generate a first encoded feature map to an n-th encoded feature map from at least one input image as a training image; (b) the learning device instructing an n-th deconvolutional layer to a first deconvolutional layer to sequentially generate an n-th decoded feature map to first decoded feature map from the n-th encoded feature map; (c) the learning device, on condition that each cell of a grid with a plurality of rows and a plurality of columns is generated by dividing at least one specific decoded feature map, among the n-th decoded feature map to the first decoded feature map, with respect to a first direction and a second direction, wherein the first direction is in a direction of the rows of the specific decoded feature map and the second direction is in a direction of the columns thereof, generating at least one obstacle segmentation result representing each of specific rows, where each of bottom lines of each of nearest obstacles is estimated as being located per each of the columns, by referring to at least one feature of at least part of the n-th decoded feature map to the first decoded feature map; (d) the learning device generating the regression losses referring to each of respective differences of distances between (i) each location of exact rows where each of the bottom lines of each of the nearest obstacles is truly located per each of the columns on at least one GT, for each of the columns, and (ii) each location of the specific rows where each of the bottom lines of each of the nearest obstacles is estimated as being located per each of the columns on the obstacle segmentation result; and (e) the learning device backpropagating the regression losses, to thereby learn the parameters of the CNN. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A testing method by using a CNN capable of detecting one or more nearest obstacles based on one or more regression losses, comprising steps of:
-
(a) a testing device acquiring at least one test image as at least one input image, on condition that the learning device performed processes of (i) instructing a first convolutional layer an n-th convolutional layer to respectively and sequentially generate a first encoded feature map for training to an n-th encoded feature map for training from at least one training image, (ii) instructing an n-th deconvolutional layer to a first deconvolutional laver to sequentially generate an n-th decoded feature map for training to a first decoded feature map for training from the n-th encoded feature map for training, (iii) assuming that each cell of a grid with a plurality of rows and a plurality of columns is generated by dividing at least one specific decoded feature map for training, among the n-th decoded feature map for training to the first decoded feature map for training, with respect to a first direction and a second direction, wherein the first direction is in a direction of the rows of the specific decoded feature map for training and the second direction is in a direction of the columns thereof, generating at least one obstacle segmentation result for training representing each of specific rows for training, where each of bottom lines of each of nearest obstacles for training is estimated as being located per each of the columns, by referring to at least one feature of at least part of the n-th decoded feature map for training to the first decoded feature map for training, (iv) generating the regression losses by referring to each of respective differences of distances between (iv-1) each location of exact rows where each of the bottom lines of each of the nearest obstacles for training is truly located per each of the columns on at least one GT, for each of the columns, and (iv-2) each location of the specific rows for training where each of the bottom lines of each of the nearest obstacles for training is estimated as being located per each of the columns on the obstacle segmentation result for training, and (v) backpropagating the regression losses, to thereby learn one or more parameters of the CNN; (b) the testing device instructing the first convolutional layer to the n-th convolutional layer to respectively and sequentially generate a first encoded feature map for testing to an n-th encoded feature map for testing from the test image; (c) the testing device instructing the n-th deconvolutional layer to the first deconvolutional layer to sequentially generate an n-th decoded feature map for testing to a first decoded feature map for testing from the n-th encoded feature map for testing; and (d) the testing device, assuming that each cell of a grid with a plurality of rows and a plurality of columns is generated by dividing at least one specific decoded feature map for testing, among the n-th decoded feature map for testing to the first decoded feature map for testing, with respect to the first direction and the second direction, wherein the first direction is in a direction of the rows of the specific decoded feature map for testing and the second direction is in a direction of the columns thereof, generating at least one obstacle segmentation result for testing representing each of specific rows for testing, where each of bottom lines of each of nearest obstacles for testing is estimated as being located per each of the columns, by referring to at least one feature of at least part of the n-th decoded feature map for testing to the first decoded feature map for testing. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A learning device for learning one or more parameters of a CNN based on one or more regression losses, comprising a processor configured to perform the processes of:
acquiring at least one input image as a training image; and
a processor for performing processes of (I) instructing a first convolutional layer to an n-th convolutional layer to respectively and sequentially generate a first encoded feature map to an n-th encoded feature map from the input, (II) instructing an n-th deconvolutional layer to a first deconvolutional layer to sequentially generate an n-th decoded feature map to a first decoded feature map from the n-th encoded feature map, (III) on condition that each cell of a grid with a plurality of rows and a plurality of columns is generated by dividing at least one specific decoded feature map, among the n-th decoded feature map to the first decoded feature map, with respect to a first direction and a second direction, wherein the first direction is in a direction of the rows of the specific decoded feature map and the second direction is in a direction of the columns thereof, generating at least one obstacle segmentation result representing each of specific rows, where each of bottom lines of each of nearest obstacles is estimated as being located per each of the columns, by referring to at least one feature of at least part of the n-th decoded feature map to the first decoded feature map, (IV) generating the regression losses by referring to each of respective differences of distances between (i) each location of exact rows where each of the bottom lines of each of the nearest obstacles is truly located per each of the columns on at least one ground truth (GT), for each of the columns, and (ii) each location of the specific rows where each of the bottom lines of each of the nearest obstacles is estimated as being located per each of the columns on the obstacle segmentation result, and (V) backpropagating the regression losses, to thereby learn the parameters of the CNN.- View Dependent Claims (16, 17, 18, 19, 20, 21)
-
22. A testing device by using a CNN capable of detecting one or more nearest obstacles based on one or more regression losses, comprising a learning device with a processor configured to perform the processes of:
acquiring at least one test image as at least one input image, on condition that the learning device has performed processes of (i) instructing a first convolutional layer to an n-th convolutional layer to respectively and sequentially generate a first encoded feature map for training to an n-th encoded feature map for training from at least one training image, (ii) instructing an n-th deconvolutional layer to a first deconvolutional layer to sequentially generate an n-th decoded feature map for training to a first decoded feature map for training from the n-th encoded feature map for training, (iii) assuming that each cell of a grid with a plurality of rows and a plurality of columns is generated by dividing at least one specific decoded 5 feature map for training, among the n-th decoded feature map for training to the first decoded feature map for training, with respect to a first direction and a second direction, wherein the first direction is in a direction of the rows of the specific decoded feature map for training and the second direction is in a direction of the columns thereof, generating at least one obstacle segmentation result for training representing each of specific rows for training, where each of bottom lines of each of nearest obstacles for training is estimated as being located per each of the columns, by referring to at least one feature of at least part of the n-th decoded feature map for training to the first decoded feature map for training, (iv) generating the regression losses by referring to each of respective differences of distances between (iv-1) each location of exact rows where each of the bottom lines of each of the nearest obstacles for training is truly located per each of the columns on at least one ground truth (GT), for each of the columns, and (iv-2) each location of the;
specific rows for training where each of the bottom lines of each of the nearest obstacles for training is estimated as being located per each of the columns on the obstacle segmentation result for training, and(v) backpropagating the regression losses, to thereby learn one or more parameters of the CNN; and (I) instructing the first convolutional layer to the n-th convolutional layer to respectively and sequentially generate a first encoded feature map for testing to an n-th encoded feature map for testing from the test image, (II) instructing the n-th deconvolutional layer to the first deconvolutional layer to sequentially generate an n-th decoded feature map for testing to a first decoded feature map for testing from the n-th encoded feature map for testing, and (III) assuming that each cell of a grid with a plurality of rows and a plurality of columns is generated by dividing at least one specific decoded feature map for testing, among the n-th decoded feature map for testing to the first decoded feature map for testing, with respect to the first direction and the second direction, wherein the first direction is in a direction of the rows of the specific decoded feature map for testing and the second direction is in a direction of the columns thereof, generating at least one obstacle segmentation result for testing representing each of specific rows for testing, where each of bottom lines of each of nearest Obstacles for testing is estimated as being located per each of the columns, by referring to at least one feature of at least part of the n-th decoded feature map for testing to the first decoded feature map for testing. - View Dependent Claims (23, 24, 25, 26, 27, 28)
Specification