Learning method and learning device for pooling ROI by using masking parameters to be used for mobile devices or compact networks via hardware optimization, and testing method and testing device using the same
First Claim
1. A method for pooling at least one ROI by using one or more masking parameters, comprising steps of:
- (a) a computing device, if an input image is acquired, instructing a convolutional layer of a CNN to generate a feature map corresponding to the input image;
(b) the computing device instructing an RPN of the CNN to determine the ROI corresponding to at least one object included in the input image by using the feature map; and
(c) the computing device instructing an ROI pooling layer of the CNN to apply each of pooling operations correspondingly to each of sub-regions in the ROI by referring to each of the masking parameters corresponding to each of the pooling operations, to thereby generate a masked pooled feature map.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for pooling at least one ROI by using one or more masking parameters is provided. The method is applicable to mobile devices, compact networks, and the like via hardware optimization. The method includes steps of: (a) a computing device, if an input image is acquired, instructing a convolutional layer of a CNN to generate a feature map corresponding to the input image; (b) the computing device instructing an RPN of the CNN to determine the ROI corresponding to at least one object included in the input image by using the feature map; (c) the computing device instructing an ROI pooling layer of the CNN to apply each of pooling operations correspondingly to each of sub-regions in the ROI by referring to each of the masking parameters corresponding to each of the pooling operations, to thereby generate a masked pooled feature map.
-
Citations
24 Claims
-
1. A method for pooling at least one ROI by using one or more masking parameters, comprising steps of:
-
(a) a computing device, if an input image is acquired, instructing a convolutional layer of a CNN to generate a feature map corresponding to the input image; (b) the computing device instructing an RPN of the CNN to determine the ROI corresponding to at least one object included in the input image by using the feature map; and (c) the computing device instructing an ROI pooling layer of the CNN to apply each of pooling operations correspondingly to each of sub-regions in the ROI by referring to each of the masking parameters corresponding to each of the pooling operations, to thereby generate a masked pooled feature map. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A testing method for pooling at least one ROI by using one or more masking parameters, comprising steps of:
-
(a) on condition that (1) a computing device has instructed a convolutional layer of a CNN to generate a feature map for training corresponding to a training image, (2) the computing device has instructed an RPN of the CNN to determine the ROI for training corresponding to at least one object for training included in the training image by using the feature map for training, (3) the computing device has instructed an ROI pooling layer of the CNN to apply each of pooling operations correspondingly to each of sub-regions for training in the ROI for training by referring to each of the masking parameters corresponding to each of the pooling operations, to thereby generate a masked pooled feature map for training, (4) the computing device has instructed an output layer of the CNN to generate CNN output values for training by applying neural network operations to the masked pooled feature map for training, and (5) the computing device has instructed a loss layer of the CNN to generate loss values by referring to the CNN output values for training and their corresponding GTs, and learning the masking parameters of the ROI pooling layer by backpropagating the loss values;
a testing device, if a test image is acquired, instructing the convolutional layer of the CNN to generate a feature map for testing corresponding to the test image;(b) the testing device instructing the RPN of the CNN to determine the ROI for testing corresponding to at least one object for testing included in the test image by using the feature map for testing; (c) the testing device instructing the ROI pooling layer of the CNN to apply each of the pooling operations correspondingly to each of sub-regions for testing in the ROI for testing by referring to each of the masking parameters corresponding to each of the pooling operations, to thereby generate a masked pooled feature map for testing; and (d) the testing device instructing the output layer of the CNN to generate CNN output values for testing by applying the neural network operations to the masked pooled feature map for testing. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A computing device for pooling at least one ROI by using one or more masking parameters, comprising:
-
at least one memory that stores instructions; and at least one processor configured to execute the instructions to;
perform processes of (I) instructing a convolutional layer of a CNN to generate a feature map corresponding to an input image;
(II) instructing an RPN of the CNN to determine the ROI corresponding to at least one object included in the input image by using the feature map; and
(III) instructing an ROI pooling layer of the CNN to apply each of pooling operations correspondingly to each of sub-regions in the ROI by referring to each of the masking parameters corresponding to each of the pooling operations, to thereby generate a masked pooled feature map. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
-
20. A testing device for pooling at least one ROI by using one or more masking parameters, comprising:
-
at least one memory that stores instructions; and at least one processor, on condition that (1) a computing device has instructed a convolutional layer of a CNN to generate a feature map for training corresponding to a training image, (2) the computing device has instructed an RPN of the CNN to determine the ROI for training corresponding to at least one object for training included in the training image by using the feature map for training, (3) the computing device has instructed an ROI pooling layer of the CNN to apply each of pooling operations correspondingly to each of sub-regions for training in the ROI for training by referring to each of the masking parameters corresponding to each of the pooling operations, to thereby generate a masked pooled feature map for training, (4) the computing device has instructed an output layer of the CNN to generate CNN output values for training by applying neural network operations to the masked pooled feature map for training, and (5) the computing device has instructed a loss layer of the CNN to generate loss values by referring to the CNN output values for training and their corresponding GTs, and learning the masking parameters of the ROI pooling layer by backpropagating;
the loss values;
configured to execute the instructions to;
perform processes of (I) instructing the convolutional layer of the CNN to generate a feature map for testing corresponding to a test image;
(II) instructing the RPN of the CNN to determine the ROI for testing corresponding to at least one object for testing included in the test image by using the feature map for testing;
(III) instructing the ROI pooling layer of the CNN to apply each of the pooling operations correspondingly to each of sub-regions for testing in the ROI for testing by referring to each of the masking parameters corresponding to each of the pooling operations, to thereby generate a masked pooled feature map for testing; and
(IV) instructing the output layer of the CNN to generate CNN output values for testing by applying the neural network operations to the masked pooled feature map for testing. - View Dependent Claims (21, 22, 23, 24)
-
Specification