DEEP LEARNING MODEL USED FOR IMAGE RECOGNITION AND TRAINING APPARATUS OF THE MODEL AND METHOD THEREOF
First Claim
1. A deep learning model used for image recognition, the model comprising:
- a plurality of convolutional layers configured to extract features from an input image in turn and output a plurality of feature maps of identical sizes;
a determination layer configured to, according to positions where objects of attention in the input image are located, determine whether features related to positions contained in the feature maps are features of the positions where the objects of attention are located;
a compositing layer configured to, according to an output result of the determination layer, perform weight and composition processing on the features in the plurality of feature maps outputted by the plurality of convolutional layers, weights of the features of the positions where the objects of attention are located being different from weights of other features; and
a fully-connected layer configured to output a recognition result according to the plurality of feature maps after being weight and composition processed by the compositing layer.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments of this disclosure provide a deep learning model used for image recognition and apparatus and method thereof. The model includes a determination layer configured to determine whether features in feature maps are features of positions where objects of attention are located, and different weights are granted for the positions where the objects of attention are located and other features in performing weight and composition processing on the features. Hence, the model may be guided to be focused on attention features and make correct determination, thereby improving performance and precision of the model.
2 Citations
10 Claims
-
1. A deep learning model used for image recognition, the model comprising:
-
a plurality of convolutional layers configured to extract features from an input image in turn and output a plurality of feature maps of identical sizes; a determination layer configured to, according to positions where objects of attention in the input image are located, determine whether features related to positions contained in the feature maps are features of the positions where the objects of attention are located; a compositing layer configured to, according to an output result of the determination layer, perform weight and composition processing on the features in the plurality of feature maps outputted by the plurality of convolutional layers, weights of the features of the positions where the objects of attention are located being different from weights of other features; and a fully-connected layer configured to output a recognition result according to the plurality of feature maps after being weight and composition processed by the compositing layer. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
Specification