MULTI-LAYER FUSION IN A CONVOLUTIONAL NEURAL NETWORK FOR IMAGE CLASSIFICATION
First Claim
1. A method of constructing a convolutional neural network (CNN) for domain adaptation utilizing features extracted from multiple levels, including:
- selecting a CNN architecture including a plurality of convolutional layers and fully connected layers;
training the CNN on a source domain data set;
selecting a plurality of layers from the plurality of convolutional layers across the trained CNN;
extracting features from the selected layers from the trained CNN;
concatenating the extracted features to form a feature vector;
connecting the feature vector to a fully connected neural network classifier; and
, fine-tuning the fully connected neural network classifier from a target domain data set.
6 Assignments
0 Petitions
Accused Products
Abstract
A method and system for domain adaptation based on multi-layer fusion in a convolutional neural network architecture for feature extraction and a two-step training and fine-tuning scheme. The architecture concatenates features extracted at different depths of the network to form a fully connected layer before the classification step. First, the network is trained with a large set of images from a source domain as a feature extractor. Second, for each new domain (including the source domain), the classification step is fine-tuned with images collected from the corresponding site. The features from different depths are concatenated with and fine-tuned with weights adjusted for a specific task. The architecture is used for classifying high occupancy vehicle images.
54 Citations
24 Claims
-
1. A method of constructing a convolutional neural network (CNN) for domain adaptation utilizing features extracted from multiple levels, including:
-
selecting a CNN architecture including a plurality of convolutional layers and fully connected layers; training the CNN on a source domain data set; selecting a plurality of layers from the plurality of convolutional layers across the trained CNN; extracting features from the selected layers from the trained CNN; concatenating the extracted features to form a feature vector; connecting the feature vector to a fully connected neural network classifier; and
, fine-tuning the fully connected neural network classifier from a target domain data set. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 11, 12, 13, 15, 16, 17, 18, 19, 20, 22, 23)
-
-
10. An image classification system comprising:
a computer programmed to perform classification of an input image from a target domain by operations including; processing the input image using a convolutional neural network (CNN) having a plurality of network layers and trained on a source domain training set; processing outputs of at least a fraction of the plurality of network layers of the CNN using a features fusion network trained on a target domain training set to generate a classification of the input image. - View Dependent Claims (14)
-
21. A method of adapting a convolutional neural network (CNN) trained to classify images of a source domain to a target domain, the adaptation method comprising:
-
inputting features output by at least a fraction of the network levels of the CNN into a features fusion network outputting a weighted combination of the inputted features; and training weights of the weighted combination of inputted features using images in a target domain different from the source domain.
-
-
24. A method of operating a convolutional neural network (CNN) for image classification across multiple domains, comprising:
-
capturing a data representation of an image via an imaging device; processing the data representation in the CNN to extract features therein relevant to the image classification including associating CNN levels with the extracted features, and fusing selected ones of the levels in a concatenated features network, wherein the selected ones of the levels have a selective weighting for effecting the image classification; and
,determining an output image classification for the image from the concatenated features network wherein the fusion of the selected ones of the levels varies among the multiple domains.
-
Specification