Learning image categorization using related attributes
First Claim
1. A non-transitory computer storage medium comprising computer-useable instructions that, when used by one or more computing devices, cause the one or more computing devices to perform operations comprising:
- implementing a regularized double-column convolutional neural network (RDCNN) to classify image features for a set of images, the implementing comprising;
receiving an image from the set of images;
training a first feature column in a first neural network of the RDCNN using a first set of image representations as inputs to the first feature column, the trained first feature column having fixed parameters;
generating a first set of image attributes by the trained first feature column using the fixed parameters; and
training a second feature multi-column in a second neural network of the RDCNN using the generated first set of image attributes, wherein the parameters of the trained first feature column remain fixed, wherein the second feature multi-column comprises at least two columns that are independent in each convolutional layer of the second feature multi-column, andwherein each of the at least two columns receives a different input; and
identifying a class associated with the second feature multi-column for the image using the implemented RDCNN.
2 Assignments
0 Petitions
Accused Products
Abstract
A first set of attributes (e.g., style) is generated through pre-trained single column neural networks and leveraged to regularize the training process of a regularized double-column convolutional neural network (RDCNN). Parameters of the first column (e.g., style) of the RDCNN are fixed during RDCNN training. Parameters of the second column (e.g., aesthetics) are fine-tuned while training the RDCNN and the learning process is supervised by the label identified by the second column (e.g., aesthetics). Thus, features of the images may be leveraged to boost classification accuracy of other features by learning a RDCNN.
-
Citations
20 Claims
-
1. A non-transitory computer storage medium comprising computer-useable instructions that, when used by one or more computing devices, cause the one or more computing devices to perform operations comprising:
-
implementing a regularized double-column convolutional neural network (RDCNN) to classify image features for a set of images, the implementing comprising; receiving an image from the set of images; training a first feature column in a first neural network of the RDCNN using a first set of image representations as inputs to the first feature column, the trained first feature column having fixed parameters; generating a first set of image attributes by the trained first feature column using the fixed parameters; and training a second feature multi-column in a second neural network of the RDCNN using the generated first set of image attributes, wherein the parameters of the trained first feature column remain fixed, wherein the second feature multi-column comprises at least two columns that are independent in each convolutional layer of the second feature multi-column, and wherein each of the at least two columns receives a different input; and identifying a class associated with the second feature multi-column for the image using the implemented RDCNN. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A computer-implemented method comprising:
-
implementing a regularized double-column convolutional neural network (RDCNN) to classify image features for a set of images, the implementing comprising; receiving an image from the set of images; extracting a local image representation of the image; training a first feature column in a first neural network of the RDCNN utilizing the extracted local image representation as an input to the first feature column of the RDCNN, the first feature column associated with style and having fixed parameters; extracting a global image representation of the image; and training a second feature multi-column in a second neural network of the RDCNN utilizing the fixed parameters of the trained first feature column and utilizing the extracted global image representation as one or more global inputs to the second feature multi-column of the RDCNN, and utilizing the local image representation of the image as one or more fine-grained inputs to the second feature multi-column of the RDCNN, the second feature multi-column associated with aesthetics, wherein the second feature multi-column comprises at least two columns that are independent in each convolutional layer of the second feature multi-column, and each of the at least two columns receives a different input; and identifying a class associated with the second feature multi-column for the image utilizing the implemented RDCNN. - View Dependent Claims (16, 17, 18, 19)
-
-
20. A computerized system comprising:
-
one or more processors; and one or more computer storage media storing computer-useable instructions that, when used by the one or more processors, cause the one or more processors to; implement a regularized double-column convolutional neural network (RDCNN) to classify image features for a set of images, the implementing to; receive an image from the set of images; train a first feature column in a first neural network of the RDCNN using a first set of image representations as inputs to the first feature column, the trained first feature column having fixed parameters; and train a second feature multi-column in a second neural network of the RDCNN using the fixed parameters of the trained first feature column, and using a second set of image representations as inputs to the second feature multi-column, wherein the second feature multi-column comprises at least two columns that are independent in each convolutional layer of the second feature multi-column, and each of the at least two columns receives a different input; and identify a class associated with the second feature multi-column for the image using the implemented RDCNN.
-
Specification