Image assessment using deep convolutional neural networks
First Claim
1. A non-transitory computer storage medium comprising computer-useable instructions that, when used by one or more computing devices, cause the one or more computing devices to perform operations comprising:
- implementing a deep convolutional neural network that is trained to learn and classify image features for a set of images;
receiving an image from the set of images;
extracting a global image representation of the image as one or more global inputs to a first column of the deep convolutional neural network;
extracting a local image representation of the image as one or more fine-grained inputs to a second column of the deep convolutional neural network, each convolutional layer of the first column being independent from each convolutional layer of the second column, the first column and the second column in convolutional layers being in different spatial scales;
merging at least one layer of the first column with at least one layer of the second column into a fully connected layer;
using the fully connected layer to calculate a probability of each input being assigned to a class for a particular feature;
averaging results associated with each input associated with the image;
classifying at least one feature for the image using the class with the highest probability; and
providing the classified at least one image feature for use in an image processing task.
2 Assignments
0 Petitions
Accused Products
Abstract
Deep convolutional neural networks receive local and global representations of images as inputs and learn the best representation for a particular feature through multiple convolutional and fully connected layers. A double-column neural network structure receives each of the local and global representations as two heterogeneous parallel inputs to the two columns. After some layers of transformations, the two columns are merged to form the final classifier. Additionally, features may be learned in one of the fully connected layers. The features of the images may be leveraged to boost classification accuracy of other features by learning a regularized double-column neural network.
61 Citations
20 Claims
-
1. A non-transitory computer storage medium comprising computer-useable instructions that, when used by one or more computing devices, cause the one or more computing devices to perform operations comprising:
-
implementing a deep convolutional neural network that is trained to learn and classify image features for a set of images; receiving an image from the set of images; extracting a global image representation of the image as one or more global inputs to a first column of the deep convolutional neural network; extracting a local image representation of the image as one or more fine-grained inputs to a second column of the deep convolutional neural network, each convolutional layer of the first column being independent from each convolutional layer of the second column, the first column and the second column in convolutional layers being in different spatial scales; merging at least one layer of the first column with at least one layer of the second column into a fully connected layer; using the fully connected layer to calculate a probability of each input being assigned to a class for a particular feature; averaging results associated with each input associated with the image; classifying at least one feature for the image using the class with the highest probability; and providing the classified at least one image feature for use in an image processing task. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computer-implemented method comprising:
-
implementing a double-column deep convolutional neural network (DCNN) that is trained to learn and classify features for a set of images; extracting a global image representation of an image as a global input to a first column of the DCNN; extracting a local image representation of the image as a fine-grained input to a second column of the DCNN, each convolutional layer of the first column being independent from each convolutional layer of the second column, the first column and the second column in convolutional layers being in different spatial scales; merging at least one layer of the first column with at least one layer of the second column into a fully connected layer; jointly training weights associated with the fully connected layer; classifying at least one feature for the image using the fully connected layer; and providing the classified at least one image feature for use in an image processing task. - View Dependent Claims (15, 16, 17, 18)
-
-
19. A computerized system comprising:
-
one or more processors; and one or more computer storage media storing computer-useable instructions that, when used by the one or more processors, cause the one or more processors to; implement a double-column deep convolutional neural network (DCNN) to train the DCNN to learn and classify features for a set of images; extract a global image representation of an image as a global input to a first column of the DCNN; extract a local image representation of the image as a fine-grained input to a second column of the DCNN, each convolutional layer of the first column being independent from each convolutional layer of the second column, the first column and the second column in convolutional layers being in different spatial scales; merge at least one layer of the first column with at least one layer of the second column into a fully connected layer; learn or classify at least one feature for the image using the fully connected layer; and provide the classified at least one image feature for use in an image processing task. - View Dependent Claims (20)
-
Specification