Image assessment using deep convolutional neural networks

US 9,536,293 B2
Filed: 07/30/2014
Issued: 01/03/2017
Est. Priority Date: 07/30/2014
Status: Active Grant

First Claim

Patent Images

1. A non-transitory computer storage medium comprising computer-useable instructions that, when used by one or more computing devices, cause the one or more computing devices to perform operations comprising:

implementing a deep convolutional neural network that is trained to learn and classify image features for a set of images;

receiving an image from the set of images;

extracting a global image representation of the image as one or more global inputs to a first column of the deep convolutional neural network;

extracting a local image representation of the image as one or more fine-grained inputs to a second column of the deep convolutional neural network, each convolutional layer of the first column being independent from each convolutional layer of the second column, the first column and the second column in convolutional layers being in different spatial scales;

merging at least one layer of the first column with at least one layer of the second column into a fully connected layer;

using the fully connected layer to calculate a probability of each input being assigned to a class for a particular feature;

averaging results associated with each input associated with the image;

classifying at least one feature for the image using the class with the highest probability; and

providing the classified at least one image feature for use in an image processing task.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Deep convolutional neural networks receive local and global representations of images as inputs and learn the best representation for a particular feature through multiple convolutional and fully connected layers. A double-column neural network structure receives each of the local and global representations as two heterogeneous parallel inputs to the two columns. After some layers of transformations, the two columns are merged to form the final classifier. Additionally, features may be learned in one of the fully connected layers. The features of the images may be leveraged to boost classification accuracy of other features by learning a regularized double-column neural network.

61 Citations

View as Search Results

20 Claims

1. A non-transitory computer storage medium comprising computer-useable instructions that, when used by one or more computing devices, cause the one or more computing devices to perform operations comprising:
- implementing a deep convolutional neural network that is trained to learn and classify image features for a set of images;
  
  receiving an image from the set of images;
  
  extracting a global image representation of the image as one or more global inputs to a first column of the deep convolutional neural network;
  
  extracting a local image representation of the image as one or more fine-grained inputs to a second column of the deep convolutional neural network, each convolutional layer of the first column being independent from each convolutional layer of the second column, the first column and the second column in convolutional layers being in different spatial scales;
  
  merging at least one layer of the first column with at least one layer of the second column into a fully connected layer;
  
  using the fully connected layer to calculate a probability of each input being assigned to a class for a particular feature;
  
  averaging results associated with each input associated with the image;
  
  classifying at least one feature for the image using the class with the highest probability; and
  
  providing the classified at least one image feature for use in an image processing task.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
- - 2. The non-transitory computer storage medium of claim 1, further comprising resizing the image to create the global image representation.
  - 3. The non-transitory computer storage medium of claim 1, further comprising resizing the image by warping the image into a normalized input with a fixed size.
  - 4. The non-transitory computer storage medium of claim 1, further comprising resizing the image by normalizing its shorter side to a normalized input with a fixed length S and center-cropping the normalized input to generate a s×
    - s×
      
      3 input.
  - 5. The non-transitory computer storage medium of claim 1, further comprising resizing the image by normalizing a longer side of the image to a fixed length s and generating a normalized input of a fixed size s×
    - s×
      
      3 by padding border pixels with zero.
  - 6. The non-transitory computer storage medium of claim 1, further comprising randomly cropping the image into a normalized input with a fixed size to create the local image representation, the local image representation preserving details of the image in the original high-resolution format.
  - 7. The non-transitory computer storage medium of claim 1, wherein an architecture associated with each column in the deep convolutional neural network is the same for each column.
  - 8. The non-transitory computer storage medium of claim 1, wherein an architecture associated with each column in the deep convolutional neural network is different for each column.
  - 9. The non-transitory computer storage medium of claim 1, further comprising adding one or more additional columns with additional normalized inputs to form a multi-column convolutional neural network.
  - 10. The non-transitory computer storage medium of claim 1, wherein an architecture associated with each column in the deep convolutional neural network comprises at least four convolutional layers and at least two fully-connected layers.
  - 11. The non-transitory computer storage medium of claim 10, further comprising extracting one or more features from the image at one of the fully-connected layers.
  - 12. The non-transitory computer storage medium of claim 10, further comprising replacing a last layer of the deep convolutional neural network with a regression.
  - 13. The non-transitory computer storage medium of claim 1, wherein the particular image feature is one of aesthetics, style, or scene.

14. A computer-implemented method comprising:
- implementing a double-column deep convolutional neural network (DCNN) that is trained to learn and classify features for a set of images;
  
  extracting a global image representation of an image as a global input to a first column of the DCNN;
  
  extracting a local image representation of the image as a fine-grained input to a second column of the DCNN, each convolutional layer of the first column being independent from each convolutional layer of the second column, the first column and the second column in convolutional layers being in different spatial scales;
  
  merging at least one layer of the first column with at least one layer of the second column into a fully connected layer;
  
  jointly training weights associated with the fully connected layer;
  
  classifying at least one feature for the image using the fully connected layer; and
  
  providing the classified at least one image feature for use in an image processing task.
- View Dependent Claims (15, 16, 17, 18)
- - 15. The method of claim 14, further comprising automatically discovering global and local features of an image from the fully connected layer and a layer immediately preceding the fully connected layer.
  - 16. The method of claim 14, further comprising back propagating error in each column with stochastic gradient descent.
  - 17. The method of claim 14, further comprising adding one or more additional columns with additional normalized inputs to form a multi-column convolutional neural network.
  - 18. The method of claim 14, wherein the image processing task comprises one of searching for an image or editing an image.

19. A computerized system comprising:
- one or more processors; and
  
  one or more computer storage media storing computer-useable instructions that, when used by the one or more processors, cause the one or more processors to;
  
  implement a double-column deep convolutional neural network (DCNN) to train the DCNN to learn and classify features for a set of images;
  
  extract a global image representation of an image as a global input to a first column of the DCNN;
  
  extract a local image representation of the image as a fine-grained input to a second column of the DCNN, each convolutional layer of the first column being independent from each convolutional layer of the second column, the first column and the second column in convolutional layers being in different spatial scales;
  
  merge at least one layer of the first column with at least one layer of the second column into a fully connected layer;
  
  learn or classify at least one feature for the image using the fully connected layer; and
  
  provide the classified at least one image feature for use in an image processing task.
- View Dependent Claims (20)
- - 20. The computerized system of claim 19, wherein the image processing task comprises one of searching for an image or editing an image.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Adobe Inc.
Original Assignee
Adobe Systems Incorporated (Adobe Inc.)
Inventors
Lin, Zhe, Jin, Hailin, Yang, Jianchao
Primary Examiner(s)
Kholdebarin, Iman K

Application Number

US14/447,290
Publication Number

US 20160035078A1
Time in Patent Office

888 Days
Field of Search
US Class Current

1/1
CPC Class Codes

G06F 18/2415   based on parametric or prob...

G06F 18/28   Determining representative ...

G06N 3/045   Combinations of networks

G06N 3/084   Backpropagation, e.g. using...

G06T 7/0002   Inspection of images, e.g. ...

G06V 10/454   Integrating the filters int...

G06V 10/772   Determining representative ...

Image assessment using deep convolutional neural networks

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

61 Citations

20 Claims

Specification

Use Cases

Quick Links

Others

Image assessment using deep convolutional neural networks

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

61 Citations

20 Claims

Specification

Subscription Required

Use Cases

Quick Links

Others