Deep convolutional neural network prediction of image professionalism
First Claim
1. A computerized method of training and utilizing a deep convolutional neural network (DCNN) to gauge professionalism of a subject in a digital image, the method comprising:
training the DCNN by:
inputting a plurality of sample images to the DCNN, each of the sample images having been labeled with a professionalism score, the inputting including, for each sample image:
passing the image to a convolutional layer of the DCNN, the convolutional layer comprising one or more filters having dynamically adjustable weights, the one or more filters configured to filter the image to produce an output volume for the corresponding image, the output volume comprising a different feature map for each of the one or more filters;
passing the output volume from the convolutional layer through a nonlinearity layer, the nonlinearity layer applying a nonlinearity function to the output volume from the convolutional layer;
passing the output volume from the nonlinearity layer through a pooling layer, the pooling layer lowering spatial dimensions of the output volume from the nonlinearity layer;
passing the output volume from the pooling layer through a classification layer, the classification layer comprising a specialized convolutional layer having a filter designed to output a professionalism score for the image based on the output volume from the pooling layer; and
passing the image through a loss layer, the loss layer applying a loss function to the image, resulting in an indication of a level of error in the professionalism score for the image from the classification layer in comparison to the professionalism score from the label of the image;
determining whether a combination of the levels of error for the plurality of sample images transgresses a preset threshold; and
in response to a determination that the combination of the levels of error transgresses a preset threshold, updating weights of the one or more filters in the convolutional layers of the DCNN to reduce the combination of the levels of error and repeating the training of the DCNN using a different plurality of sample images and the updated weights;
generating a professionalism score for the digital image by:
inputting the digital image to the DCNN, including:
passing the image to the convolutional layer, generating output;
passing the output from the convolutional layer to the nonlinearity layer, generating output;
passing the output from the nonlinearity layer to the pooling layer, generating output; and
passing output from the nonlinearity layer to the classification layer, generating a professionalism score for the digital image.
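The scoring path recited above (convolutional layer, nonlinearity layer, pooling layer, classification layer) can be illustrated with a minimal pure-Python sketch. It is not the patented implementation: the function names, the choice of ReLU and 2x2 max pooling, the single-channel image, and the toy filter values are all assumptions made for illustration, and the classification layer is modeled as a filter that spans the whole pooled volume and emits one number.

```python
def conv2d(image, kernel):
    """Slide one filter over a single-channel image (valid padding, stride 1)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[i + a][j + b] * kernel[a][b]
                 for a in range(kh) for b in range(kw))
             for j in range(out_w)]
            for i in range(out_h)]


def relu(feature_map):
    """Nonlinearity layer. ReLU is an assumed choice; the claim names no function."""
    return [[max(0.0, v) for v in row] for row in feature_map]


def max_pool(feature_map, size=2):
    """Pooling layer: non-overlapping max pooling lowers the spatial dimensions."""
    rows = range(0, len(feature_map) - size + 1, size)
    cols = range(0, len(feature_map[0]) - size + 1, size)
    return [[max(feature_map[i + a][j + b]
                 for a in range(size) for b in range(size))
             for j in cols]
            for i in rows]


def professionalism_score(image, conv_filters, clf_filter, clf_bias=0.0):
    """Run one image through conv -> nonlinearity -> pooling -> classification."""
    # Output volume: one feature map per convolutional filter.
    volume = [conv2d(image, k) for k in conv_filters]
    volume = [relu(m) for m in volume]
    volume = [max_pool(m) for m in volume]
    # Classification layer, modeled as a filter covering the entire pooled
    # volume, so the "convolution" has exactly one position and one output.
    score = clf_bias
    for feature_map, weights in zip(volume, clf_filter):
        for values, ws in zip(feature_map, weights):
            score += sum(v * w for v, w in zip(values, ws))
    return score


if __name__ == "__main__":
    # Hypothetical 6x6 image and two 3x3 filters, chosen only for the demo.
    image = [[1.0] * 6 for _ in range(6)]
    conv_filters = [[[1.0] * 3 for _ in range(3)],
                    [[-1.0] * 3 for _ in range(3)]]
    clf_filter = [[[0.1, 0.1], [0.1, 0.1]],
                  [[1.0, 1.0], [1.0, 1.0]]]
    print(professionalism_score(image, conv_filters, clf_filter))
```

In this toy setup the second filter produces only negative responses, so the nonlinearity zeroes its feature map and only the first filter contributes to the score.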
2 Assignments
0 Petitions
Abstract
In an example embodiment, a deep convolutional neural network (DCNN) is created to assign a professionalism score to an input image. The professionalism score indicates a perceived professionalism of a subject of the input image. The DCNN is designed to automatically learn features of images relevant to the professionalism through a training process.
30 Citations
20 Claims
1. A computerized method of training and utilizing a deep convolutional neural network (DCNN) to gauge professionalism of a subject in a digital image (recited in full under First Claim above).
Dependent claims: 2, 3, 4, 5, 6, 7
8. A system comprising:
a processor;
a computer readable medium having instructions stored thereon, which, when executed by the processor, cause the system to:
train a DCNN by:
inputting a plurality of sample images to the DCNN, each of the sample images having been labeled with a professionalism score, the inputting including, for each sample image:
passing the image to a convolutional layer of the DCNN, the convolutional layer comprising one or more filters having dynamically adjustable weights, the one or more filters configured to filter the image to produce an output volume for the corresponding image, the output volume comprising a different feature map for each of the one or more filters;
passing the output volume from the convolutional layer through a nonlinearity layer, the nonlinearity layer applying a nonlinearity function to the output volume from the convolutional layer;
passing the output volume from the nonlinearity layer through a pooling layer, the pooling layer lowering spatial dimensions of the output volume from the nonlinearity layer;
passing the output volume from the pooling layer through a classification layer, the classification layer comprising a specialized convolutional layer having a filter designed to output a professionalism score for the image based on the output volume from the pooling layer; and
passing the image through a loss layer, the loss layer applying a loss function to the image, resulting in an indication of a level of error in the professionalism score for the image from the classification layer in comparison to the professionalism score from the label of the image;
determining whether a combination of the levels of error for the plurality of sample images transgresses a preset threshold; and
in response to a determination that the combination of the levels of error transgresses a preset threshold, updating weights of the one or more filters in the convolutional layers of the DCNN to reduce the combination of the levels of error and repeating the training of the DCNN using a different plurality of sample images and the updated weights;
generate a professionalism score for the digital image by:
inputting the digital image to the DCNN, including:
passing the image to the convolutional layer, generating output;
passing the output from the convolutional layer to the nonlinearity layer, generating output;
passing the output from the nonlinearity layer to the pooling layer, generating output; and
passing output from the nonlinearity layer to the classification layer, generating a professionalism score for the digital image.
Dependent claims: 9, 10, 11, 12, 13, 14
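The training loop in this claim (per-image loss, a combined error compared against a preset threshold, weight updates, and repetition on a different plurality of sample images) might be sketched as follows. Everything concrete here is an assumption: squared error as the loss, the mean as the "combination", finite-difference gradient descent standing in for whatever update rule an embodiment uses, the synthetic brightness-based labels, and all names and hyperparameters.

```python
import random


def predict(image, w):
    """Conv (one 2x2 filter) -> ReLU -> 2x2 max pool -> classification filter."""
    conv_k, clf_k, bias = w[0:4], w[4:8], w[8]
    n = len(image) - 1  # output size of a valid 2x2 convolution
    fmap = [[max(0.0,  # ReLU nonlinearity
                 image[i][j] * conv_k[0] + image[i][j + 1] * conv_k[1]
                 + image[i + 1][j] * conv_k[2] + image[i + 1][j + 1] * conv_k[3])
             for j in range(n)] for i in range(n)]
    pooled = [max(fmap[i][j], fmap[i][j + 1], fmap[i + 1][j], fmap[i + 1][j + 1])
              for i in range(0, n - 1, 2) for j in range(0, n - 1, 2)]
    return sum(p * k for p, k in zip(pooled, clf_k)) + bias


def combined_error(batch, w):
    """Loss layer plus combination: mean squared error against the labels."""
    return sum((predict(img, w) - label) ** 2 for img, label in batch) / len(batch)


def train(samples, w, threshold=0.01, lr=0.05, batch_size=8, max_rounds=500, seed=0):
    """Update weights until a batch's combined error is within the threshold."""
    rng = random.Random(seed)
    w = list(w)
    for rounds in range(max_rounds):
        batch = rng.sample(samples, batch_size)  # a different plurality each round
        if combined_error(batch, w) <= threshold:
            return w, rounds
        # Central finite differences approximate the gradient (assumed stand-in
        # for backpropagation; slow, but short and dependency-free).
        grad = []
        for i in range(len(w)):
            up, down = list(w), list(w)
            up[i] += 1e-4
            down[i] -= 1e-4
            grad.append((combined_error(batch, up) - combined_error(batch, down)) / 2e-4)
        w = [wi - lr * g for wi, g in zip(w, grad)]
    return w, max_rounds


def make_samples(n=64, seed=1):
    """Synthetic 5x5 'images' labeled by mean brightness (hypothetical labels)."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        b = rng.random()
        img = [[min(1.0, max(0.0, b + rng.uniform(-0.05, 0.05))) for _ in range(5)]
               for _ in range(5)]
        samples.append((img, sum(map(sum, img)) / 25.0))
    return samples


if __name__ == "__main__":
    samples = make_samples()
    w0 = [0.25] * 4 + [0.1] * 4 + [0.0]  # conv filter, classification filter, bias
    w1, rounds = train(samples, w0)
    print(rounds, combined_error(samples, w0), combined_error(samples, w1))
```

The `max_rounds` cap is a practical guard added for the sketch; the claim itself only conditions repetition on the threshold test.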
15. A non-transitory machine-readable storage medium comprising instructions, which when implemented by one or more machines, cause the one or more machines to perform operations comprising:
training a DCNN by:
inputting a plurality of sample images to the DCNN, each of the sample images having been labeled with a professionalism score, the inputting including, for each sample image:
passing the image to a convolutional layer of the DCNN, the convolutional layer comprising one or more filters having dynamically adjustable weights, the one or more filters configured to filter the image to produce an output volume for the corresponding image, the output volume comprising a different feature map for each of the one or more filters;
passing the output volume from the convolutional layer through a nonlinearity layer, the nonlinearity layer applying a nonlinearity function to the output volume from the convolutional layer;
passing the output volume from the nonlinearity layer through a pooling layer, the pooling layer lowering spatial dimensions of the output volume from the nonlinearity layer;
passing the output volume from the pooling layer through a classification layer, the classification layer comprising a specialized convolutional layer having a filter designed to output a professionalism score for the image based on the output volume from the pooling layer; and
passing the image through a loss layer, the loss layer applying a loss function to the image, resulting in an indication of a level of error in the professionalism score for the image from the classification layer in comparison to the professionalism score from the label of the image;
determining whether a combination of the levels of error for the plurality of sample images transgresses a preset threshold; and
in response to a determination that the combination of the levels of error transgresses a preset threshold, updating weights of the one or more filters in the convolutional layers of the DCNN to reduce the combination of the levels of error and repeating the training of the DCNN using a different plurality of sample images and the updated weights;
generating a professionalism score for the digital image by:
inputting the digital image to the DCNN, including:
passing the image to the convolutional layer, generating output;
passing the output from the convolutional layer to the nonlinearity layer, generating output;
passing the output from the nonlinearity layer to the pooling layer, generating output; and
passing output from the nonlinearity layer to the classification layer, generating a professionalism score for the digital image.
Dependent claims: 16, 17, 18, 19, 20
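The claim language about an output volume with one feature map per filter, and a pooling layer that lowers spatial dimensions, can be made concrete with standard shape arithmetic. The helper names, the 32x32 input, and the filter and window sizes below are illustrative assumptions; the claims do not fix any of these values.

```python
def conv_output_shape(height, width, filter_size, num_filters, stride=1, padding=0):
    """Shape of a convolutional layer's output volume: (rows, cols, feature maps)."""
    out_h = (height + 2 * padding - filter_size) // stride + 1
    out_w = (width + 2 * padding - filter_size) // stride + 1
    return (out_h, out_w, num_filters)  # depth equals the number of filters


def pool_output_shape(height, width, depth, window=2, stride=2):
    """Pooling shrinks rows and columns but leaves the number of feature maps."""
    return ((height - window) // stride + 1,
            (width - window) // stride + 1,
            depth)


if __name__ == "__main__":
    volume = conv_output_shape(32, 32, filter_size=5, num_filters=6)
    print(volume)                      # (28, 28, 6)
    print(pool_output_shape(*volume))  # (14, 14, 6)
```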
Specification