Activation layers for deep learning networks
First Claim
Patent Images
1. A computer-implemented method, comprising:
- obtaining a convolutional neural network containing at least one convolutional layer and at least one activation layer, the at least one activation layer including a generalized linear unit, the generalized linear unit having a functional form described using a pair of straight lines with three parameters including a first slope in a positive region, a second slope in a negative region, and an offset applied to the first slope and the second slope, the three parameters learnable over at least one input channel;
training the convolutional neural network using a first subset of a training data set, the training data set including instances of image data with defined labels;
measuring accuracy of the convolutional neural network after the training by classifying images in a test set;
receiving a query image from a client device associated with a user;
processing the query image using the convolutional neural network to determine a classification of an object represented in the query image;
determining a set of items corresponding to the classification; and
providing, to the client device, information for at least a subset of the set of items.
1 Assignment
0 Petitions
Accused Products
Abstract
Tasks such as object classification from image data can take advantage of a deep learning process using convolutional neural networks. These networks can include a convolutional layer followed by an activation layer, or activation unit, among other potential layers. Improved accuracy can be obtained by using a generalized linear unit (GLU) as an activation unit in such a network, where a GLU is linear for both positive and negative inputs, and is defined by a positive slope, a negative slope, and a bias. These parameters can be learned for each channel or a block of channels, and stacking those types of activation units can further improve accuracy.
-
Citations
20 Claims
-
1. A computer-implemented method, comprising:
-
obtaining a convolutional neural network containing at least one convolutional layer and at least one activation layer, the at least one activation layer including a generalized linear unit, the generalized linear unit having a functional form described using a pair of straight lines with three parameters including a first slope in a positive region, a second slope in a negative region, and an offset applied to the first slope and the second slope, the three parameters learnable over at least one input channel; training the convolutional neural network using a first subset of a training data set, the training data set including instances of image data with defined labels; measuring accuracy of the convolutional neural network after the training by classifying images in a test set; receiving a query image from a client device associated with a user; processing the query image using the convolutional neural network to determine a classification of an object represented in the query image; determining a set of items corresponding to the classification; and providing, to the client device, information for at least a subset of the set of items. - View Dependent Claims (2, 3, 4)
-
-
5. A computer-implemented method, comprising:
-
training a convolutional neural network using a set of a training data, the training data including instances of image data with determined classifications; receiving query image data; processing the query image data using the convolutional neural network to determine a classification of an object represented in the query image data, the convolutional neural network containing at least one convolutional layer and at least one activation layer, the at least one activation layer including a generalized linear unit (GLU), the GLU having three parameters including a first slope in a positive region, a second slope in a negative region, and an offset applied to the first slope and the second slope, the three parameters learnable over at least one input channel; determining a set of items corresponding to the classification; and providing, to a client device, information for at least a subset of the set of items. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12)
-
-
13. A system, comprising:
-
at least one processor; and memory including instructions that, when executed by the at least one processor, cause the system to; train a convolutional neural network using a set of a training data, the training data including instances of image data with determined classifications; receive query data; process the query data using the convolutional neural network to determine a classification of at least a portion of the query data, the convolutional neural network containing at least one convolutional layer and at least one activation layer, the at least one activation layer including a generalized linear unit (GLU), the GLU having three parameters including a first slope in a positive region, a second slope in a negative region, and an offset applied to the first slope and the second slope, the three parameters learnable over at least one input channel; determine a set of items corresponding to the classification; and provide, in response to a request, information for at least a subset of the set of items. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification