Activation layers for deep learning networks
First Claim
Patent Images
1. A computer-implemented method, comprising:
- training a convolutional neural network using a set of a training data, the training data including instances of data with determined classifications;
receiving a query from a client device associated with a user;
processing the query using the trained convolutional neural network to determine a classification of a data object represented in the query, the trained convolutional neural network containing at least one convolutional layer and at least one activation layer, the at least one activation layer including a generalized linear unit, the generalized linear unit having a functional form described using a pair of straight lines with three parameters including a first slope in a positive region, a second slope in a negative region, and an offset applied to the first slope and the second slope, the three parameters learnable over at least one input channel;
determining a set of features corresponding to the classification; and
providing, to the client device, information for at least a subset of the set of features.
0 Assignments
0 Petitions
Accused Products
Abstract
Tasks such as object classification from image data can take advantage of a deep learning process using convolutional neural networks. These networks can include a convolutional layer followed by an activation layer, or activation unit, among other potential layers. Improved accuracy can be obtained by using a generalized linear unit (GLU) as an activation unit in such a network, where a GLU is linear for both positive and negative inputs, and is defined by a positive slope, a negative slope, and a bias. These parameters can be learned for each channel or a block of channels, and stacking those types of activation units can further improve accuracy.
5 Citations
17 Claims
-
1. A computer-implemented method, comprising:
-
training a convolutional neural network using a set of a training data, the training data including instances of data with determined classifications; receiving a query from a client device associated with a user; processing the query using the trained convolutional neural network to determine a classification of a data object represented in the query, the trained convolutional neural network containing at least one convolutional layer and at least one activation layer, the at least one activation layer including a generalized linear unit, the generalized linear unit having a functional form described using a pair of straight lines with three parameters including a first slope in a positive region, a second slope in a negative region, and an offset applied to the first slope and the second slope, the three parameters learnable over at least one input channel; determining a set of features corresponding to the classification; and providing, to the client device, information for at least a subset of the set of features. - View Dependent Claims (2, 3)
-
-
4. A computer-implemented method, comprising:
-
training a convolutional neural network using a set of a training data, the training data including instances of data objects with determined classifications; receiving a query; processing the query using the trained convolutional neural network to determine a classification of a data segment represented in the query, the trained convolutional neural network containing at least one convolutional layer and at least one activation layer, the at least one activation layer including a generalized linear unit (GLU), the GLU having three parameters including a first slope in a positive region, a second slope in a negative region, and an offset applied to the first slope and the second slope, the three parameters learnable over at least one input channel; determining a set of data segments corresponding to the classification; and providing, to a client device, information for at least a subset of the set of data segments. - View Dependent Claims (5, 6, 7, 8, 9, 10)
-
-
11. A system, comprising:
-
at least one processor; and memory including instructions that, when executed by the at least one processor, cause the system to; train a convolutional neural network using a set of a training data, the training data including instances of data with determined classifications; receive query data; process the query data using the trained convolutional neural network to determine a classification of at least a portion of the query data, the trained convolutional neural network containing at least one convolutional layer and at least one activation layer, the at least one activation layer including a generalized linear unit (GLU), the GLU having three parameters including a first slope in a positive region, a second slope in a negative region, and an offset applied to the first slope and the second slope, the three parameters learnable over at least one input channel; determine a set of data objects corresponding to the classification; and provide, in response to a request, information for at least a subset of the set of data objects. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
Specification