Structure learning in convolutional neural networks
First Claim
Patent Images
1. A method implemented with a processor, comprising:
- creating a neural network;
generating output from the neural network;
identifying a low performing layer from the neural network, the low performing layer having a relatively lower performance than a performance of another layer in the neural network;
inserting a new specialist layer at the low performing layer; and
repeating the act of identifying and the act of inserting until a top of the neural network is reached.
3 Assignments
0 Petitions
Accused Products
Abstract
The present disclosure provides an improved approach to implement structure learning of neural networks by exploiting correlations in the data/problem the networks aim to solve. A greedy approach is described that finds bottlenecks of information gain from the bottom convolutional layers all the way to the fully connected layers. Rather than simply making the architecture deeper, additional computation and capacitance is only added where it is required.
-
Citations
33 Claims
-
1. A method implemented with a processor, comprising:
-
creating a neural network; generating output from the neural network; identifying a low performing layer from the neural network, the low performing layer having a relatively lower performance than a performance of another layer in the neural network; inserting a new specialist layer at the low performing layer; and repeating the act of identifying and the act of inserting until a top of the neural network is reached. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system, comprising:
-
a processor; a memory for holding programmable code; and wherein the programmable code includes instructions for creating a neural network;
generating output from the neural network;
identifying a low performing layer from the neural network, the low performing layer having a relatively lower performance than a performance of another layer in the neural network;
inserting a new specialist layer at the low performing layer; and
repeating the act of identifying and the act of inserting until a top of the neural network is reached. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A computer program product embodied on a non-transitory computer readable medium, the non-transitory computer readable medium having stored thereon a sequence of instructions which, when executed by a processor causes the processor to execute a method comprising:
-
creating a neural network; generating output from the neural network; identifying a low performing layer from the neural network, the low performing layer having a relatively lower performance than a performance of another layer in the neural network; inserting a new specialist layer at the low performing layer; and repeating the act of identifying and the act of inserting until a top of the neural network is reached. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 33)
-
Specification