Method and apparatus for training a neural network to detect objects in an image
First Claim
Patent Images
1. An architecture of a plurality of neural networks for processing an input signal that is decomposed into a plurality of resolution levels, said architecture comprising:
- a first neural network for receiving input from a fine resolution level of said input signal;
a second neural network for receiving input from a coarse resolution level of said input signal; and
a third neural network having an input layer for receiving input from a resolution level that is coarser than said fine resolution level and finer than said coarse resolution level of said input signal and for receiving inputs from an output layer of said first neural network and from an output layer of said second neural network.
2 Assignments
0 Petitions
Accused Products
Abstract
A signal processing apparatus and concomitant method for learning and integrating features from multiple resolutions for detecting and/or classifying objects. The signal processing apparatus comprises a hierarchical pyramid of neural networks (HPNN) having a “fine-to-coarse” structure or a combination of the “fine-to-coarse” and the “coarse-to-fine” structures.
162 Citations
3 Claims
-
1. An architecture of a plurality of neural networks for processing an input signal that is decomposed into a plurality of resolution levels, said architecture comprising:
-
a first neural network for receiving input from a fine resolution level of said input signal;
a second neural network for receiving input from a coarse resolution level of said input signal; and
a third neural network having an input layer for receiving input from a resolution level that is coarser than said fine resolution level and finer than said coarse resolution level of said input signal and for receiving inputs from an output layer of said first neural network and from an output layer of said second neural network.
-
-
2. An architecture of a plurality of neural networks for processing an input signal that is decomposed into a plurality of resolution levels, said architecture comprising:
-
a first neural network for receiving input from a fine resolution level of said input signal;
a second neural network for receiving input from a first intermediate resolution level that is coarser than said fine resolution level and for receiving input from said first neural network;
a third neural network for receiving input from said first intermediate resolution level and for receiving input from a neural network that is receiving input from a resolution level that is coarser than said first intermediate resolution level;
a fourth neural network for receiving inputs from a coarse resolution level of said input signal;
a fifth neural network for receiving input from a second intermediate resolution level that is finer than said coarse resolution level and for receiving input from said fourth neural network; and
a sixth neural network for receiving input from said second intermediate resolution level and for receiving input from a neural network that is receiving input from a resolution level that is finer than said second intermediate resolution level. - View Dependent Claims (3)
a seventh neural network for receiving input from said second and said third neural networks; and
an eighth neural network for receiving input from said fifth and said sixth neural networks.
-
Specification