Training a neural network to detect objects in images
First Claim
1. A system for detecting objects in images, the system comprising:
- an object detection neural network implemented by one or more computers, the object detection neural network comprising;
one or more hidden neural network layers configured to;
receive an input image, andprocess the input image to generate a hidden layer output; and
an output layer configured to;
receive the hidden layer output,apply a first transformation to the hidden layer output to generate a first output that defines a predetermined number of bounding boxes in the input image, andapply one or more second transformations to the hidden layer output to generate a second output that includes a respective confidence score for each of the bounding boxes that represents a likelihood that the bounding box contains an image of an object.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to detect object in images. One of the methods includes receiving a training image and object location data for the training image; providing the training image to a neural network and obtaining bounding box data for the training image from the neural network, wherein the bounding box data comprises data defining a plurality of candidate bounding boxes in the training image and a respective confidence score for each candidate bounding box in the training image; determining an optimal set of assignments using the object location data for the training image and the bounding box data for the training image, wherein the optimal set of assignments assigns a respective candidate bounding box to each of the object locations; and training the neural network on the training image using the optimal set of assignments.
-
Citations
20 Claims
-
1. A system for detecting objects in images, the system comprising:
an object detection neural network implemented by one or more computers, the object detection neural network comprising; one or more hidden neural network layers configured to; receive an input image, and process the input image to generate a hidden layer output; and an output layer configured to; receive the hidden layer output, apply a first transformation to the hidden layer output to generate a first output that defines a predetermined number of bounding boxes in the input image, and apply one or more second transformations to the hidden layer output to generate a second output that includes a respective confidence score for each of the bounding boxes that represents a likelihood that the bounding box contains an image of an object. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
8. One or more non-transitory computer storage media storing instructions that when executed by one or more computers cause the one or more computers to implement an object detection neural network, the object detection neural network comprising:
-
one or more hidden neural network layers configured to; receive an input image, and process the input image to generate a hidden layer output; and an output layer configured to; receive the hidden layer output, apply a first transformation to the hidden layer output to generate a first output that defines a predetermined number of bounding boxes in the input image, and apply one or more second transformations to the hidden layer output to generate a second output that includes a respective confidence score for each of the bounding boxes that represents a likelihood that the bounding box contains an image of an object. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A method performed by one or more computers, the method comprising:
-
receiving an input image; processing the input image using one or more hidden neural network layers that are configured to; receive the input image, and process the input image to generate a hidden layer output; and processing the hidden layer output using an output layer that is configured to; receive the hidden layer output, apply a first transformation to the hidden layer output to generate a first output that defines a predetermined number of bounding boxes in the input image, and apply one or more second transformations to the hidden layer output to generate a second output that includes a respective confidence score for each of the bounding boxes that represents a likelihood that the bounding box contains an image of an object. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification