OBJECT DETECTION AND CLASSIFICATION IN IMAGES
First Claim
1. A method comprising:
- receiving an input image;
generating a convolutional feature map;
identifying, by a first type of neural network, a candidate object in the input image;
determining, by a second type of neural network, a category of the candidate object, wherein the first type of neural network and the second type of neural network share at least one algorithm; and
assigning a confidence score to the category of the candidate object.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems, methods, and computer-readable media for providing fast and accurate object detection and classification in images are described herein. In some examples, a computing device can receive an input image. The computing device can process the image, and generate a convolutional feature map. In some configurations, the convolutional feature map can be processed through a Region Proposal Network (RPN) to generate proposals for candidate objects in the image. In various examples, the computing device can process the convolutional feature map with the proposals through a Fast Region-Based Convolutional Neural Network (FRCN) proposal classifier to determine a class of each object in the image and a confidence score associated therewith. The computing device can then provide a requestor with an output including the object classification and/or confidence score.
135 Citations
20 Claims
-
1. A method comprising:
-
receiving an input image; generating a convolutional feature map; identifying, by a first type of neural network, a candidate object in the input image; determining, by a second type of neural network, a category of the candidate object, wherein the first type of neural network and the second type of neural network share at least one algorithm; and assigning a confidence score to the category of the candidate object. - View Dependent Claims (7, 8)
-
-
2. A method as 1 recites, wherein the first type of neural network comprises a translation invariant component configured to:
-
classify an anchor based on overlap with a ground-truth box; and predict a shift and a scale of the anchor.
-
-
3. A method as 1 recites, wherein the identifying the candidate object in the input image comprises:
-
generating one or more anchors at a point of the input image; determining an overlap of individual ones of the one or more anchors to a ground-truth box; assigning a label to each anchor of the one or more anchors based at least in part on the overlap; assigning a score to the label based at least in part on the overlap; and identifying the candidate object at the point based at least in part on the score. - View Dependent Claims (4, 6)
-
-
5. A method as 3 recites, wherein the generating the one or more anchors at the point of the input image comprises generating a set of anchor boxes, the set anchor boxes having three scales and three aspect ratios.
-
9. A system comprising:
-
a processor; and a computer-readable medium including instructions for an object detection and classification network, for actuation by the processor, the object detection and classification network comprising; an initial processing module configured to input an image and generate a convolutional feature map; an object proposal module configured to generate a proposal corresponding to a candidate object in the image; a proposal classifier module configured to assign a category associated with the candidate object, wherein the object proposal module and the proposal classifier module share at least one convolutional layer. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. A computer storage medium having thereon computer-executable instructions, the computer-executable instructions responsive to execution configuring a device to perform operations comprising:
-
receiving an input image; generating a convolutional feature map; identifying, by a neural network, a candidate object in the input image; determining, by a proposal classifier sharing an algorithm with the neural network, a category of the candidate object; and assigning, by the proposal classifier, a confidence score to the category of the candidate object. - View Dependent Claims (17, 18, 19, 20)
-
Specification