Neural network for object detection in images
First Claim
Patent Images
1. A device implemented method for image recognition, the method comprising:
- accessing, using one or more processors of the device coupled to a memory of the device, an image depicting an object of interest and a background within a field of view;
detecting, by the one or more processors using a set of detection layers of the multilayer object model, at least a portion of the object of interest within the image;
detecting, by the one or more processors using a set of detection layers of the multilayer object model, at least a portion of the background within the image;
extracting, by the one or more processors using a set of image representation layers of the multilayer object model comprising a lower image representation layer and a higher image representation layer, context information from the portion of the background, wherein a layer output of the higher image representation layer includes the extracted context information; and
identifying, by the one or more processors, the object of interest from the detected portion of the object of interest and the context information, using the set of image representation layers of the multilayer object model, passing the layer output of the higher image representation layer including the extracted context information backward to the lower image representation layer.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems, devices, media, and methods are presented for identifying and categorically labeling objects within a set of images. The systems and methods receive an image depicting an object of interest, detect at least a portion of the object of interest within the image using a multilayer object model, determine context information, and identify the object of interest included in two or more bounding boxes.
-
Citations
20 Claims
-
1. A device implemented method for image recognition, the method comprising:
-
accessing, using one or more processors of the device coupled to a memory of the device, an image depicting an object of interest and a background within a field of view; detecting, by the one or more processors using a set of detection layers of the multilayer object model, at least a portion of the object of interest within the image; detecting, by the one or more processors using a set of detection layers of the multilayer object model, at least a portion of the background within the image; extracting, by the one or more processors using a set of image representation layers of the multilayer object model comprising a lower image representation layer and a higher image representation layer, context information from the portion of the background, wherein a layer output of the higher image representation layer includes the extracted context information; and identifying, by the one or more processors, the object of interest from the detected portion of the object of interest and the context information, using the set of image representation layers of the multilayer object model, passing the layer output of the higher image representation layer including the extracted context information backward to the lower image representation layer. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
-
one or more processors; and a processor-readable storage device coupled to the one or more processors, the processor-readable storage device storing processor-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising; accessing an image depicting an object of interest and a background within a field of view; detecting, by the one or more processors using a set of detection layers of the multilayer object model, at least a portion of the object of interest within the image; detecting, by the one or more processors using a set of detection layers of the multilayer object model, at least a portion of the background within the image; extracting, by the one or more processors using a set of image representation layers of the multilayer object model comprising a lower image representation layer and a higher image representation layer, context information from the portion of the background, wherein a layer output of the higher image representation layer includes the extracted context information; and identifying, by the one or more processors, the object of interest from the detected portion of the object of interest and the context information, using the set of image representation layers of the multilayer object model, by passing the layer output of the higher image representation layer including the extracted context information backward to the lower image representation layer. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A processor-readable storage device storing processor-executable instructions that, when executed by one or more processors of a machine, cause the machine to perform operations comprising:
-
accessing an image depicting an object of interest and a background within a field of view; detecting, by the one or more processors using a set of detection layers of the multilayer object model, at least a portion of the object of interest within the image; detecting, by the one or more processors using a set of detection layers of the multilayer object model, at least a portion of the background within the image; extracting, by the one or more processors using a set of image representation layers of the multilayer object model comprising a lower image representation layer and a higher image representation layer, context information from the portion of the background, wherein a layer output of the higher image representation layer includes the extracted context information; and identifying, by the one or more processors, the object of interest from the detected portion of the object of interest and the context information, using the set of image representation layers of the multilayer object model, by passing the layer output of the higher image representation layer including the extracted context information backward to the lower image representation layer. - View Dependent Claims (17, 18, 19, 20)
-
Specification