Augmenting Layer-Based Object Detection With Deep Convolutional Neural Networks
First Claim
1. A computer-implemented method for performing object recognition comprising:
- receiving image data;
extracting a depth image and a color image from the image data;
creating a mask image by segmenting the image data into a plurality of components;
identifying objects within the plurality of components of the mask image;
determining a first likelihood score from the depth image and the mask image using a layered classifier;
determining a second likelihood score from the color image and the mask image by generating an object image by copying pixels from the first image of the components in the mask image and classifying the object image using the deep convolutional neural network (CNN); and
determining a class for at least a portion of the image data based on the first likelihood score and the second likelihood score.
1 Assignment
0 Petitions
Accused Products
Abstract
By way of example, the technology disclosed by this document receives image data; extracts a depth image and a color image from the image data; creates a mask image by segmenting the depth image; determines a first likelihood score from the depth image and the mask image using a layered classifier; determines a second likelihood score from the color image and the mask image using a deep convolutional neural network; and determines a class of at least a portion of the image data based on the first likelihood score and the second likelihood score. Further, the technology can pre-filter the mask image using the layered classifier and then use the pre-filtered mask image and the color image to calculate a second likelihood score using the deep convolutional neural network to speed up processing.
-
Citations
20 Claims
-
1. A computer-implemented method for performing object recognition comprising:
-
receiving image data; extracting a depth image and a color image from the image data; creating a mask image by segmenting the image data into a plurality of components; identifying objects within the plurality of components of the mask image; determining a first likelihood score from the depth image and the mask image using a layered classifier; determining a second likelihood score from the color image and the mask image by generating an object image by copying pixels from the first image of the components in the mask image and classifying the object image using the deep convolutional neural network (CNN); and determining a class for at least a portion of the image data based on the first likelihood score and the second likelihood score.
-
-
2. A computer-implemented method for performing object recognition comprising:
-
receiving image data; creating a mask image by segmenting the image data into a plurality of components; determining a first likelihood score from the image data and the mask image using a layered classifier; determining a second likelihood score from the image data and the mask image using a deep convolutional neural network (CNN); and determining a class for at least a portion of the image data based on the first likelihood score and the second likelihood score. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for performing object recognition comprising:
-
a processor; and a memory storing instructions that, when executed, cause the system to; create a mask image by segmenting the image data into a plurality of components; determine a first likelihood score from the image data and the mask image using a layered classifier; determine a second likelihood score from the image data and the mask image using a deep convolutional neural network (CNN); and determine a class for at least a portion of the image data based on the first likelihood score and the second likelihood score. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification