Three-dimensional visual phrases for object recognition
First Claim
Patent Images
1. A method comprising:
- under control of a processor configured with computer-executable instructions,receiving a collection of images each containing an object;
constructing a three-dimensional (3-D) model of the object, the 3-D model including a plurality of points;
determining a popularity of individual points in the plurality of points, wherein the popularity of an individual point is based at least in part on a number of images of the collection of images in which the individual point is observed;
selecting, based at least in part on a first sampling rate, a first popular point subset of the plurality of points based at least in part on the popularities of the respective individual points in the plurality of points;
selecting, based at least in part on a second sampling rate different than the first sampling rate, a second popular point subset of the plurality of points based at least in part on the popularities of the respective individual points in the plurality of points;
generating one or more 3-D visual phrases based on the first popular point subset of the plurality of points and the second popular point subset of the plurality of points; and
using the one or more 3-D visual phrases to detect the object in an unclassified image.
2 Assignments
0 Petitions
Accused Products
Abstract
The techniques discussed herein discover three-dimensional (3-D) visual phrases for an object based on a 3-D model of the object. The techniques then describe the 3-D visual phrases. Once described, the techniques use the 3-D visual phrases to detect the object in an image (e.g., object recognition).
37 Citations
20 Claims
-
1. A method comprising:
-
under control of a processor configured with computer-executable instructions, receiving a collection of images each containing an object; constructing a three-dimensional (3-D) model of the object, the 3-D model including a plurality of points; determining a popularity of individual points in the plurality of points, wherein the popularity of an individual point is based at least in part on a number of images of the collection of images in which the individual point is observed; selecting, based at least in part on a first sampling rate, a first popular point subset of the plurality of points based at least in part on the popularities of the respective individual points in the plurality of points; selecting, based at least in part on a second sampling rate different than the first sampling rate, a second popular point subset of the plurality of points based at least in part on the popularities of the respective individual points in the plurality of points; generating one or more 3-D visual phrases based on the first popular point subset of the plurality of points and the second popular point subset of the plurality of points; and using the one or more 3-D visual phrases to detect the object in an unclassified image. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. One or more computer storage media storing computer executable instructions that, when executed, perform operations comprising:
-
sampling a plurality of points of a 3-D model of an object to determine a popular point subset, wherein each point of the popular point subset is sampled based on a number of training images in which the point is observed; generating one or more sets of three-dimensional (3-D) visual phrases based on the popular point subset; storing the one or more sets of 3-D visual phrases, each set of 3-D visual phrases being associated with the object; receiving an indication to perform object recognition for one or more images; performing object recognition on the one or more images using the one or more sets of 3-D visual phrases; and categorizing the one or more images based on whether the object is recognized in an image using the one or more sets of 3-D visual phrases. - View Dependent Claims (12, 13)
-
-
14. A system comprising:
-
one or more processors; one or more computer memories, coupled to the one or more processors and storing; an image access module, operable by the one or more processors, to access a plurality of images that each comprise an object; a three-dimensional (3-D) visual phrase discoverer module, operable by the one or more processors, to; sample a plurality of points of a 3-D point cloud for the object to determine a popular point subset, wherein an individual point of the popular point subset is sampled based on a number of the plurality of images in which the individual point is observed; and discover one or more 3-D visual phrases from the 3-D point cloud for the object based at least in part on the popular point subset; and an object detection module, operable by the one or more processors, to receive an indication to perform object recognition for one or more additional images and to use the one or more 3-D visual phrases to recognize the object in individual ones of the one or more additional images. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification