Object finder for photographic images
First Claim
1. A method to detect presence of a 3D (three dimensional) object in a 2D (two dimensional) image containing a 2D representation of said 3D object, said method comprising:
- receiving a digitized version of said 2D image;
selecting one or more view-based detectors;
for each view-based detector, computing a wavelet transform of said digitized version of said 2D image, wherein said wavelet transform generates a plurality of transform coefficients, and wherein each transform coefficient represents visual information from said 2D image that is localized in space, frequency, and orientation;
applying said one or more view-based detectors in parallel to respective plurality of transform coefficients, wherein each view-based detector is configured to detect a specific orientation of said 3D object in said 2D image based on visual information received from corresponding transform coefficients;
combining results of application of said one or more view-based detectors; and
determining orientation and location of said 3D object from said combination of results of application of said one or more view-based detectors.
2 Assignments
0 Petitions
Accused Products
Abstract
An object finder program for detecting presence of a 3D object in a 2D image containing a 2D representation of the 3D object. The object finder uses the wavelet transform of the input 2D image for object detection. A pre-selected number of view-based detectors are trained on sample images prior to performing the detection on an unknown image. These detectors then operate on the given input image and compute a quantized wavelet transform for the entire input image. The object detection then proceeds with sampling of the quantized wavelet coefficients at different image window locations on the input image and efficient look-up of pre-computed log-likelihood tables to determine object presence. The object finder'"'"'s coarse-to-fine object detection strategy coupled with exhaustive object search across different positions and scales results in an efficient and accurate object detection scheme. The object finder detects a 3D object over a wide range in angular variation (e.g., 180 degrees) through the combination of a small number of detectors each specialized to a small range within this range of angular variation.
112 Citations
38 Claims
-
1. A method to detect presence of a 3D (three dimensional) object in a 2D (two dimensional) image containing a 2D representation of said 3D object, said method comprising:
-
receiving a digitized version of said 2D image;
selecting one or more view-based detectors;
for each view-based detector, computing a wavelet transform of said digitized version of said 2D image, wherein said wavelet transform generates a plurality of transform coefficients, and wherein each transform coefficient represents visual information from said 2D image that is localized in space, frequency, and orientation;
applying said one or more view-based detectors in parallel to respective plurality of transform coefficients, wherein each view-based detector is configured to detect a specific orientation of said 3D object in said 2D image based on visual information received from corresponding transform coefficients;
combining results of application of said one or more view-based detectors; and
determining orientation and location of said 3D object from said combination of results of application of said one or more view-based detectors. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 25, 26, 27, 29, 30, 31)
-
-
24. A computer-readable storage medium having stored thereon instructions, which, when executed by a processor, cause the processor to perform the following:
-
digitize a 2D (two dimensional) image, wherein said 2D image contains a 2D representation of a 3D (three dimensional) object;
compute a wavelet transform of said digitized version of said 2D image, wherein said wavelet transform generates a plurality of transform coefficients, and wherein each transform coefficient represents corresponding visual information from said 2D image;
place an image window of fixed size at a first plurality of locations within said 2D image;
evaluate a plurality of visual attributes at each of said first plurality of locations of said image window using corresponding transform coefficients; and
estimate the presence of said 3D object in said 2D image based on evaluation of said plurality of visual attributes at said each of said first plurality of locations.
-
-
28. A method of providing assistance in detecting the presence of a 3D (three dimensional) object in a 2D (two dimensional) image containing a 2D representation of said 3D object, said method comprising:
-
receiving a digitized version of said 2D image from a client site and over a communication network;
determining the location of said 3D object in said 2D image; and
sending a notification of said location of said 3D object to said client site over said communication network.
-
-
32. A system for providing assistance in detecting the presence of a 3D (three dimensional) object in a 2D (two dimensional) image containing a 2D representation of said 3D object, said system comprising:
-
a client computer connected to a communication network, wherein said client computer is configured to transmit a digitized version of said 2D image over said communication network; and
a host computer in communication with the client computer and connected to said communication network, wherein said host computer is configured to perform the following;
receive over said communication network said digitized version of said 2D image from said client computer, determine the orientation and location of said 3D object in said 2D image received from said client computer, and send a notification of said orientation and location of said 3D object to said client computer over said communication network. - View Dependent Claims (33, 35, 36, 37, 38)
-
-
34. A computer system, which, upon being programmed, is configured to perform the following:
-
receive a digitized version of a 2D (two dimensional) image, wherein said 2D image contains a 2D representation of a 3D (three dimensional) object;
select one or more view-based detectors;
for each view-based detector, compute a wavelet transform of said digitized version of said 2D image, wherein said wavelet transform generates a plurality of transform coefficients, and wherein each transform coefficient represents corresponding visual information from said 2D image;
apply said one or more view-based detectors in parallel to respective plurality of transform coefficients, wherein each view-based detector is configured to detect a specific orientation of said 3D object in said 2D image based on visual information received from corresponding transform coefficients;
combine results of application of said one or more view-based detectors; and
determine orientation and location of said 3D object from said combination of results of application of said one or more view-based detectors.
-
Specification