Image Labeling Using Multi-Scale Processing
First Claim
1. An image labeling apparatus comprising:
- an input arranged to receive a first image to be labeled;
a coarsening engine arranged to repeatedly reduce the resolution of the original image according to a scaling factor to form an image pyramid;
a processor arranged to form an energy function from the coarsest image in the pyramid and by adjusting parameter values of that energy function on the basis of the scaling factor;
an optimization engine arranged to optimize the energy function to obtain a labeling of the coarsest image;
the processor being arranged to project the labeling onto the next-coarsest image in the pyramid to obtain a region of unlabeled image elements;
the processor being arranged to form another energy function using the current image in the pyramid and by adjusting parameter values of that energy function on the basis of the scaling factor;
the optimization engine being arranged to optimize the another energy function only over the region of unlabeled image elements to obtain a revised labeling; and
wherein the processor and optimization engine are arranged to repeatedly;
project the revised labeling, form another energy function and optimize that energy function;
to use all the images in the pyramid and so produce a labeled version of the first image.
2 Assignments
0 Petitions
Accused Products
Abstract
Multi-scale processing may be used to reduce the memory and computational requirements of optimization algorithms for image labeling, for example, for object segmentation, 3D reconstruction, stereo correspondence, optical flow and other applications. For example, in order to label a large image (or 3D volume) a multi-scale process first solves the problem at a low resolution, obtaining a coarse labeling of an original high resolution problem. This labeling is refined by solving another optimization on a subset of the image elements. In examples, an energy function for a coarse level version of an input image is formed directly from an energy function of the input image. In examples, the subset of image elements may be selected using a measure of confidence in the labeling.
-
Citations
20 Claims
-
1. An image labeling apparatus comprising:
-
an input arranged to receive a first image to be labeled; a coarsening engine arranged to repeatedly reduce the resolution of the original image according to a scaling factor to form an image pyramid; a processor arranged to form an energy function from the coarsest image in the pyramid and by adjusting parameter values of that energy function on the basis of the scaling factor; an optimization engine arranged to optimize the energy function to obtain a labeling of the coarsest image; the processor being arranged to project the labeling onto the next-coarsest image in the pyramid to obtain a region of unlabeled image elements; the processor being arranged to form another energy function using the current image in the pyramid and by adjusting parameter values of that energy function on the basis of the scaling factor; the optimization engine being arranged to optimize the another energy function only over the region of unlabeled image elements to obtain a revised labeling; and
wherein the processor and optimization engine are arranged to repeatedly;
project the revised labeling, form another energy function and optimize that energy function;
to use all the images in the pyramid and so produce a labeled version of the first image. - View Dependent Claims (2, 3, 4, 5)
-
-
6. An image labeling apparatus comprising:
-
an input arranged to receive a first image to be labeled; a coarsening engine arranged to repeatedly reduce the resolution of the original image to form an image pyramid; a processor arranged to form an energy function for the coarsest image in the pyramid directly from an energy function of the original image; an optimization engine arranged to optimize the energy function to obtain a labeling of the coarsest image; the processor being arranged to project the labeling onto the next-coarsest image in the pyramid to obtain a region of unlabeled image elements; the processor being arranged to form another energy function for the current image in the pyramid from the energy function of the original image; the optimization engine being arranged to optimize the another energy function only over the region of unlabeled image elements to obtain a revised labeling; and
wherein the processor and optimization engine are arranged to repeatedly;
project the revised labeling, form another energy function and optimize that energy function;
to use all the images in the pyramid and so produce a labeled version of the first image. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13)
-
-
14. A method of labeling an image comprising:
-
at an input receiving a first image to be labeled; arranging a coarsening engine to repeatedly reduce the resolution of the original image to form an image pyramid according to a scaling factor; arranging a processor to form an energy function for the coarsest image in the pyramid; arranging an optimization engine to optimize the energy function to obtain a labeling of the coarsest image; arranging the processor to project the labeling onto the next-largest image in the pyramid and to identify, using a measure of confidence of the labeling, a plurality of image elements to be reassessed; arranging the processor to form another energy function for the current image in the pyramid; arranging the optimization engine to optimize the another energy function only over the identified plurality of image elements to obtain a revised labeling; and
using the processor and optimization engine to repeatedly;
project the revised labeling, form another energy function and optimize that energy function;
to use all the images in the pyramid and so produce a labeled version of the first image. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification