Object Recognition Using Textons and Shape Filters
First Claim
1. A method comprising;
- (i) receiving a plurality of training images of objects;
(ii) receiving an object label map for each training image, each object label map comprising a label for each image element specifying one of a plurality of object classes;
(iii) accessing a dictionary of textons, each texton comprising information describing the texture of a patch of surface of an object;
(iv) forming a texton map for each training image using the dictionary of textons, each texton map comprising, for each image element a label indicating a texton;
(v) for each texton map computing a plurality of feature responses by applying a different shape filter for each feature response;
(vi) selecting a sub-set of the shape filters used in computing the feature responses by forming a multi-class classifier to classify image elements into the object classes on the basis of at least some of the feature responses; and
(vii) forming an object detection and recognition system using the selected shape filters.
2 Assignments
0 Petitions
Accused Products
Abstract
Given an image of structured and/or unstructured objects we automatically partition it into semantically meaningful areas each labeled with a specific object class. We use a novel type of feature which we refer to as a shape filter. Shape filters enable us to capture some or all of shape, texture and appearance context information. A shape filter comprises one or more regions of arbitrary shape, size and position within a bounding area of an image, paired with a specified texton. A texton comprises information describing the texture of a patch of surface of an object. In a training process we select a sub-set of possible shape filters and incorporate those into a conditional random field model of object classes. That model is then used for object detection and recognition.
41 Citations
20 Claims
-
1. A method comprising;
-
(i) receiving a plurality of training images of objects; (ii) receiving an object label map for each training image, each object label map comprising a label for each image element specifying one of a plurality of object classes; (iii) accessing a dictionary of textons, each texton comprising information describing the texture of a patch of surface of an object; (iv) forming a texton map for each training image using the dictionary of textons, each texton map comprising, for each image element a label indicating a texton; (v) for each texton map computing a plurality of feature responses by applying a different shape filter for each feature response; (vi) selecting a sub-set of the shape filters used in computing the feature responses by forming a multi-class classifier to classify image elements into the object classes on the basis of at least some of the feature responses; and (vii) forming an object detection and recognition system using the selected shape filters. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method comprising:
-
(i) receiving a plurality of training images of objects; (ii) receiving an object label map for each training image, each object label map comprising a label for each image element specifying one of a plurality of object classes; (iii) accessing a dictionary of textons, each texton comprising information describing the texture of a patch of surface of an object; (iv) forming a texton map for each training image using the dictionary of textons, each texton map comprising, for each image element a label indicating a texton; (v) for each texton map computing a plurality of feature responses by applying a different shape filter for each feature response; (vi) selecting a sub-set of the shape filters used in computing the feature responses by forming a multi-class classifier to classify image elements into the object classes on the basis of at least some of the feature responses; and (vii) forming an object label map for a previously unseen image using the selected shape filters. - View Dependent Claims (14, 15, 16)
-
-
17. One or more computer readable media having computer executable instructions for performing steps comprising:
-
(i) receiving a plurality of training images of objects; (ii) receiving an object label map for each training image, each object label map comprising a label for each image element specifying one of a plurality of object classes; (iii) accessing a dictionary of textons, each texton comprising information describing the texture of a patch of surface of an object; (iv) forming a texton map for each training image using the dictionary of textons, each texton map comprising, for each image element a label indicating a texton; (v) for each texton map computing a plurality of feature responses by applying a different shape filter and texton pair for each feature response; (vi) selecting a sub-set of the shape filters used in computing the feature responses by forming a multi-class classifier to classify image elements into the object classes on the basis of at least some of the feature responses; and (vii) forming an object label map for a previously unseen image using the selected shape filters. - View Dependent Claims (18, 19, 20)
-
Specification