Keypoint descriptor generation by complex wavelet analysis
First Claim
1. A method of generating a keypoint descriptor for identifying an object in an image or in a sequence of images, the keypoint descriptor being substantially invariant to a transformation of the object in the image, wherein the method is performed using processing apparatus and includes:
- a) receiving object data representing an object for identification in an image;
b) processing said object data to generate at least one basis function representing a feature having undergone at least one transformation or a transformation sequence across several consecutive frames, optionally using transformations that are out of the image plane to recognize objects from multiple views;
c) modifying a prototype wavelet function based on said at least one basis function to generate a plurality of modified wavelet functions;
d) comparing the plurality of modified wavelet functions with the at least one basis function;
e) selecting a modified wavelet function of said plurality of modified wavelet functions based on the comparison of step d); and
f) processing an input image or input orientation field according to said selected modified wavelet function to generate the keypoint descriptor,wherein the prototype wavelet function and the plurality of modified wavelet functions comprise a radial distance component and an azimuthal angle component,wherein the prototype wavelet function and the plurality of modified wavelet functions have the formula;
Ψ
k(d,θ
)=dnk−
1e−
α
k d(nk−
α
kd) ·
(cos(mkθ
)+qxsin(mkθ
))where the term dnk−
1e−
α
kd(nk−
α
kd) is the radial distance component and the term (cos(mkθ
)+qx, sin(mkθ
)) is the azimuthal angle component, and where the term qx, denotes an imaginary unit.
7 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to a method of generating a keypoint descriptor for identifying an object in an image or in a sequence of images, the keypoint descriptor being substantially invariant to a transformation of the object in the image. The method includes receiving object data representing an object for identification in an image; processing said object data to generate at least one basis function representing a feature having undergone at least one transformation or a transformation sequence across several consecutive frames, optionally using transformations that are out of the image plane to recognize objects from multiple views; modifying a prototype wavelet function based on said at least one basis function to generate a plurality of modified wavelet functions; comparing the plurality of modified wavelet functions with the at least one basis function; selecting a modified wavelet function of said plurality of modified wavelet functions based on the comparison of the plurality of modified wavelet functions with the at least one basis function; and processing an input image or input orientation field according to said selected modified wavelet function to generate the keypoint descriptor. The present invention further relates to a method of identifying an object in an image using a keypoint descriptor; and apparatus and computer software for implementing a method in accordance with the present invention.
-
Citations
44 Claims
-
1. A method of generating a keypoint descriptor for identifying an object in an image or in a sequence of images, the keypoint descriptor being substantially invariant to a transformation of the object in the image, wherein the method is performed using processing apparatus and includes:
-
a) receiving object data representing an object for identification in an image; b) processing said object data to generate at least one basis function representing a feature having undergone at least one transformation or a transformation sequence across several consecutive frames, optionally using transformations that are out of the image plane to recognize objects from multiple views; c) modifying a prototype wavelet function based on said at least one basis function to generate a plurality of modified wavelet functions; d) comparing the plurality of modified wavelet functions with the at least one basis function; e) selecting a modified wavelet function of said plurality of modified wavelet functions based on the comparison of step d); and f) processing an input image or input orientation field according to said selected modified wavelet function to generate the keypoint descriptor, wherein the prototype wavelet function and the plurality of modified wavelet functions comprise a radial distance component and an azimuthal angle component, wherein the prototype wavelet function and the plurality of modified wavelet functions have the formula;
Ψ
k(d,θ
)=dnk −
1e−
αk d(nk−
α
kd) ·
(cos(mkθ
)+qxsin(mkθ
))where the term dn k −
1e−
αk d(nk−
α
kd) is the radial distance component and the term (cos(mkθ
)+qx, sin(mkθ
)) is the azimuthal angle component, and where the term qx, denotes an imaginary unit.- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
7. A method according to claim 1, wherein the comparing of step d) includes matching at least one of the modified wavelet functions with said at least one basis function, and the selected modified wavelet function is selected in step e) according to matching criteria.
-
8. A method according to claim 7, wherein the processing of step b) includes generating a plurality of basis functions, the comparing of step d) includes matching a number of the plurality of basis functions with at least one of the modified wavelet functions, and the selecting of step e) includes selecting a modified wavelet function for each of the number of the plurality of basis functions according to matching criteria.
-
9. A method according to claim 8, wherein the keypoint descriptor is generated in step e) on the basis of the selected wavelet functions.
-
10. A method of identifying an object in an image, using processing apparatus, the method including processing image data representing at least part of said image with a keypoint descriptor generated using the method of claim 1.
-
11. A method according to claim 10, wherein the keypoint descriptor is a first keypoint descriptor, and including:
-
i) receiving said image data; ii) processing said image data to generate at least one second keypoint descriptor representative of at least one keypoint of said image; iii) comparing the first keypoint descriptor with the at least one second keypoint descriptor; and iv) determining if said at least one second keypoint descriptor matches the first keypoint descriptor.
-
-
12. A method according to claim 10, the method including estimating transformation parameters of the object in the input image or input orientation field from the keypoint descriptor.
-
13. A method according to claim 12, the method including tracking at least one keypoint from one frame to another frame using the transformation parameters.
-
14. Processing apparatus arranged to perform the method of claim 10.
-
15. A computer program product comprising a non-transitory computer-readable storage medium having computer readable instructions stored thereon, the computer readable instructions being executable by a computerised device to cause the computerised device to perform the method of claim 10.
-
16. Processing apparatus arranged to perform the method of claim 1.
-
17. A computer program product comprising a non-transitory computer-readable storage medium having computer readable instructions stored thereon, the computer readable instructions being executable by a computerised device to cause the computerised device to perform the method of claim 1.
-
18. A method of generating a keypoint descriptor for identifying an object in an image or in a sequence of images, the keypoint descriptor being substantially invariant to a transformation of the object in the image, wherein the method is performed using processing apparatus and includes:
-
a) receiving object data representing an object for identification in an image; b) processing said object data to generate at least one basis function representing a feature having undergone at least one transformation or a transformation sequence across several consecutive frames, optionally using transformations that are out of the image plane to recognise objects from multiple views; c) modifying a prototype wavelet function based on said at least one basis function to generate a plurality of modified wavelet functions; d) comparing the plurality of modified wavelet functions with the at least one basis function; e) selecting a modified wavelet function of said plurality of modified wavelet functions based on the comparison of step d); and f) processing an input image or input orientation field according to said selected modified wavelet function to generate the keypoint descriptor, wherein the object is a three dimensional object in a three dimensional image or the keypoint descriptor is arranged to identify an object in a series of images spaced over time, wherein the prototype wavelet function and the plurality of modified wavelet functions comprise a radial distance component, an azimuthal angle component and a polar angle component, wherein the prototype wavelet function and the plurality of modified wavelet functions have the formula;
Ψ
k(d,θ
,φ
,)=dnk −
1e−
αk d(nk−
α
kd) ·
(cos(mkθ
)+qxsin(mkθ
)) ·
(cos(lkφ
)+qysin(lkφ
))where the term dn k -1e−
αk d(nk−
α
kd) is the radial distance weighting component with radial distance, d, the term (cos(mkθ
)+qx, sin(mkθ
)) is the azimuthal angle component with azimuthal angle θ
, and the term (cos(lkφ
)+qysin(lkφ
)) is polar angle component with polar angle φ
, and where the terms qx, and qy, denote unit quaternions and obey the rules qx2=qy2=qz2=−
1, and qx, qy=qz. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32)
-
-
33. A method of generating a keypoint descriptor for identifying an object in an image or in a sequence of images, the keypoint descriptor being substantially invariant to a transformation of the object in the image, wherein the method is performed using processing apparatus and includes:
-
a) receiving object data representing an object for identification in an image; b) processing said object data to generate at least one basis function representing a feature having undergone at least one transformation or a transformation sequence across several consecutive frames, optionally using transformations that are out of the image plane to recognise objects from multiple views; c) modifying a prototype wavelet function based on said at least one basis function to generate a plurality of modified wavelet functions; d) comparing the plurality of modified wavelet functions with the at least one basis function; e) selecting a modified wavelet function of said plurality of modified wavelet functions based on the comparison of step d); and f) processing an input image or input orientation field according to said selected modified wavelet function to generate the keypoint descriptor, wherein the prototype wavelet function and the modified wavelet functions are quaternion wavelet functions, the keypoint is arranged to identify an object in an image patch, the transformation comprises a rotation, the transformation comprises a translation, said at least one basis function is an eigenfunction, the object is a two-dimensional object in a temporal sequence of images or a three-dimensional object in three-dimensional volumetric data, or the modifying of step c) is performed in accordance with the formula; - View Dependent Claims (34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44)
-
Specification