Fast fourier transform correlation tracking algorithm with background correction
First Claim
1. A method for tracking an object in an image using Fast Fourier Transforms, comprising:
- identifying a background correction term for a Fast Fourier Transform correlation tracker; and
tracking the object based on a representation of the background correction term that includes a frequency domain sinc function;
wherein the tracking includes zero-padding a reference window to a size of a search window, performing a 2 dimension Fast Fourier Transform of the zero-padded reference window into the frequency domain, and taking a complex conjugate of the transformed zero-padded reference window, performing a 2 dimension Fast Fourier Transform of a search window, performing a complex multiplication of the complex conjugate of the transformed zero-padded reference window and the transformed search window, and multiplying the result by a first factor to obtain a first result in the frequency domain, squaring pixel values of the search window and performing a 2 dimension Fast Fourier Transform of the squared pixel values into the frequency domain, multiplying the transform of the squared pixel values with a sinc function to obtain a second result in the frequency domain, summing the first and second results to form a third result in the frequency domain, performing a 2 dimension inverse Fast Fourier Transform of the third result to obtain a spatial-domain correlation surface, and searching for a minimum of the correlation surface.
3 Assignments
0 Petitions
Accused Products
Abstract
An FFT correlation tracker that is capable of effectively tracking targets against non-uniform backgrounds in realtime, includes a background correction implemented using a FFT with the 2-dimension sinc function. The tracker tracks an object by effectively computing the first and third terms of the mean-square-error function C(s,t) defined as
This is done by first transforming the first and third terms into the frequency domain, where the first term, the background correction term, can be computed much more efficiently in real-time by using the 2-dimension sinc function. Multiplications and additions necessary to carry out the computations in the frequency domain are then performed. Next, the resulting frequency-domain function is transformed back into the spatial domain to form a correlation surface. Finally, a minimum of the resulting correlation surface is found. The location of the minimum corresponds to the location of the object being tracked.
23 Citations
8 Claims
-
1. A method for tracking an object in an image using Fast Fourier Transforms, comprising:
-
identifying a background correction term for a Fast Fourier Transform correlation tracker; and tracking the object based on a representation of the background correction term that includes a frequency domain sinc function; wherein the tracking includes zero-padding a reference window to a size of a search window, performing a 2 dimension Fast Fourier Transform of the zero-padded reference window into the frequency domain, and taking a complex conjugate of the transformed zero-padded reference window, performing a 2 dimension Fast Fourier Transform of a search window, performing a complex multiplication of the complex conjugate of the transformed zero-padded reference window and the transformed search window, and multiplying the result by a first factor to obtain a first result in the frequency domain, squaring pixel values of the search window and performing a 2 dimension Fast Fourier Transform of the squared pixel values into the frequency domain, multiplying the transform of the squared pixel values with a sinc function to obtain a second result in the frequency domain, summing the first and second results to form a third result in the frequency domain, performing a 2 dimension inverse Fast Fourier Transform of the third result to obtain a spatial-domain correlation surface, and searching for a minimum of the correlation surface. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method for tracking an object in an image using Fast Fourier Transforms, comprising:
-
identifying a background correction term for a Fast Fourier Transform correlation tracker; and tracking the object based on a representation of the background correction term that includes a frequency domain sinc function; wherein the tracking includes zero-padding a reference window to a size of a search window, performing a 2 dimension Fast Fourier Transform of the zero-padded reference window into the frequency domain, and taking a complex conjugate of the transformed zero-padded reference window, performing a 2 dimension Fast Fourier Transform of a search window, performing a complex multiplication of the complex conjugate of the transformed zero-padded reference window and the transformed search window, and multiplying the result by a first factor to obtain a first result in the frequency domain, obtaining a search window function by squaring pixel values of the search window, performing a 2 dimension Fast Fourier Transform of the search window function into the frequency domain, multiplying the transform of the search window function with a sinc function to obtain a second result in the frequency domain, summing the first and second results to form a third result in the frequency domain, performing a 2 dimension inverse Fast Fourier Transform of the third result to obtain a spatial-domain correlation surface, and searching for a minimum of the correlation surface.
-
-
7. A Fast Fourier Transform correlation tracker, comprising:
-
a computing device with inputs for receiving an input search window image and receiving a reference window image, wherein the computing device tracks the reference window image in the input search window image based on a frequency domain background correction term that includes a 2 dimension sinc function, wherein the tracker; zero-pads a reference window to a size of a search window, performs a 2 dimension Fast Fourier Transform of the zero-padded reference window into the frequency domain, and takes a complex conjugate of the transformed zero-padded reference window; performs a 2 dimension Fast Fourier Transform of a search window; performs a complex multiplication of the complex conjugate of the transformed zero-padded reference window and the transformed search window, and multiplies the result by a first factor to obtain a first result in the frequency domain; squares pixel values of the search window and performs a 2 dimension Fast Fourier Transform of the squared pixel values into the frequency domain; multiplies the transform of the squared pixel values with a sinc function to obtain a second result in the frequency domain; sums the first and second results to form a third result in the frequency domain; performs a 2 dimension inverse Fast Fourier Transform of the third result to obtain a spatial-domain correlation surface; and searches for a minimum of the correlation surface.
-
-
8. A Fast Fourier Transform correlation tracker, comprising:
-
a computing device with inputs for receiving an input search window image and receiving a reference window image, wherein the computing device tracks the reference window image in the input search window image based on a frequency domain background correction term that includes a 2 dimension sinc function, wherein the tracker; zero-pads a reference window to a size of a search window, performs a 2 dimension Fast Fourier Transform of the zero-padded reference window into the frequency domain, and takes a complex conjugate of the transformed zero-padded reference window; performs a 2 dimension Fast Fourier Transform of a search window; performs a complex multiplication of the complex conjugate of the transformed zero-padded reference window and the transformed search window, and multiplies the result by a first factor to obtain a first result in the frequency domain; obtains a search window function by squaring pixel values of the search window; performs a 2 dimension Fast Fourier Transform of the search window function into the frequency domain; multiplies the transform of the search window function with a sinc function to obtain a second result in the frequency domain; sums the first and second results to form a third result in the frequency domain; performs a 2 dimension inverse Fast Fourier Transform of the third result to obtain a spatial-domain correlation surface; and searches for a minimum of the correlation surface.
-
Specification