Robust, on-line, view-based appearance models for visual motion analysis and visual tracking
First Claim
1. A method for generating an appearance model utilizing image data provided in a plurality of sequential image frames, the appearance model defined by a stable component including a first mixing probability and a first data parameter that is calculated using a plurality of image data values respectively provided in a relatively large number of said sequential image frames, the relatively large number being greater than three, the appearance model also including a transient component having a second mixing probability and second data parameter that is calculated using a plurality of image data values respectively provided in a relatively small number of said sequential image frames, wherein the method comprises:
- receiving an image datum corresponding to a most current image frame of the plurality of sequential image frames;
determining a first likelihood value for the stable component and a second likelihood value for the transient component, the first likelihood value indicating a relative consistency between the image datum and the first data parameter, and the second likelihood value indicating a relative consistency between the image datum and the second data parameter; and
updating the first mixing probability of the stable component and the second mixing probability of the transient component using the first and second likelihood values, respectively.
4 Assignments
0 Petitions
Accused Products
Abstract
A robust, adaptive, appearance model is disclosed that includes both a stable model component, learned over a long time course, and a transient component, learned over a relatively short time course (e.g., a 2-frame motion component and/or an outlier processing component). An on-line EM-algorithm is used to adapt the appearance model parameters over time. An implementation of this approach is developed for an appearance model based on the filter responses from a steerable pyramid. The appearance model is used in a motion-based tracking algorithm to provide robustness in the face of image outliers, such as those caused by occlusions. It is also provides the ability to adapt to natural changes in appearance, such as those due to facial expressions, or variations in 3D pose.
51 Citations
17 Claims
-
1. A method for generating an appearance model utilizing image data provided in a plurality of sequential image frames, the appearance model defined by a stable component including a first mixing probability and a first data parameter that is calculated using a plurality of image data values respectively provided in a relatively large number of said sequential image frames, the relatively large number being greater than three, the appearance model also including a transient component having a second mixing probability and second data parameter that is calculated using a plurality of image data values respectively provided in a relatively small number of said sequential image frames, wherein the method comprises:
-
receiving an image datum corresponding to a most current image frame of the plurality of sequential image frames; determining a first likelihood value for the stable component and a second likelihood value for the transient component, the first likelihood value indicating a relative consistency between the image datum and the first data parameter, and the second likelihood value indicating a relative consistency between the image datum and the second data parameter; and updating the first mixing probability of the stable component and the second mixing probability of the transient component using the first and second likelihood values, respectively. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for tracking a selected target object comprising:
-
receiving a current image frame including image datum associated with of the target object; estimating a motion of the target object using an adaptive appearance model including a first image component having parameters that are calculated using a plurality of image data values respectively received over a relatively large number of image frames temporally preceding the current image frame, the relatively large number being greater than three, and a second image component having parameters that are calculated using a plurality of image data values respectively over the relatively small number of said sequential image frames temporally preceding the current image frame; and updating the first and second image components, wherein the parameters of the first component include a first data parameter and a first contribution parameter, wherein the parameters of the second component include a second data parameter and a second contribution parameter, and wherein updating the first and second components comprises; comparing the image datum of the current image frame with the first data parameter of the first component, and recalculating the first and second contribution parameters based upon a difference between the first data parameter and the image datum. - View Dependent Claims (9, 10, 11)
-
-
12. An adaptive appearance model implemented on a processor-controlled machine for identifying an object appearing in a plurality of sequential image frames, the adaptive appearance model comprising:
-
a first image component having parameters defined by image data that remains stable over a relatively large number of said sequential image frames, the relatively large number being greater than three, wherein the parameters of the first image component include a first parameter that is calculated using a plurality of image data values respectively provided in said relatively large number of sequential frames; and a second image component having parameters defined by a relatively small number of said sequential image frames, and means for updating said first and said image components after receiving a current image frame of the plurality of sequential image frames, wherein the parameters of the first component include the first data parameter and a first contribution parameter, wherein the parameters of the second component include a second data parameter and a second contribution parameter, and wherein said means for updating comprises; means for comparing the image datum of the current image frame with the first data parameter of the first component, and means for recalculating the first and second contribution parameters based upon a difference between the first data parameter and the image datum. - View Dependent Claims (13, 14, 15)
-
-
16. An adaptive appearance model implemented on a processor-controlled machine for identifying an object appearing in a plurality of sequential image frames, the adaptive appearance model comprising:
-
a first image component including a first mixing probability having a value that is determined by a first parameter that is calculated using a plurality of image data values respectively provided in a relatively large number of said sequential image frames, the relatively large number being greater than three; a second image component including a second mixing probability having a value determined by a relatively small number of said sequential image frames, and an outlier component including a third mixing probability that is determined by the occurrence of outliers in the image data received in the plurality of image frames. - View Dependent Claims (17)
-
Specification