Robust multi-modal method for recognizing objects
First Claim
Patent Images
1. A method for tracking heads and faces, comprising the steps of:
- activating a channel for collecting data comprising perceived locations of designated features of one of heads and faces;
collecting the data for each feature during a sequence of frames;
generating, for each feature, one or more representation models based on the collected data, wherein for at least one feature, complementary representation models are generated, and wherein each complementary representation model comprises data reflecting the perceived location of the feature to which it corresponds;
comparing the complementary representation models corresponding to the at least one feature to generate correlated data; and
combining the correlated data into a single representation, wherein said comparing step comprises the steps of;
defining a distance metric for each of the complementary representation models corresponding to the at least one feature;
positioning the complementary representation models adjacent a common interface;
measuring the mutual overlap of the complementary representation models; and
collecting, based on the overlap, information representing areas of correlation between the complementary representation models.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for tracking heads and faces is disclosed wherein a variety of different representation models can be used to define individual heads and facial features in a multi-channel capable tracking algorithm. The representation models generated by the channels during a sequence of frames are ultimately combined into a representation comprising a highly robust and accurate tracked output. In a preferred embodiment, the method conducts an initial overview procedure to establish the optimal tracking strategy to be used in light of the particular characteristics of the tracking application.
-
Citations
22 Claims
-
1. A method for tracking heads and faces, comprising the steps of:
-
activating a channel for collecting data comprising perceived locations of designated features of one of heads and faces; collecting the data for each feature during a sequence of frames; generating, for each feature, one or more representation models based on the collected data, wherein for at least one feature, complementary representation models are generated, and wherein each complementary representation model comprises data reflecting the perceived location of the feature to which it corresponds; comparing the complementary representation models corresponding to the at least one feature to generate correlated data; and combining the correlated data into a single representation, wherein said comparing step comprises the steps of; defining a distance metric for each of the complementary representation models corresponding to the at least one feature; positioning the complementary representation models adjacent a common interface; measuring the mutual overlap of the complementary representation models; and collecting, based on the overlap, information representing areas of correlation between the complementary representation models. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method for locating heads and faces in a sequence of frames of images, comprising the steps of:
-
activating a plurality of channels for tracking the heads and faces; gathering, by each channel, data from the tracked images during a sequence of frames; generating, from data gathered by a first channel, a first group of complementary representation models comprising perceived locations of head and facial features; comparing the first group of complementary representation models to generate a first intermediate representation comprising correlated data, and combining the correlated data into a single representation, wherein said comparing step comprises the steps of; positioning the complementary representation models adjacent a common interface; retrieving a comparison function from memory; selecting, based on the identity of the representation models, one or more distances metric; measuring the mutual overlap between the representation models; and storing the data correlating to the representation models. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A method for tracking facial features in images, comprising the steps of:
-
activating a fast channel; collecting a first set of complementary representation models, by the first channel, of designated candidate facial features; determining correlated data between the first set of complementary representation models; generating a first intermediate representation based on the correlated data; activating a second channel; collecting a second set of complementary representation models, by the second channel, of designated candidate facial features; measuring the correlated data between the second set of complementary representation models; generating a second representation based on the correlated data; and combining the first intermediate and second representations to from a tracked output. - View Dependent Claims (19)
-
-
20. A method for tracking facial features in complex images, comprising the steps of:
-
activating a plurality of channels for performing an initial overview sequence; generating, based on data gathered from the overview sequence, one or more representations comprising facial feature candidates; terminating activity on the plurality of channels; determining, based on the one or more representations, an optimal tracking strategy for the images to be tracked by selecting, for one or more additional facial features, representation models which correspond to each additional feature; and reactivating selected channels of the plurality of channels for gathering data from the images to be tracked, wherein said determining step further comprises the steps of selecting, for designated facial features, complementary representation models which correspond to each designated feature, and for one or more additional facial features, unitary representation models which correspond to each additional feature; and generating a first representation from the unitary models; comparing the complementary representation models to generate a second representation comprising correlated data; and combining the first and second representations. - View Dependent Claims (21, 22)
-
Specification