Discriminative motion modeling for human motion tracking
First Claim
1. A method for recognizing and tracking human motion comprising steps of:
- receiving, by an input device, a plurality of learned motion segments representing different learned motions within a motion class, wherein each learned motion segment comprises a plurality of state vectors and each state vector comprises a time stamp, and wherein one of the learned motion segments comprises temporally contiguous state vectors clustered together in a low-dimensional space based on the time stamps;
receiving, by the input device, a representation of human motion having at least one motion from the motion class, the at least one motion comprising a sequence of pose states represented in a high dimensional space;
processing the received representation according to computer-executable instructions stored in a memory that cause a processor to execute steps of;
projecting the sequences of pose states from the high dimensional space to the low dimensional space according to a discriminative model that when applied to the sequence of pose states increases the inter-class separability between pose states of different motion classes and decreases the intra-class separability between pose states of a same motion-class;
determining an integer P nearest neighbors of a first projected pose state in the low dimensional space, the P nearest neighbors from P different learned motion segments;
determining P pose predictions for the P different learned motion segments; and
determining the pose prediction that best matches a current frame of the representation of human motion; and
storing the determined pose prediction to a memory.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method recognizes and tracks human motion from different motion classes. In a learning stage, a discriminative model is learned to project motion data from a high dimensional space to a low dimensional space while enforcing discriminance between motions of different motion classes in the low dimensional space. Additionally, low dimensional data may be clustered into motion segments and motion dynamics learned for each motion segment. In a tracking stage, a representation of human motion is received comprising at least one class of motion. The tracker recognizes and tracks the motion based on the learned discriminative model and the learned dynamics.
39 Citations
25 Claims
-
1. A method for recognizing and tracking human motion comprising steps of:
-
receiving, by an input device, a plurality of learned motion segments representing different learned motions within a motion class, wherein each learned motion segment comprises a plurality of state vectors and each state vector comprises a time stamp, and wherein one of the learned motion segments comprises temporally contiguous state vectors clustered together in a low-dimensional space based on the time stamps; receiving, by the input device, a representation of human motion having at least one motion from the motion class, the at least one motion comprising a sequence of pose states represented in a high dimensional space; processing the received representation according to computer-executable instructions stored in a memory that cause a processor to execute steps of; projecting the sequences of pose states from the high dimensional space to the low dimensional space according to a discriminative model that when applied to the sequence of pose states increases the inter-class separability between pose states of different motion classes and decreases the intra-class separability between pose states of a same motion-class; determining an integer P nearest neighbors of a first projected pose state in the low dimensional space, the P nearest neighbors from P different learned motion segments; determining P pose predictions for the P different learned motion segments; and determining the pose prediction that best matches a current frame of the representation of human motion; and storing the determined pose prediction to a memory. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system for recognizing and tracking human motion comprising:
-
an input device for receiving a representation of human motion having at least one motion from a motion class, the at least one motion comprising a sequence of pose states represented in a high dimensional space, and for receiving a plurality of learned motion segments representing different learned motions within the motion class, wherein each learned motion segment comprises a plurality of state vectors and each state vector comprises a time stamp, and wherein one of the learned motion segments comprises temporally contiguous state vectors clustered together in a low-dimensional space based on the time stamps; a processor adapted to project the sequences of pose states from the high dimensional space to the low dimensional space according to a discriminative model that when applied to the sequence of pose states, increases the inter-class separability between pose states of different motion classes and decreases the intra-class separability between pose states of a same motion class, determining an integer P nearest neighbors of a first projected pose state in the low dimensional space, the P nearest neighbors from P different learned motion segments, determining P pose predictions for the P different learned motion segments, and determining the pose prediction that best matches a current frame of the representation of human motion; and a memory adapted to store the determined pose state.
-
-
14. A computer program product, comprising a computer readable medium storing computer executable code for recognizing and tracking human motion, the computer executable code when executed causing a processor to perform steps of:
-
receiving a plurality of learned motion segments representing different learned motions within a motion class, wherein each learned motion segment comprises a plurality of state vectors and each state vector comprises a time stamp, and wherein one of the learned motion segments comprises temporally contiguous state vectors clustered together in a low-dimensional space based on the time stamps; receiving a representation of human motion having at least one motion from the motion class, the at least one motion comprising a sequence of pose states represented in a high dimensional space; projecting the sequences of pose states from the high dimensional space to the low dimensional space according to a discriminative model that when applied to the sequence of pose states increases the inter-class separability between pose states of different motion classes and decreases the intra-class separability between pose states of a same motion-class; determining an integer P nearest neighbors of a first projected pose state in the low dimensional space, the P nearest neighbors from P different learned motion segments; determining P pose predictions for the P different learned motion segments; and determining the pose prediction that best matches a current frame of the representation of human motion; and storing the determined pose prediction to a memory. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
Specification