METHOD AND DEVICE FOR QUASI-GIBBS STRUCTURE SAMPLING BY DEEP PERMUTATION FOR PERSON IDENTITY INFERENCE
First Claim
1. A method for visual appearance based person identity inference, comprising:
- obtaining a plurality of input images, wherein the input images include a gallery set of images containing persons-of-interest and a probe set of images containing person detections;
extracting N feature maps from the input images using a Deep Neural Network, N being a natural number;
constructing N structure samples of the N feature maps using conditional random field (CRF) graphical models;
learning the N structure samples from an implicit common latent feature space embedded in the N structure samples; and
according to the learned structures, identifying one or more images from the probe set containing a same person-of-interest as an image in the gallery set.
1 Assignment
0 Petitions
Accused Products
Abstract
The present disclosure provides a method and device for visual appearance based person identity inference. The method may include obtaining a plurality of input images. The input images include a gallery set of images containing, persons-of-interest and a probe set of images containing person detections, and one input image corresponds to one person. The method may further include extracting N feature maps from the input images using a Deep Neural Network, N being a natural number; constructing N structure samples of the N feature maps using conditional random field (CRF) graphical models; learning the N structure samples from an implicit common latent feature space embedded in the N structure samples; and according to the learned structures, identifying one or more images from the probe set containing a same person-of-interest as an image in the gallery set.
-
Citations
18 Claims
-
1. A method for visual appearance based person identity inference, comprising:
-
obtaining a plurality of input images, wherein the input images include a gallery set of images containing persons-of-interest and a probe set of images containing person detections; extracting N feature maps from the input images using a Deep Neural Network, N being a natural number; constructing N structure samples of the N feature maps using conditional random field (CRF) graphical models; learning the N structure samples from an implicit common latent feature space embedded in the N structure samples; and according to the learned structures, identifying one or more images from the probe set containing a same person-of-interest as an image in the gallery set. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A device for visual appearance based person identity inference, comprising one or more processors configured to:
-
obtain a plurality of input images, wherein the input images include a gallery set of images containing persons-of-interest and a probe set of images containing person detections, and one input image corresponds to one person; extract N feature maps from the input images using a Deep Neural Network, N being a natural number; construct N structure samples of the N feature naps using conditional random field (CRF) graphical models; learn the N structure samples from an implicit common latent feature space embedded in the N structure samples; and according to the learned structures, identify one or more images from the probe set containing a same person-of-interest as an image in the gallery set. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification