Utilizing deep learning for automatic digital image segmentation and stylization
First Claim
1. In a digital medium environment for editing digital visual media, a method of using deep learning to automatically select individuals portrayed in the digital visual media, the method comprising:
- training, by at least one processor, a neural network utilizing training input generated from a repository of digital training images;
generating, by the at least one processor, with regard to a probe digital image portraying a target individual, a position channel that indicates positions of pixels in the probe digital image relative to the target individual portrayed in the probe digital image by determining a transform between one or more feature points of the target individual and a canonical pose; and
identifying, by the at least one processor, a set of pixels representing the target individual in the probe digital image utilizing the trained neural network and the position channel.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods are disclosed for segregating target individuals represented in a probe digital image from background pixels in the probe digital image. In particular, in one or more embodiments, the disclosed systems and methods train a neural network based on two or more of training position channels, training shape input channels, training color channels, or training object data. Moreover, in one or more embodiments, the disclosed systems and methods utilize the trained neural network to select a target individual in a probe digital image. Specifically, in one or more embodiments, the disclosed systems and methods generate position channels, training shape input channels, and color channels corresponding the probe digital image, and utilize the generated channels in conjunction with the trained neural network to select the target individual.
-
Citations
20 Claims
-
1. In a digital medium environment for editing digital visual media, a method of using deep learning to automatically select individuals portrayed in the digital visual media, the method comprising:
-
training, by at least one processor, a neural network utilizing training input generated from a repository of digital training images; generating, by the at least one processor, with regard to a probe digital image portraying a target individual, a position channel that indicates positions of pixels in the probe digital image relative to the target individual portrayed in the probe digital image by determining a transform between one or more feature points of the target individual and a canonical pose; and identifying, by the at least one processor, a set of pixels representing the target individual in the probe digital image utilizing the trained neural network and the position channel. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. In a digital medium environment for editing digital visual media, a method of using deep learning to automatically select individuals portrayed in the digital visual media, the method comprising:
-
accessing; a trained neural network generated from a repository of digital training images, wherein each of the digital training images portrays a training target individual, and a mean digital object mask reflecting a shape based on each of the training target individuals portrayed in the digital training images; generating, with regard to a probe digital image by at least one processor and utilizing the mean digital object mask, a shape input channel comprising an estimated shape of a target individual based on the mean digital object mask by estimating a transform between one or more facial feature points corresponding to the target individual portrayed in the probe digital image and a canonical pose; and identifying, by the at least one processor, a set of pixels representing the target individual in the probe digital image utilizing the trained neural network and the generated shape input channel. - View Dependent Claims (13, 14, 15, 16)
-
-
17. A system for identifying target objects within digital visual media, comprising:
-
at least one processor; and at least one non-transitory computer readable storage medium storing instructions thereon, that, when executed by the at least one processor, cause the system to; generate a plurality of training color channels and a plurality of training shape input channels with regard to a plurality of digital training digital images, wherein each digital training image portrays a target individual; train a neural network utilizing the plurality of training color channels and the plurality of training shape input channels; generate a color channel and a shape input channel with regard to a probe digital image, wherein the probe digital image portrays a target individual, the color channel reflects colors of pixels in the digital training image and the shape input channel comprises an estimated shape of the target individual based on a transform between one or more features of the target individual and a canonical pose; and identify a set of pixels representing the target individual in the probe digital image utilizing the trained neural network, the generated color channel, and the generated shape input channel. - View Dependent Claims (18, 19, 20)
-
Specification