Producing Higher-Quality Samples Of Natural Images
First Claim
1. A method comprising, by one or more computing devices:
- accessing a plurality of generative adversarial networks (GANs) that are each applied to a particular level k of a Laplacian pyramid, wherein each GAN comprises a generative model Gk and a discriminative model Dk, wherein, at each level k;
the generative model Gk takes as input a noise vector zk and outputs a generated image {tilde over (h)}k; and
the discriminative model Dk takes as input either the generated image {tilde over (h)}k or a real image hk drawn from a set of training data, and outputs a probability that the input was the real image hk;
generating a sample image Ĩ
k from the generated images {tilde over (h)}k, wherein the sample image is based at least in part on the probabilities outputted by each of the discriminative models Dk at each level k of the Laplacian pyramid and the generated images {tilde over (h)}k; and
providing the sample image Ĩ
k for display on a display screen of a client device of a user of a communications network.
2 Assignments
0 Petitions
Accused Products
Abstract
In one embodiment, a method includes accessing a plurality of generative adversarial networks (GANs) that are each applied to a particular level k of a Laplacian pyramid. Each GAN may comprise a generative model Gk and a discriminative model Dk. At each level k, the generative model Gk may take as input a noise vector zk and may output a generated image {tilde over (h)}k. At each level k, the discriminative model Dk may take as input either the generated image {tilde over (h)}k or a real image hk, and may output a probability that the input was the real image hk. The method may further include generating a sample image Ĩk from the generated images {tilde over (h)}k, wherein the sample image is based on the probabilities outputted by each of the discriminative models Dk and the generated images {tilde over (h)}k. The method may further include providing the sample image Ĩk for display.
-
Citations
20 Claims
-
1. A method comprising, by one or more computing devices:
-
accessing a plurality of generative adversarial networks (GANs) that are each applied to a particular level k of a Laplacian pyramid, wherein each GAN comprises a generative model Gk and a discriminative model Dk, wherein, at each level k; the generative model Gk takes as input a noise vector zk and outputs a generated image {tilde over (h)}k; and the discriminative model Dk takes as input either the generated image {tilde over (h)}k or a real image hk drawn from a set of training data, and outputs a probability that the input was the real image hk; generating a sample image Ĩ
k from the generated images {tilde over (h)}k, wherein the sample image is based at least in part on the probabilities outputted by each of the discriminative models Dk at each level k of the Laplacian pyramid and the generated images {tilde over (h)}k; andproviding the sample image Ĩ
k for display on a display screen of a client device of a user of a communications network.
-
-
2. The method of claim 1, wherein generating the sample image Ĩ
-
k from the generated images {tilde over (h)}k further comprises;
applying an upsampling operator u(.) to at least one of the generated images {tilde over (h)}k; and combining the upsampled image {tilde over (h)}k with an image Ik 1 generated by a generative model Gk 1, wherein the image Ik 1 was generated at level k 1 in the Laplacian pyramid.
-
k from the generated images {tilde over (h)}k further comprises;
-
3. The method of claim 2, wherein the sample image Ĩ
-
k is generated using the equation Ĩ
k=u(Ĩ
k 1) {tilde over (h)}k.
-
k is generated using the equation Ĩ
-
4. The method of claim 1, wherein the generated images {tilde over (h)}k are generated in a coarse-to-fine fashion.
-
5. The method of claim 1, wherein each level k in the Laplacian pyramid corresponds to a generated image {tilde over (h)}k that comprises a particular number of pixels, wherein as k increases, the number of pixels in {tilde over (h)}k decreases.
-
6. The method of claim 1, wherein the generative model Gk and the discriminative model Dk each take an additional vector of information as input.
-
7. The method of claim 6, wherein the additional vector of information is a generated image lk created by a conditional GAN model.
-
8. One or more computer-readable non-transitory storage media embodying software that is operable when executed to:
-
access a plurality of generative adversarial networks (GANs) that are each applied to a particular level k of a Laplacian pyramid, wherein each GAN comprises a generative model Gk and a discriminative model Dk, wherein, at each level k; the generative model Gk takes as input a noise vector zk and outputs a generated image {tilde over (h)}k; and the discriminative model Dk takes as input either the generated image {tilde over (h)}k or a real image hk drawn from a set of training data, and outputs a probability that the input was the real image hk; generate a sample image Ĩ
k from the generated images {tilde over (h)}k, wherein the sample image is based at least in part on the probabilities outputted by each of the discriminative models Dk at each level k of the Laplacian pyramid and the generated images {tilde over (h)}k; andprovide the sample image Ĩ
k for display on a display screen of a client device of a user of a communications network.
-
-
9. The media of claim 8, wherein generating the sample image Ĩ
-
k from the generated images {tilde over (h)}k further comprises;
applying an upsampling operator u(.) to at least one of the generated images {tilde over (h)}k; and combining the upsampled image {tilde over (h)}k with an image Ik 1 generated by a generative model Gk 1, wherein the image Ik 1 was generated at level k 1 in the Laplacian pyramid.
-
k from the generated images {tilde over (h)}k further comprises;
-
10. The media of claim 9, wherein the sample image Ĩ
-
k is generated using the equation Ĩ
k=u(Ĩ
k 1) {tilde over (h)}k.
-
k is generated using the equation Ĩ
-
11. The media of claim 8, wherein the generated images {tilde over (h)}k are generated in a coarse-to-fine fashion.
-
12. The media of claim 8, wherein each level k in the Laplacian pyramid corresponds to a generated image {tilde over (h)}k that comprises a particular number of pixels, wherein as k increases, the number of pixels in {tilde over (h)}k decreases.
-
13. The media of claim 8, wherein the generative model Gk and the discriminative model Dk each take an additional vector of information as input.
-
14. The media of claim 13, wherein the additional vector of information is a generated image lk created by a conditional GAN model.
-
15. A system comprising:
-
one or more processors; and one or more computer-readable non-transitory storage media coupled to one or more of the processors and comprising instructions operable when executed by one or more of the processors to cause the system to; access a plurality of generative adversarial networks (GANs) that are each applied to a particular level k of a Laplacian pyramid, wherein each GAN comprises a generative model Gk and a discriminative model Dk, wherein, at each level k; the generative model Gk takes as input a noise vector zk and outputs a generated image {tilde over (h)}k; and the discriminative model Dk takes as input either the generated image {tilde over (h)}k or a real image hk drawn from a set of training data, and outputs a probability that the input was the real image hk; generate a sample image Ĩ
k from the generated images {tilde over (h)}k, wherein the sample image is based at least in part on the probabilities outputted by each of the discriminative models Dk at each level k of the Laplacian pyramid and the generated images {tilde over (h)}k; andprovide the sample image Ĩ
k for display on a display screen of a client device of a user of a communications network.
-
-
16. The system of claim 15, wherein generating the sample image Ĩ
-
k from the generated images {tilde over (h)}k further comprises;
applying an upsampling operator u(.) to at least one of the generated images {tilde over (h)}k; and combining the upsampled image {tilde over (h)}k with an image Ik 1 generated by a generative model Gk 1, wherein the image Ik 1 was generated at level k 1 in the Laplacian pyramid.
-
k from the generated images {tilde over (h)}k further comprises;
-
17. The system of claim 16, wherein the sample image Ĩ
-
k is generated using the equation Ĩ
k=u(Ĩ
k 1) {tilde over (h)}k.
-
k is generated using the equation Ĩ
-
18. The system of claim 15, wherein the generated images {tilde over (h)}k are generated in a coarse-to-fine fashion.
-
19. The system of claim 15, wherein each level k in the Laplacian pyramid corresponds to a generated image {tilde over (h)}k that comprises a particular number of pixels, wherein as k increases, the number of pixels in {tilde over (h)}k decreases.
-
20. The system of claim 15, wherein the generative model Gk and the discriminative model Dk each take an additional vector of information as input.
Specification