Computer Vision Faculty Publications

Rethinking conditional GAN training: An approach using geometrically structured latent manifolds

Sameera Ramasinghe, The Australian National University, Australia & Data61-CSIRO
Moshiur Farazi, Data61-CSIRO
Salman Khan, Mohamed bin Zayed University of Artificial IntelligenceFollow
Nick Barnes, The Australian National University, AustraliaFollow
Stephen Gould, The Australian National University, Australia

Document Type

Article

Publication Title

arXiv

Abstract

Conditional GANs (cGAN), in their rudimentary form, suffer from critical drawbacks such as the lack of diversity in generated outputs and distortion between the latent and output manifolds. Although efforts have been made to improve results, they can suffer from unpleasant side-effects such as the topology mismatch between latent and output spaces. In contrast, we tackle this problem from a geometrical perspective and propose a novel training mechanism that increases both the diversity and the visual quality of a vanilla cGAN, by systematically encouraging a bi-lipschitz mapping between the latent and the output manifolds. We validate the efficacy of our solution on a baseline cGAN (i.e., Pix2Pix) which lacks diversity, and show that by only modifying its training mechanism (i.e., with our proposed Pix2Pix-Geo), one can achieve more diverse and realistic outputs on a broad set of image-to-image translation tasks. Codes are available at https://github.com/samgregoost/Rethinking-CGANs. Copyright © 2020, The Authors. All rights reserved.

DOI

arXiv:2011.13055

Publication Date

11-25-2020

Keywords

Computer Vision and Pattern Recognition (cs.CV)

Comments

IR Deposit conditions: non-described

Preprint available on arXiv

Recommended Citation

S. Ramasinghe, M. Farazi, S. Khan, N. Barnes, and S. Gould, "Rethinking conditional GAN training: An approach using geometrically structured latent manifolds", 2020, arXiv:2011.13055

Link to Full Text

COinS

Computer Vision Faculty Publications

Rethinking conditional GAN training: An approach using geometrically structured latent manifolds

Document Type

Publication Title

Abstract

DOI

Publication Date

Keywords

Comments

Recommended Citation

Browse

Contribute

Links

Computer Vision Faculty Publications

Rethinking conditional GAN training: An approach using geometrically structured latent manifolds

Authors

Document Type

Publication Title

Abstract

DOI

Publication Date

Keywords

Comments

Recommended Citation

Share

Browse

Contribute

Links