VecGAN: Image-to-Image translation with interpretable latent directions

Dalva, Yusuf; Dundar, Aysegul; Altındiş, Said Fahri

VecGAN: Image-to-Image translation with interpretable latent directions

buir.contributor.author	Dalva, Yusuf
buir.contributor.author	Dundar, Aysegul
buir.contributor.author	Altındiş, Said Fahri
buir.contributor.orcid	Dalva, Yusuf\|0000-0002-8402-8291
dc.citation.epage	169	en_US
dc.citation.spage	153	en_US
dc.citation.volumeNumber	13676	en_US
dc.contributor.author	Dalva, Yusuf
dc.contributor.author	Dundar, Aysegul
dc.contributor.author	Altındiş, Said Fahri
dc.date.accessioned	2023-02-16T08:52:00Z
dc.date.available	2023-02-16T08:52:00Z
dc.date.issued	2022-10-21
dc.department	Department of Computer Engineering	en_US
dc.description.abstract	We propose VecGAN, an image-to-image translation framework for facial attribute editing with interpretable latent directions. Facial attribute editing task faces the challenges of precise attribute editing with controllable strength and preservation of the other attributes of an image. For this goal, we design the attribute editing by latent space factorization and for each attribute, we learn a linear direction that is orthogonal to the others. The other component is the controllable strength of the change, a scalar value. In our framework, this scalar can be either sampled or encoded from a reference image by projection. Our work is inspired by the latent space factorization works of fixed pretrained GANs. However, while those models cannot be trained end-to-end and struggle to edit encoded images precisely, VecGAN is end-to-end trained for image translation task and successful at editing an attribute while preserving the others. Our extensive experiments show that VecGAN achieves significant improvements over state-of-the-arts for both local and global edits.	en_US
dc.identifier.doi	10.1007/978-3-031-19787-1_9	en_US
dc.identifier.issn	3029743	en_US
dc.identifier.uri	http://hdl.handle.net/11693/111419	en_US
dc.language.iso	English	en_US
dc.relation.isversionof	https://www.doi.org/10.1007/978-3-031-19787-1_9	en_US
dc.source.title	Computer Vision – ECCV 2022	en_US
dc.subject	Image translation	en_US
dc.subject	Generative adversarial networks	en_US
dc.subject	Latent space manipulation	en_US
dc.subject	Face attribute editing	en_US
dc.title	VecGAN: Image-to-Image translation with interpretable latent directions	en_US
dc.type	Article	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: VecGAN_Image_to_Image_Translation_with_Interpretable_Latent_Directions.pdf
Size:: 3.84 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.69 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Scholarly Publications - Computer Engineering