VecGAN: Image-to-Image translation with interpretable latent directions
buir.contributor.author | Dalva, Yusuf | |
buir.contributor.author | Dundar, Aysegul | |
buir.contributor.author | Altındiş, Said Fahri | |
buir.contributor.orcid | Dalva, Yusuf|0000-0002-8402-8291 | |
dc.citation.epage | 169 | en_US |
dc.citation.spage | 153 | en_US |
dc.citation.volumeNumber | 13676 | en_US |
dc.contributor.author | Dalva, Yusuf | |
dc.contributor.author | Dundar, Aysegul | |
dc.contributor.author | Altındiş, Said Fahri | |
dc.date.accessioned | 2023-02-16T08:52:00Z | |
dc.date.available | 2023-02-16T08:52:00Z | |
dc.date.issued | 2022-10-21 | |
dc.department | Department of Computer Engineering | en_US |
dc.description.abstract | We propose VecGAN, an image-to-image translation framework for facial attribute editing with interpretable latent directions. Facial attribute editing task faces the challenges of precise attribute editing with controllable strength and preservation of the other attributes of an image. For this goal, we design the attribute editing by latent space factorization and for each attribute, we learn a linear direction that is orthogonal to the others. The other component is the controllable strength of the change, a scalar value. In our framework, this scalar can be either sampled or encoded from a reference image by projection. Our work is inspired by the latent space factorization works of fixed pretrained GANs. However, while those models cannot be trained end-to-end and struggle to edit encoded images precisely, VecGAN is end-to-end trained for image translation task and successful at editing an attribute while preserving the others. Our extensive experiments show that VecGAN achieves significant improvements over state-of-the-arts for both local and global edits. | en_US |
dc.description.provenance | Submitted by Aleyna Demirkıran (demirkiranaleyna99@gmail.com) on 2023-02-16T08:52:00Z No. of bitstreams: 1 VecGAN_Image_to_Image_Translation_with_Interpretable_Latent_Directions.pdf: 4027888 bytes, checksum: 827bf1fef5ec2bafa0b68e9a30efb404 (MD5) | en |
dc.description.provenance | Made available in DSpace on 2023-02-16T08:52:00Z (GMT). No. of bitstreams: 1 VecGAN_Image_to_Image_Translation_with_Interpretable_Latent_Directions.pdf: 4027888 bytes, checksum: 827bf1fef5ec2bafa0b68e9a30efb404 (MD5) Previous issue date: 2022-10-21 | en |
dc.identifier.doi | 10.1007/978-3-031-19787-1_9 | en_US |
dc.identifier.issn | 03029743 | |
dc.identifier.uri | http://hdl.handle.net/11693/111419 | |
dc.language.iso | English | en_US |
dc.relation.isversionof | https://www.doi.org/10.1007/978-3-031-19787-1_9 | en_US |
dc.source.title | Computer Vision – ECCV 2022 | en_US |
dc.subject | Image translation | en_US |
dc.subject | Generative adversarial networks | en_US |
dc.subject | Latent space manipulation | en_US |
dc.subject | Face attribute editing | en_US |
dc.title | VecGAN: Image-to-Image translation with interpretable latent directions | en_US |
dc.type | Article | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- VecGAN_Image_to_Image_Translation_with_Interpretable_Latent_Directions.pdf
- Size:
- 3.84 MB
- Format:
- Adobe Portable Document Format
- Description:
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.69 KB
- Format:
- Item-specific license agreed upon to submission
- Description: