VecGAN: Image-to-Image translation with interpretable latent directions

Dalva, Yusuf; Dundar, Aysegul; Altındiş, Said Fahri

VecGAN: Image-to-Image translation with interpretable latent directions

Files

VecGAN_Image_to_Image_Translation_with_Interpretable_Latent_Directions.pdf (3.84 MB)

Date

2022-10-21

Authors

Dalva, Yusuf

Dundar, Aysegul

Altındiş, Said Fahri

BUIR Usage Stats

3
views

66
downloads

Citation Stats

Abstract

We propose VecGAN, an image-to-image translation framework for facial attribute editing with interpretable latent directions. Facial attribute editing task faces the challenges of precise attribute editing with controllable strength and preservation of the other attributes of an image. For this goal, we design the attribute editing by latent space factorization and for each attribute, we learn a linear direction that is orthogonal to the others. The other component is the controllable strength of the change, a scalar value. In our framework, this scalar can be either sampled or encoded from a reference image by projection. Our work is inspired by the latent space factorization works of fixed pretrained GANs. However, while those models cannot be trained end-to-end and struggle to edit encoded images precisely, VecGAN is end-to-end trained for image translation task and successful at editing an attribute while preserving the others. Our extensive experiments show that VecGAN achieves significant improvements over state-of-the-arts for both local and global edits.

Source Title

Computer Vision – ECCV 2022

Keywords

Image translation, Generative adversarial networks, Latent space manipulation, Face attribute editing

Permalink

http://hdl.handle.net/11693/111419

Published Version (Please cite this version)

https://www.doi.org/10.1007/978-3-031-19787-1_9

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Article

Full item page

VecGAN: Image-to-Image translation with interpretable latent directions

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

VecGAN: Image-to-Image translation with interpretable latent directions

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type