Warping the residuals for image editing with StyleGAN

Yıldırım, Ahmet Burak; Pehlivan, Hamza; Dündar, Ayşegül

Warping the residuals for image editing with StyleGAN

Files

Warping_the_residuals_for_image_editing_with_StyleGAN.pdf (4.31 MB)

Date

2024-11-18

Authors

Yıldırım, Ahmet Burak

Pehlivan, Hamza

Dündar, Ayşegül

BUIR Usage Stats

9
views

129
downloads

Citation Stats

Abstract

StyleGAN models show editing capabilities via their semantically interpretable latent organizations which require successful GAN inversion methods to edit real images. Many works have been proposed for inverting images into StyleGAN's latent space. However, their results either suffer from low fidelity to the input image or poor editing qualities, especially for edits that require large transformations. That is because low bit rate latent spaces lose many image details due to the information bottleneck even though it provides an editable space. On the other hand, higher bit rate latent spaces can pass all the image details to StyleGAN for perfect reconstruction of images but suffer from low editing qualities. In this work, we present a novel image inversion architecture that extracts high-rate latent features and includes a flow estimation module to warp these features to adapt them to edits. This is because edits often involve spatial changes in the image, such as adjustments to pose or smile. Thus, high-rate latent features must be accurately repositioned to match their new locations in the edited image space. We achieve this by employing flow estimation to determine the necessary spatial adjustments, followed by warping the features to align them correctly in the edited image. Specifically, we estimate the flows from StyleGAN features of edited and unedited latent codes. By estimating the high-rate features and warping them for edits, we achieve both high-fidelity to the input image and high-quality edits. We run extensive experiments and compare our method with state-of-the-art inversion methods. Qualitative metrics and visual comparisons show significant improvements.

Source Title

International Journal of Computer Vision

Publisher

Springer New York LLC

Keywords

GAN inversion, Image editing, Generative adversarial networks

Permalink

https://hdl.handle.net/11693/116939

Published Version (Please cite this version)

https://dx.doi.org/10.1007/s11263-024-02301-6

Rights

https://creativecommons.org/licenses/by/4.0/

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Article

Full item page

Warping the residuals for image editing with StyleGAN

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Rights

Collections

Language

Type

Warping the residuals for image editing with StyleGAN

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Rights

Collections

Language

Type