Real image editing with StyleGAN

buir.advisorBoral, Ayşegül Dündar
dc.contributor.authorPehlivan, Hamza
dc.date.accessioned2023-09-13T11:16:15Z
dc.date.available2023-09-13T11:16:15Z
dc.date.copyright2023-09
dc.date.issued2023-09
dc.date.submitted2023-09-11
dc.departmentDepartment of Computer Engineering
dc.descriptionCataloged from PDF version of article.
dc.descriptionThesis (Master's): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2023.
dc.descriptionIncludes bibliographical references (leaves 37-42).
dc.description.abstractWe present a novel image inversion framework and a training pipeline to achieve high-fidelity image inversion with high-quality attribute editing. Inverting real images into StyleGAN’s latent space is an extensively studied problem, yet the trade-off between image reconstruction fidelity and image editing quality remains an open challenge. The low-rate latent spaces are limited in their expressiveness power for high-fidelity reconstruction. On the other hand, high-rate latent spaces result in degradation in editing quality. In this work, to achieve high-fidelity inversion, we learn residual features in higher latent codes that lower latent codes were not able to encode. This enables preserving image details in reconstruction. To achieve high-quality editing, we learn how to transform the residual features for adapting to manipulations in latent codes. We train the framework to extract residual features and transform them via a novel architecture pipeline and cycle consistency losses. We run extensive experiments and compare our method with state-of-the-art inversion methods. Qualitative metrics and visual comparisons show significant improvements.
dc.description.degreeM.S.
dc.description.statementofresponsibilityby Hamza Pehlivan
dc.format.extentx, 42 leaves : charts, photography ; 30 cm.
dc.identifier.itemidB162502
dc.identifier.urihttps://hdl.handle.net/11693/113861
dc.language.isoEnglish
dc.publisherBilkent University
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectGenerative Adversarial Networks (GANs)
dc.subjectGAN Inversion
dc.subjectImage editing with GANs
dc.titleReal image editing with StyleGAN
dc.title.alternativeStyleGAN ile gerçek resim düzenleme
dc.typeThesis

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
B162502.pdf
Size:
7.61 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.01 KB
Format:
Item-specific license agreed upon to submission
Description: