Image inpainting with diffusion models and generative adversarial networks

buir.advisorBoral, Ayşegül Dündar
dc.contributor.authorYıldırım, Ahmet Burak
dc.date.accessioned2024-05-24T11:28:15Z
dc.date.available2024-05-24T11:28:15Z
dc.date.copyright2024-05
dc.date.issued2024-05
dc.date.submitted2024-05-23
dc.descriptionCataloged from PDF version of article.
dc.descriptionThesis (Master's): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2024.
dc.descriptionIncludes bibliographical references (leaves 57-67).
dc.description.abstractWe present two novel approaches to image inpainting, a task that involves erasing unwanted pixels from images and filling them in a semantically consistent and realistic way. The first approach uses natural language input to determine which object to remove from an image. We construct a dataset named GQA-Inpaint for this task and train a diffusion-based inpainting model on it, which can remove objects from images based on text prompts. The second approach tackles the challenging task of inverting erased images into StyleGAN’s latent space for realistic inpainting and editing. For this task, we propose learning an encoder and a mixing network to combine encoded features of erased images with StyleGAN’s mapped features from random samples. To achieve diverse inpainting results for the same erased image, we combine the encoded features and randomly sampled style vectors via the mixing network. We compare our methods with different evaluation metrics that measure the quality of the models and show significant quantitative and qualitative improvements.
dc.description.provenanceMade available in DSpace on 2024-05-24T11:28:15Z (GMT). No. of bitstreams: 1 B120556.pdf: 29237425 bytes, checksum: a452dd72c439b5b561a8ee4b56d1bbc9 (MD5) Previous issue date: 2024-05en
dc.description.statementofresponsibilityby Ahmet Burak Yıldırım
dc.format.extentxi, 67 leaves : illustrations, charts ; 30 cm.
dc.identifier.itemidB120556
dc.identifier.urihttps://hdl.handle.net/11693/115171
dc.language.isoEnglish
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectImage inpainting
dc.subjectDiffusion models
dc.subjectGenerative adversarial networks
dc.subjectInstruction-based inpainting
dc.subjectImage editing.
dc.titleImage inpainting with diffusion models and generative adversarial networks
dc.title.alternativeDifüzyon modelleri ve çekişmeli üretici ağlar ile görüntü tamamlama
dc.typeThesis
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
B120556.pdf
Size:
27.88 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.01 KB
Format:
Item-specific license agreed upon to submission
Description: