Image inpainting with diffusion models and generative adversarial networks
buir.advisor | Boral, Ayşegül Dündar | |
dc.contributor.author | Yıldırım, Ahmet Burak | |
dc.date.accessioned | 2024-05-24T11:28:15Z | |
dc.date.available | 2024-05-24T11:28:15Z | |
dc.date.copyright | 2024-05 | |
dc.date.issued | 2024-05 | |
dc.date.submitted | 2024-05-23 | |
dc.description | Cataloged from PDF version of article. | |
dc.description | Thesis (Master's): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2024. | |
dc.description | Includes bibliographical references (leaves 57-67). | |
dc.description.abstract | We present two novel approaches to image inpainting, a task that involves erasing unwanted pixels from images and filling them in a semantically consistent and realistic way. The first approach uses natural language input to determine which object to remove from an image. We construct a dataset named GQA-Inpaint for this task and train a diffusion-based inpainting model on it, which can remove objects from images based on text prompts. The second approach tackles the challenging task of inverting erased images into StyleGAN’s latent space for realistic inpainting and editing. For this task, we propose learning an encoder and a mixing network to combine encoded features of erased images with StyleGAN’s mapped features from random samples. To achieve diverse inpainting results for the same erased image, we combine the encoded features and randomly sampled style vectors via the mixing network. We compare our methods with different evaluation metrics that measure the quality of the models and show significant quantitative and qualitative improvements. | |
dc.description.provenance | Made available in DSpace on 2024-05-24T11:28:15Z (GMT). No. of bitstreams: 1 B120556.pdf: 29237425 bytes, checksum: a452dd72c439b5b561a8ee4b56d1bbc9 (MD5) Previous issue date: 2024-05 | en |
dc.description.statementofresponsibility | by Ahmet Burak Yıldırım | |
dc.format.extent | xi, 67 leaves : illustrations, charts ; 30 cm. | |
dc.identifier.itemid | B120556 | |
dc.identifier.uri | https://hdl.handle.net/11693/115171 | |
dc.language.iso | English | |
dc.rights | info:eu-repo/semantics/openAccess | |
dc.subject | Image inpainting | |
dc.subject | Diffusion models | |
dc.subject | Generative adversarial networks | |
dc.subject | Instruction-based inpainting | |
dc.subject | Image editing. | |
dc.title | Image inpainting with diffusion models and generative adversarial networks | |
dc.title.alternative | Difüzyon modelleri ve çekişmeli üretici ağlar ile görüntü tamamlama | |
dc.type | Thesis | |
thesis.degree.discipline | Computer Engineering | |
thesis.degree.grantor | Bilkent University | |
thesis.degree.level | Master's | |
thesis.degree.name | MS (Master of Science) |