Diverse inpainting and editing with semantic conditioning
buir.advisor | Boral, Ayşegül Dündar | |
dc.contributor.author | Sivük, Hakan | |
dc.date.accessioned | 2024-09-18T07:43:49Z | |
dc.date.available | 2024-09-18T07:43:49Z | |
dc.date.copyright | 2024-09 | |
dc.date.issued | 2024-09 | |
dc.date.submitted | 2024-09-17 | |
dc.description | Cataloged from PDF version of article. | |
dc.description | Thesis (Master's): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2024. | |
dc.description | Includes bibliographical references (leaves 38-45). | |
dc.description.abstract | Semantic image editing involves filling in pixels according to a given semantic map, a complex task that demands contextual harmony and precise adherence to the semantic map. Most previous approaches attempt to encode all information from the erased image, but when adding an object like a car, its style cannot be inferred only from the context. Models capable of producing diverse results often struggle with smooth integration between generated and existing parts of the image. Moreover, existing methods lack a mechanism to encode the styles of fully and partially visible objects differently, limiting their effectiveness. In this work, we introduce a framework incorporating a novel mechanism to distinguish between visible and partially visible objects, leading to more consistent style encoding and improved final outputs. Through extensive comparisons with existing conditional image generation and semantic editing methods, our experiments demonstrate that our approach significantly outperforms the state-of-the-art. In addition to improved quantitative results, our method provides greater diversity in outcomes. For code and a demo, please visit our project page at https://github.com/hakansivuk/DivSem. | |
dc.description.provenance | Submitted by İlknur Sarıkaya (ilknur.sarikaya@bilkent.edu.tr) on 2024-09-18T07:43:49Z No. of bitstreams: 1 B162651.pdf: 34055504 bytes, checksum: 3e59d5309ee5a913d31e6bcb8b5128e7 (MD5) | en |
dc.description.provenance | Made available in DSpace on 2024-09-18T07:43:49Z (GMT). No. of bitstreams: 1 B162651.pdf: 34055504 bytes, checksum: 3e59d5309ee5a913d31e6bcb8b5128e7 (MD5) Previous issue date: 2024-09 | en |
dc.description.statementofresponsibility | by Hakan Sivük | |
dc.format.extent | x, 45 leaves : color illustrations, charts ; 30 cm. | |
dc.identifier.itemid | B162651 | |
dc.identifier.uri | https://hdl.handle.net/11693/115820 | |
dc.language.iso | English | |
dc.rights | info:eu-repo/semantics/openAccess | |
dc.subject | Semantic image editing | |
dc.subject | Conditional image inpainting | |
dc.subject | Conditional image outpainting | |
dc.subject | Generative adversarial networks | |
dc.title | Diverse inpainting and editing with semantic conditioning | |
dc.title.alternative | Semantik koşullama ile çeşitli tamamlama ve düzenleme | |
dc.type | Thesis | |
thesis.degree.discipline | Computer Engineering | |
thesis.degree.grantor | Bilkent University | |
thesis.degree.level | Master's | |
thesis.degree.name | MS (Master of Science) |