Diverse inpainting and editing with semantic conditioning

buir.advisorBoral, Ayşegül Dündar
dc.contributor.authorSivük, Hakan
dc.date.accessioned2024-09-18T07:43:49Z
dc.date.available2024-09-18T07:43:49Z
dc.date.copyright2024-09
dc.date.issued2024-09
dc.date.submitted2024-09-17
dc.descriptionCataloged from PDF version of article.
dc.descriptionThesis (Master's): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2024.
dc.descriptionIncludes bibliographical references (leaves 38-45).
dc.description.abstractSemantic image editing involves filling in pixels according to a given semantic map, a complex task that demands contextual harmony and precise adherence to the semantic map. Most previous approaches attempt to encode all information from the erased image, but when adding an object like a car, its style cannot be inferred only from the context. Models capable of producing diverse results often struggle with smooth integration between generated and existing parts of the image. Moreover, existing methods lack a mechanism to encode the styles of fully and partially visible objects differently, limiting their effectiveness. In this work, we introduce a framework incorporating a novel mechanism to distinguish between visible and partially visible objects, leading to more consistent style encoding and improved final outputs. Through extensive comparisons with existing conditional image generation and semantic editing methods, our experiments demonstrate that our approach significantly outperforms the state-of-the-art. In addition to improved quantitative results, our method provides greater diversity in outcomes. For code and a demo, please visit our project page at https://github.com/hakansivuk/DivSem.
dc.description.provenanceSubmitted by İlknur Sarıkaya (ilknur.sarikaya@bilkent.edu.tr) on 2024-09-18T07:43:49Z No. of bitstreams: 1 B162651.pdf: 34055504 bytes, checksum: 3e59d5309ee5a913d31e6bcb8b5128e7 (MD5)en
dc.description.provenanceMade available in DSpace on 2024-09-18T07:43:49Z (GMT). No. of bitstreams: 1 B162651.pdf: 34055504 bytes, checksum: 3e59d5309ee5a913d31e6bcb8b5128e7 (MD5) Previous issue date: 2024-09en
dc.description.statementofresponsibilityby Hakan Sivük
dc.format.extentx, 45 leaves : color illustrations, charts ; 30 cm.
dc.identifier.itemidB162651
dc.identifier.urihttps://hdl.handle.net/11693/115820
dc.language.isoEnglish
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectSemantic image editing
dc.subjectConditional image inpainting
dc.subjectConditional image outpainting
dc.subjectGenerative adversarial networks
dc.titleDiverse inpainting and editing with semantic conditioning
dc.title.alternativeSemantik koşullama ile çeşitli tamamlama ve düzenleme
dc.typeThesis
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
B162651.pdf
Size:
32.48 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.1 KB
Format:
Item-specific license agreed upon to submission
Description: