Face inpainting with pre-trained image transformers

Gönç, Kaan; Sağlam, Baturay; Kozat, Süleyman S.; Dibeklioğlu, Hamdi

Face inpainting with pre-trained image transformers

buir.contributor.author	Gönç, Kaan
buir.contributor.author	Sağlam, Baturay
buir.contributor.author	Kozat, Süleyman S.
buir.contributor.author	Dibeklioğlu, Hamdi
buir.contributor.orcid	Sağlam, Baturay\|0000-0002-8324-5980
buir.contributor.orcid	Kozat, Süleyman S.\|0000-0002-6488-3848
dc.citation.epage	[4]	en_US
dc.citation.spage	[1]	en_US
dc.contributor.author	Gönç, Kaan
dc.contributor.author	Sağlam, Baturay
dc.contributor.author	Kozat, Süleyman S.
dc.contributor.author	Dibeklioğlu, Hamdi
dc.coverage.spatial	Safranbolu, Turkey	en_US
dc.date.accessioned	2023-02-14T07:00:22Z
dc.date.available	2023-02-14T07:00:22Z
dc.date.issued	2022-08-29
dc.department	Department of Computer Engineering	en_US
dc.department	Department of Electrical and Electronics Engineering	en_US
dc.description	Conference Name: 2022 30th Signal Processing and Communications Applications Conference (SIU)	en_US
dc.description	Date of Conference: 15-18 May 2022	en_US
dc.description.abstract	Image inpainting is an underdetermined inverse problem that allows various contents to fill in the missing or damaged regions realistically. Convolutional neural networks (CNNs) are commonly used to create aesthetically pleasing content, yet CNNs have restricted perception fields for collecting global characteristics. Transformers enable long-range relationships to be modeled and different content generated with autoregressive modeling of pixel-sequence distributions using image-level attention mechanism. However, the current approaches to inpainting with transformers are limited to task-specific datasets and require larger-scale data. We introduce an approach to image inpainting by leveraging pre-trained vision transformers to remedy this issue. Experiments show that our approach can outperform CNN-based approaches and have a remarkable performance closer to the task-specific transformer methods.	en_US
dc.description.abstract	Görüntü yamalama, bir görüntüdeki çeşitli içeriklerin eksik veya hasarlı bölgelerini gerçekçi bir şekilde doldurulmasına izin veren, belirsiz bir ters problemdir. Evrişimli Sinir Ağları (ESA veya Convolutional Neural Networks) estetik açıdan hoş içerik oluşturmak için yaygın olarak kullanılmaktadır ancak ESA’lar küresel özellikleri toplamak için sınırlı algı alanlarına sahiptir. Dönüştürücüler (Transformers), uzun menzilli ilişkilerin modellenmesini ve görüntü düzeyinde dikkat (attention) mekanizması kullanılarak piksel dizisi dağılımlarının otoregresif modellemesi ile farklı içeriklerin oluşturulmasını sağlamaktadır. Bununla birlikte, Dönüştürücülerle yamalamaya yönelik mevcut yaklaşımlar, göreve özgü veri kümeleriyle sınırlıdır ve daha büyük ölçekli veriler gerektirmektedir. Bu bildiri, bahsi geçen sorunu çözmek için önceden eğitilmiş görüntü Dönüştürücülerden yararlanarak görüntü yamalamaya bir yaklaşım getirmektedir. Gerçekleştirilen deneyler, yaklaşımımızın ESA tabanlı yaklaşımlardan daha iyi performans gösterebileceğini ve göreve özel Dönüştürücü bazlı yöntemlere daha yakın ve dikkate değer bir performansa sahip oldugunu belirtmektedir.
dc.identifier.doi	10.1109/SIU55565.2022.9864676	en_US
dc.identifier.eisbn	978-1-6654-5092-8	en_US
dc.identifier.issn	2165-0608	en_US
dc.identifier.uri	http://hdl.handle.net/11693/111229	en_US
dc.language.iso	Turkish	en_US
dc.publisher	IEEE	en_US
dc.relation.isversionof	https://www.doi.org/10.1109/SIU55565.2022.9864676	en_US
dc.source.title	Signal Processing and Communications Applications Conference (SIU)	en_US
dc.subject	Image inpainting	en_US
dc.subject	Transformers	en_US
dc.subject	Deep generative models	en_US
dc.subject	Görüntü yamalama
dc.subject	Dönüştürücüler
dc.subject	Derin üretken modeller
dc.title	Face inpainting with pre-trained image transformers	en_US
dc.type	Conference Paper	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Face_Inpainting_with_Pre-trained_Image_Transformers.pdf
Size:: 2.24 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.69 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Scholarly Publications - Computer Engineering
Scholarly Publications - Electrical and Electronics Engineering