Modeling spatial context in transformer-based whole slide image classification

Erkan, Cihan

buir.advisor	Aksoy, Selim
dc.contributor.author	Erkan, Cihan
dc.date.accessioned	2023-09-22T05:59:31Z
dc.date.available	2023-09-22T05:59:31Z
dc.date.copyright	2023-09
dc.date.issued	2023-09
dc.date.submitted	2023-09-19
dc.department	Department of Computer Engineering
dc.description	Cataloged from PDF version of article.
dc.description	Thesis (Master's): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2023.
dc.description	Includes bibliographical references (leaves 42-46).
dc.description.abstract	The common method for histopathology image classification is to sample small patches from the large whole slide images and make predictions based on aggregations of patch representations. Transformer models provide a promising alternative with their ability to capture long-range dependencies between patches and their potential to detect representative regions, thanks to their novel self-attention strategy. However, as sequence-based architectures, transformers are unable to directly capture the two-dimensional nature of images. Modeling the spatial context of an image for a transformer requires two steps. First, the patches of the image are ordered as a one-dimensional sequence; then, the order information is injected into the model. However, commonly used spatial context modeling methods cannot accurately capture the distribution of the patches, as they are designed to work on images with a fixed size. We propose novel spatial context modeling methods in an effort to make the model aware of the spatial context of the patches, as neighboring patches usually form diagnostically relevant structures. We achieve this by generating sequences that preserve the locality of the patches. We test the generated sequences with various information injection strategies. We evaluate the performance of the proposed transformer-based whole slide image classification framework on a lung dataset obtained from The Cancer Genome Atlas. Our experimental evaluations show that the proposed sequence generation method, which utilizes space-filling curves to model the spatial context, performs better than both baseline and state-of-the-art methods, achieving 87.6% accuracy.
dc.description.degree	M.S.
dc.description.statementofresponsibility	by Cihan Erkan
dc.format.extent	x, 46 leaves : color illustrations, charts ; 30 cm.
dc.identifier.itemid	B162527
dc.identifier.uri	https://hdl.handle.net/11693/113887
dc.language.iso	English
dc.publisher	Bilkent University
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	Digital pathology
dc.subject	Space-filling curves
dc.subject	Vision transformer
dc.subject	Whole slide image classification
dc.title	Modeling spatial context in transformer-based whole slide image classification
dc.title.alternative	Dönüştürücü tabanlı tüm slayt sınıflandırmasında uzaysal bağlamın modellenmesi
dc.type	Thesis

Files

Original bundle

Name: B162527.pdf
Size: 12.51 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission

Collections

Dept. of Computer Engineering - Master's degree