Modeling spatial context in transformer-based whole slide image classification

buir.advisorAksoy, Selim
dc.contributor.authorErkan, Cihan
dc.date.accessioned2023-09-22T05:59:31Z
dc.date.available2023-09-22T05:59:31Z
dc.date.copyright2023-09
dc.date.issued2023-09
dc.date.submitted2023-09-19
dc.departmentDepartment of Computer Engineering
dc.descriptionCataloged from PDF version of article.
dc.descriptionThesis (Master's): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2023.
dc.descriptionIncludes bibliographical references (leaves 42-46).
dc.description.abstractThe common method for histopathology image classification is to sample small patches from the large whole slide images and make predictions based on aggregations of patch representations. Transformer models provide a promising alternative with their ability to capture long-range dependencies of patches and their potential to detect representative regions, thanks to their novel self-attention strategy. However, as sequence-based architectures, transformers are unable to directly capture the two-dimensional nature of images. Modeling the spatial con-text of an image for a transformer requires two steps. In the first step the patches of the image are ordered as a 1-dimensional sequence, then the order information is injected to the model. However, commonly used spatial context modeling methods cannot accurately capture the distribution of the patches as they are designed to work on images with a fixed size. We propose novel spatial context modeling methods in an effort to make the model be aware of the spatial context of the patches as neighboring patches usually form diagnostically relevant structures. We achieve that by generating sequences that preserve the locality of the patches. We test the generated sequences by utilizing various information injection strategies. We evaluate the performance of the proposed transformer-based whole slide image classification framework on a lung dataset obtained from The Cancer Genome Atlas. Our experimental evaluations show that the proposed sequence generation method that utilizes space-filling curves to model the spatial context performs better than both baseline and state-of-the-art methods by achieving 87.6% accuracy.
dc.description.degreeM.S.
dc.description.statementofresponsibilityby Cihan Erkan
dc.format.extentx, 46 leaves : color illustrations, charts ; 30 cm.
dc.identifier.itemidB162527
dc.identifier.urihttps://hdl.handle.net/11693/113887
dc.language.isoEnglish
dc.publisherBilkent University
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectDigital pathology
dc.subjectSpace-filling curves
dc.subjectVision transformer
dc.subjectWhole slide image classification
dc.titleModeling spatial context in transformer-based whole slide image classification
dc.title.alternativeDönüştürücü tabanlı tüm slayt sınıflandırmasında uzaysal bağlamın modellenmesi
dc.typeThesis

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
B162527.pdf
Size:
12.51 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: