Survival prediction via partial ordering in feature space and sample space

buir.advisorTaştan, Öznur
dc.contributor.authorBüyüközkan, Mustafa
dc.departmentDepartment of Computer Engineeringen_US
dc.descriptionCataloged from PDF version of thesis.en_US
dc.descriptionIncludes bibliographical references (leaves 69-84).en_US
dc.descriptionThesis (M.S.): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2016.en_US
dc.description.abstractPredicting the survival of a cancer patient is critical for choosing patient specific treatment strategies and is traditionally based on clinical or pathological factors such as patient age and tumor stage. In this thesis, we present two methodologies to build effective and interpretable survival models that utilize high-dimensional molecular profiles made available through next-gen sequencing technologies. Firstly, we present a method that focuses on partial ordering in the feature space. Existing models rely on the individual molecular quantities recorded in tumors; however, cancer is a complex disease where molecular mechanisms are dysregulated in various ways. This study, based on a system level perspective, incorporates the partial ordering of molecules (POF) in lieu of individual quantities. This strategy not only unveils predictive features with direct relevance to the biological mechanism and but also yields better performance in survival prediction compared to multivariate `1 penalized Cox proportional hazard and Random Survival Forest models. Testing the partial order representation of features in the subgroup identification task, we find that these features yield groups of patients, which are more quantifably distinct in terms of survival distributions. Secondly, we develop a survival prediction method based on ranking and support vector machines { Ranking Survival Vector Machines (RsurVM). RsurVM obtains a pairwise ranking of the patient survival times by learning to rank. It focuses on optimizing the most commonly used metric concordance index and can handle the censored data without making any assumptions. Our extensive tests on the ovarian adenocarcinoma patient molecular data demonstrate that RsurVM achieves better survival predictions regardless of the input molecular data (mRNA, protein, miRNA, Copy number variation and DNA methylation) than the two most commonly used methods: Cox-proportional hazards model and Random Survival Forest.en_US
dc.description.statementofresponsibilityby Mustafa Büyüközkan.en_US
dc.format.extentxiv, 84 leaves : charts.en_US
dc.publisherBilkent Universityen_US
dc.subjectSurvival estimationen_US
dc.subjectPairwise rankingen_US
dc.subjectPartial orderingen_US
dc.subjectBiologically interpretible featuresen_US
dc.titleSurvival prediction via partial ordering in feature space and sample spaceen_US
dc.title.alternativeÖznitelik ya da örneklem uzayında kısmi sıralama yoluyla sağkalım tahminlemeen_US
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
thesis - Mustafa_Buyukozkan.pdf
2.07 MB
Adobe Portable Document Format
Full printable version
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
1.71 KB
Item-specific license agreed upon to submission