Survival prediction via partial ordering in feature space and sample space

buir.advisorTaştan, Öznur
dc.contributor.authorBüyüközkan, Mustafa
dc.date.accessioned2016-05-10T13:25:28Z
dc.date.available2016-05-10T13:25:28Z
dc.date.copyright2016-03
dc.date.issued2016-03
dc.date.submitted2016-05-10
dc.descriptionCataloged from PDF version of thesis.en_US
dc.descriptionIncludes bibliographical references (leaves 69-84).en_US
dc.descriptionThesis (M.S.): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2016.en_US
dc.description.abstractPredicting the survival of a cancer patient is critical for choosing patient specific treatment strategies and is traditionally based on clinical or pathological factors such as patient age and tumor stage. In this thesis, we present two methodologies to build effective and interpretable survival models that utilize high-dimensional molecular profiles made available through next-gen sequencing technologies. Firstly, we present a method that focuses on partial ordering in the feature space. Existing models rely on the individual molecular quantities recorded in tumors; however, cancer is a complex disease where molecular mechanisms are dysregulated in various ways. This study, based on a system level perspective, incorporates the partial ordering of molecules (POF) in lieu of individual quantities. This strategy not only unveils predictive features with direct relevance to the biological mechanism and but also yields better performance in survival prediction compared to multivariate `1 penalized Cox proportional hazard and Random Survival Forest models. Testing the partial order representation of features in the subgroup identification task, we find that these features yield groups of patients, which are more quantifably distinct in terms of survival distributions. Secondly, we develop a survival prediction method based on ranking and support vector machines { Ranking Survival Vector Machines (RsurVM). RsurVM obtains a pairwise ranking of the patient survival times by learning to rank. It focuses on optimizing the most commonly used metric concordance index and can handle the censored data without making any assumptions. Our extensive tests on the ovarian adenocarcinoma patient molecular data demonstrate that RsurVM achieves better survival predictions regardless of the input molecular data (mRNA, protein, miRNA, Copy number variation and DNA methylation) than the two most commonly used methods: Cox-proportional hazards model and Random Survival Forest.en_US
dc.description.provenanceSubmitted by Betül Özen (ozen@bilkent.edu.tr) on 2016-05-10T13:25:28Z No. of bitstreams: 1 thesis - Mustafa_Buyukozkan.pdf: 2174332 bytes, checksum: 2876397cf9d9f7362550cb94586cc1bd (MD5)en
dc.description.provenanceMade available in DSpace on 2016-05-10T13:25:28Z (GMT). No. of bitstreams: 1 thesis - Mustafa_Buyukozkan.pdf: 2174332 bytes, checksum: 2876397cf9d9f7362550cb94586cc1bd (MD5) Previous issue date: 2016-03en
dc.description.statementofresponsibilityby Mustafa Büyüközkan.en_US
dc.embargo.release2018-03-10
dc.format.extentxiv, 84 leaves : charts.en_US
dc.identifier.itemidB153162
dc.identifier.urihttp://hdl.handle.net/11693/29092
dc.language.isoEnglishen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectSurvival estimationen_US
dc.subjectPairwise rankingen_US
dc.subjectPartial orderingen_US
dc.subjectBiologically interpretible featuresen_US
dc.titleSurvival prediction via partial ordering in feature space and sample spaceen_US
dc.title.alternativeÖznitelik ya da örneklem uzayında kısmi sıralama yoluyla sağkalım tahminlemeen_US
dc.typeThesisen_US
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis - Mustafa_Buyukozkan.pdf
Size:
2.07 MB
Format:
Adobe Portable Document Format
Description:
Full printable version
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: