Diverse SNP selection for epistasis test prioritization

buir.advisorÇiçek, A. Ercüment
dc.contributor.authorÇaylak, Gizem
dc.date.accessioned2019-08-16T06:58:49Z
dc.date.available2019-08-16T06:58:49Z
dc.date.copyright2019-08
dc.date.issued2019-08
dc.date.submitted2019-08-05
dc.descriptionCataloged from PDF version of article.en_US
dc.descriptionThesis (M.S.): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2019.en_US
dc.descriptionIncludes bibliographical references (leaves 60-67).en_US
dc.description.abstractGenome-wide association studies explain a fraction of the underlying heritability of genetic diseases. Epistatic interactions between two or more loci help closing the gap and identifying those complex interactions provides a promising road to a better understanding of complex traits. Unfortunately, sheer number of loci combinations to consider and hypotheses to test prohibit the process both computationally and statistically. This is true even if only pairs of loci are considered. Epistasis prioritization algorithms have proven useful for reducing the computational burden and limiting the number of tests to perform. While current methods aim at avoiding linkage disequilibrium and covering the case cohort, none aims at diversifying the topological layout of the selected SNPs which can detect complementary variants. In this thesis, a two stage pipeline to prioritize epistasis test is proposed. In the first step, a submodular set function is optimized to select a diverse set of SNPs that span the underlying genome to (i) avoid linkage disequilibrium and (ii) pair SNPs that relate to complementary function. In the second step, selected SNPs are used as seeds to a fast epistasis detection algorithm. The algorithm is compared with the state-of-the-art method LinDen on three datasets retrieved from Wellcome Trust Case Control Consortium: type two diabates, hypertension and bipolar disorder. The results show that the pipeline drastically reduces the number of tests to perform while the number of statistically significant epistatic pairs discovered increases.en_US
dc.description.provenanceSubmitted by Betül Özen (ozen@bilkent.edu.tr) on 2019-08-16T06:58:49Z No. of bitstreams: 1 Diverse_SNP_Selection_for_Epistasis_Test_Prioritization.pdf: 738180 bytes, checksum: ed7a970788ca1b20f7ac0f7a94b74d46 (MD5)en
dc.description.provenanceMade available in DSpace on 2019-08-16T06:58:49Z (GMT). No. of bitstreams: 1 Diverse_SNP_Selection_for_Epistasis_Test_Prioritization.pdf: 738180 bytes, checksum: ed7a970788ca1b20f7ac0f7a94b74d46 (MD5) Previous issue date: 2019-08en
dc.description.statementofresponsibilityby Gizem Çaylaken_US
dc.format.extentxviii, 67 leaves : charts ; 30 cm.en_US
dc.identifier.itemidB160073
dc.identifier.urihttp://hdl.handle.net/11693/52333
dc.language.isoEnglishen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectGWASen_US
dc.subjectEpistasis test prioritizationen_US
dc.subjectSNP selectionen_US
dc.titleDiverse SNP selection for epistasis test prioritizationen_US
dc.title.alternativeEpistatik test önceliklendirme için çesitli SNP seçilimien_US
dc.typeThesisen_US
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Diverse_SNP_Selection_for_Epistasis_Test_Prioritization.pdf
Size:
720.88 KB
Format:
Adobe Portable Document Format
Description:
Full printable version
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: