Potpourri: an epistasis test prioritization algorithm via diverse SNP selection

Çaylak, Gizem; Çiçek, A. Ercüment

Potpourri: an epistasis test prioritization algorithm via diverse SNP selection

buir.contributor.author	Çaylak, Gizem
buir.contributor.author	Çiçek, A. Ercüment
dc.citation.epage	244	en_US
dc.citation.spage	243	en_US
dc.citation.volumeNumber	12074 LNBI	en_US
dc.contributor.author	Çaylak, Gizem	en_US
dc.contributor.author	Çiçek, A. Ercüment	en_US
dc.contributor.editor	Schwartz, R.
dc.coverage.spatial	Padua, Italy	en_US
dc.date.accessioned	2021-03-04T08:14:20Z
dc.date.available	2021-03-04T08:14:20Z
dc.date.issued	2020
dc.department	Department of Computer Engineering	en_US
dc.description	Date of Conference: 10-13 May 2020	en_US
dc.description	Conference Name: 24th Annual Conference on Research in Computational Molecular Biology, RECOMB 2020	en_US
dc.description.abstract	Genome-wide association studies explain a fraction of the underlying heritability of genetic diseases. Investigating epistatic interactions between two or more loci help closing this gap. Unfortunately, sheer number of loci combinations to process and hypotheses to test prohibit the process both computationally and statistically. Epistasis test prioritization algorithms rank likely-epistatic SNP pairs to limit the number of tests. Yet, they still su_er from very low precision. It was shown in the literature that selecting SNPs that are individually correlated with the phenotype and also diverse with respect to genomic location, leads to better phenotype prediction due to genetic complementation. Here, we hypothesize that an algorithm that pairs SNPs from such diverse regions and carefully ranks the pairs can detect statistically more meaningful pairs and can improve prediction power. We propose an epistasis test prioritization algorithm which optimizes a submodular set function to select a diverse and complementary set of genomic regions that span the underlying genome. SNP pairs from these regions are then further ranked w.r.t. their co-coverage of the case cohort. We compare our algorithm with the state- of-the-art on three GWAS and show that (i) we substantially improve precision (from 0.003 to 0.652) while maintaining the signi_cance of selected pairs, (ii) decrease the number of tests by 25 folds, and (iii) decrease the runtime by 4 folds. We also show that promoting SNPs from regulatory/coding regions improves the precision (up to 0.8).	en_US
dc.identifier.doi	10.1007/978-3-030-45257-5_22	en_US
dc.identifier.isbn	9783030452568	en_US
dc.identifier.issn	0302-9743	en_US
dc.identifier.uri	http://hdl.handle.net/11693/75767	en_US
dc.language.iso	English	en_US
dc.publisher	Springer	en_US
dc.relation.isversionof	https://dx.doi.org/10.1007/978-3-030-45257-5_22	en_US
dc.source.title	Lecture Notes in Computer Science	en_US
dc.title	Potpourri: an epistasis test prioritization algorithm via diverse SNP selection	en_US
dc.type	Conference Paper	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Potpourri_an_epistasis_test_prioritization_algorithm_via_diverse_snp_selection.pdf
Size:: 339.59 KB
Format:: Adobe Portable Document Format
Description:: View / Download

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Scholarly Publications - Computer Engineering