Computational pan-genomics: status, promises and challenges

buir.contributor.author: Alkan, Can
dc.citation.epage: 135 [en_US]
dc.citation.issueNumber: 1 [en_US]
dc.citation.spage: 118 [en_US]
dc.citation.volumeNumber: 19 [en_US]
dc.contributor.author: The Computational Pan-Genomics Consortium [en_US]
dc.contributor.author: Alkan, Can [en_US]
dc.date.accessioned: 2019-02-12T09:21:27Z
dc.date.available: 2019-02-12T09:21:27Z
dc.date.issued: 2018-01-01 [en_US]
dc.department: Department of Computer Engineering [en_US]
dc.description.abstract: Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In the case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic data sets. Instead, novel, qualitatively different computational methods and paradigms are needed. We will witness the rapid extension of computational pan-genomics, a new sub-area of research in computational biology. In this article, we generalize existing definitions and understand a pan-genome as any collection of genomic sequences to be analyzed jointly or to be used as a reference. We examine already available approaches to construct and use pan-genomes, discuss the potential benefits of future technologies and methodologies and review open challenges from the vantage point of the above-mentioned biological disciplines. As a prominent example of a computational paradigm shift, we particularly highlight the transition from the representation of reference genomes as strings to representations as graphs. We outline how this and other challenges from different application domains translate into common computational problems, point out relevant bioinformatics techniques and identify open problems in computer science. With this review, we aim to increase awareness that a joint approach to computational pan-genomics can help address many of the problems currently faced in various domains. [en_US]
dc.description.provenance: Submitted by Türkan Cesur (cturkan@bilkent.edu.tr) on 2019-02-12T09:21:27Z No. of bitstreams: 1 Computational_pan-genomics_status,_promises_and_challenges.pdf: 903992 bytes, checksum: 2b2c84334cef5822f82c3117979c892a (MD5) [en]
dc.description.provenance: Made available in DSpace on 2019-02-12T09:21:27Z (GMT). No. of bitstreams: 1 Computational_pan-genomics_status,_promises_and_challenges.pdf: 903992 bytes, checksum: 2b2c84334cef5822f82c3117979c892a (MD5) Previous issue date: 2018-01-01 [en]
dc.identifier.doi: 10.1093/bib/bbw089 [en_US]
dc.identifier.eissn: 1477-4054
dc.identifier.issn: 1467-5463
dc.identifier.uri: http://hdl.handle.net/11693/49309
dc.language.iso: English [en_US]
dc.publisher: Oxford University Press [en_US]
dc.relation.isversionof: http://doi.org/10.1093/bib/bbw089 [en_US]
dc.source.title: Briefings in Bioinformatics [en_US]
dc.subject: Pan-genome [en_US]
dc.subject: Sequence graph [en_US]
dc.subject: Read mapping [en_US]
dc.subject: Haplotypes [en_US]
dc.subject: Data structures [en_US]
dc.title: Computational pan-genomics: status, promises and challenges [en_US]
dc.type: Article [en_US]

Files

Original bundle

Name: Computational_pan-genomics_status,_promises_and_challenges.pdf
Size: 882.8 KB
Format: Adobe Portable Document Format
Description: Full printable version

License bundle

Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission