Now showing items 1-20 of 29

    • Accelerating genome analysis: a primer on an ongoing journey 

      Alser, M.; Zülal, Bingöl; Cali, D. S.; Kim, J.; Ghose, S.; Alkan, Can; Mutlu, Onur (IEEE, 2020)
      Genome analysis fundamentally starts with a process known as read mapping, where sequenced fragments of an organism's genome are compared against a reference genome. Read mapping is currently a major bottleneck in the ...
    • Apollo: A sequencing-technology-independent, scalable and accurate assembly polishing algorithm 

      Fırtına, C.; Kim, J. S.; Alser, M.; Şenol Cali, D.; Çiçek, A. Ercüment; Alkan, Can; Mutlu, Onur (Oxford University Press, 2020-03)
      Motivation: Third-generation sequencing technologies can sequence long reads that contain as many as 2 million base pairs. These long reads are used to construct an assembly (i.e. the subject’s genome), which is further ...
    • Author Correction: A robust benchmark for detection of germline large deletions and insertions 

      Zook, J. M.; Hansen, N. F.; Olson, N. D.; Chapman, L.; Mullikin, J. C.; Xiao, C.; Sherry, S.; Koren, S.; Phillippy, A. M.; Boutros, P. C.; Sahraeian, S. M. E.; Huang, V.; Rouette, A.; Alexander, N.; Mason, C. E.; Hajirasouliha, I.; Ricketts, C.; Lee, J.; Tearle, R.; Fiddes, I. T.; Barrio, A. M.; Wala, J.; Carroll, A.; Ghaffari, N.; Rodriguez, O. L.; Bashir, A.; Jackman, S.; Farrell, J. J.; Wenger, A. M.; Alkan, Can; Söylev, A.; Schatz, M. C.; Garg, S.; Church, G.; Marschall, T.; Chen, K.; Fan, X.; English, A. C.; Rosenfeld, J. A.; Zhou, W.; Mills, R. E.; Sage, J. M.; Davis, J. R.; Kaiser, M. D.; Oliver, J. S.; Catalano, A. P.; Chaisson, M. J. P.; Spies, N.; Sedlazeck, F. J.; Salit, M. (Nature Research, 2020)
      New technologies and analysis methods are enabling genomic structural variants (SVs) to be detected with ever-increasing accuracy, resolution and comprehensiveness. To help translate these methods to routine research and ...
    • Automatic characterization of copy number polymorphism using high throughput sequencing 

      Alkan, Can (TÜBİTAK, 2020)
      Genome structural variation, broadly defined as alterations longer than 50 bp, are important sources for genetic variation among humans, including those that cause complex diseases such as autism, developmental delay, and ...
    • BISER: fast characterization of segmental duplication structure in multiple genome assemblies 

      Išerić, Hamza; Alkan, Can; Hach, Faraz; Numanagić, Ibrahim (Schloss Dagstuhl- Leibniz-Zentrum fur Informatik, 2021-07-22)
      The increasing availability of high-quality genome assemblies raised interest in the characterization of genomic architecture. Major architectural parts, such as common repeats and segmental duplications (SDs), increase ...
    • Building and improving reference genome assemblies 

      Steinberg, K. M.; Schneider, V. A.; Alkan, Can; Montague, M. J.; Warren, W. C.; Church, D. M.; Wilson, R. K. (IEEE, 2017-01)
      A genome sequence assembly provides the foundation for studies of genotypic and phenotypic variation, genome structure, and evolution of the target organism. In the past four decades, there has been a surge of new sequencing ...
    • Can you really anonymize the donors of genomic data in today’s digital world? 

      Alser, Mohammed; Almadhoun, Nour; Nouri, Azita; Alkan, Can; Ayday, Erman (Springer, 2016-09)
      The rapid progress in genome sequencing technologies leads to availability of high amounts of genomic data. Accelerating the pace of biomedical breakthroughs and discoveries necessitates not only collecting millions of ...
    • Characterizing microsatellite polymorphisms using assembly-based and mapping-based tools 

      Demir, Gülfem; Alkan, Can (Scientific and Technical Research Council of Turkey, 2019)
      Microsatellite polymorphism has always been a challenge for genome assembly and sequence alignment due to sequencing errors, short read lengths, and high incidence of polymerase slippage in microsatellite regions. Despite ...
    • Computational pan-genomics: status, promises and challenges 

      The Computational Pan-Genomics Consortium; Alkan, Can (Oxford University Press, 2018-01-01)
      Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of ...
    • CONGA: Copy number variation genotyping in ancient genomes and low-coverage sequencing data 

      Söylev, Arda; Çokoglu, Sevim Seda; Koptekin, Dilek; Alkan, Can; Somel, Mehmet (Public Library of Science, 2022-12-14)
      To date, ancient genome analyses have been largely confined to the study of single nucleotide polymorphisms (SNPs). Copy number variants (CNVs) are a major contributor of disease and of evolutionary adaptation, but identifying ...
    • Discovery of tandem and interspersed segmental duplications using high-throughput sequencing 

      Söylev, Arda; Le, T. M.; Amini, H.; Alkan, Can; Hormozdiari, F. (Oxford University Press, 2019-04)
      Motivation: Several algorithms have been developed that use high-throughput sequencing technology to characterize structural variations (SVs). Most of the existing approaches focus on detecting relatively simple types of ...
    • Evaluation of genome scaffolding tools using pooled clone sequencin 

      Dal, Elif; Alkan, Can (Scientific and Technical Research Council of Turkey, 2018)
      DNA sequencing technologies hold great promise in generating information that will guide scientists to understand how the genome affects human health and organismal evolution. The process of generating raw genome sequence ...
    • Fast characterization of segmental duplication structure in multiple genome assemblies 

      Išerić, Hamza; Alkan, Can; Hach, Faraz; Numanagić, Ibrahim (BioMed Central Ltd, 2022-12)
      Motivation: The increasing availability of high-quality genome assemblies raised interest in the characterization of genomic architecture. Major architectural elements, such as common repeats and segmental duplications ...
    • Fast characterization of segmental duplications in genome assemblies 

      Numanagić, I.; Gökkaya, Alim S.; Zhang, L.; Berger, B.; Alkan, Can; Hach, F. (Oxford University Press, 2018)
      Motivation Segmental duplications (SDs) or low-copy repeats, are segments of DNA > 1 Kbp with high sequence identity that are copied to other regions of the genome. SDs are among the most important sources of evolution, a ...
    • FastRemap: a tool for quickly remapping reads between genome assemblies 

      Kim, J. S.; Firtina, C.; Cavlak, M. B.; Cali, D. S.; Alkan, Can; Mutlu, Onur (Oxford University Press, 2022-08-17)
      Motivation: A genome read dataset can be quickly and efficiently remapped from one reference to another similar reference (e.g., between two reference versions or two similar species) using a variety of tools, e.g., the ...
    • GateKeeper-GPU: fast and accurate pre-alignment filtering in short read mapping 

      Bingöl, Zülal; Alser, Mohammed; Mutlu, Onur; Öztürk, Özcan; Alkan, Can (IEEE, 2021-06-24)
      We introduce GateKeeper-GPU, a fast and accurate pre-alignment filter that efficiently reduces the need for expensive sequence alignment. GateKeeper-GPU improves the filtering accuracy of GateKeeper, and by exploiting the ...
    • GenASM: a high-performance, low-power approximate string matching acceleration framework for genome sequence analysis 

      Şenol-Çalı, D.; Kalsi, G. S.; Bingöl, Zülal; Fırtına, C.; Subramanian, L.; Kim, J. S.; Ausavarungnirun, R.; Alser, M.; Gomez-Luna, J.; Boroumand, A.; Norion, A.; Scibisz, A.; Subramoneyon, S.; Alkan, Can; Ghose, S.; Mutlu, Onur (IEEE Computer Society, 2020)
      Genome sequence analysis has enabled significant advancements in medical and scientific areas such as personalized medicine, outbreak tracing, and the understanding of evolution. To perform genome sequencing, devices extract ...
    • Hercules: a profile HMM-based hybrid error correction algorithm for long reads 

      Fırtına, Can; Bar-Joseph, Z.; Alkan, Can; Çicek, A. Ercüment (Oxford University Press, 2018)
      Choosing whether to use second or third generation sequencing platforms can lead to trade-offs between accuracy and read length. Several types of studies require long and accurate reads. In such cases researchers often ...
    • Implications of the first complete human genome assembly 

      Alkan, Can; Carbone, Lucia; Dennis, Megan; Ernst, Jason; Evrony, Gilad; Girirajan, Santhosh; Leung, Danny Chi Yeu; Cheng, Clooney C.Y.; MacAlpine, David; Ni, Ting; Ramsay, Michèle; Rowe, Helen (Cold Spring Harbor Laboratory Press, 2022-03-31)
    • Improving genome assemblies using multi-platform sequence data 

      Kavak, P.; Ergüner, B.; Üstek, D.; Yüksel, B.; Saǧıroǧlu, M. Ş.; Güngör, T.; Alkan, Can (Springer, 2015-09)
      Accurate de novo assembly using short reads generated by next generation sequencing technologies is still an open problem. Although there are several assembly algorithms developed for data generated with different sequencing ...