Accelerating read mapping with FastHASH
BioMed Central Ltd.
Please cite this item using this persistent URLhttp://hdl.handle.net/11693/21174
With the introduction of next-generation sequencing (NGS) technologies, we are facing an exponential increase in the amount of genomic sequence data. The success of all medical and genetic applications of next-generation sequencing critically depends on the existence of computational techniques that can process and analyze the enormous amount of sequence data quickly and accurately. Unfortunately, the current read mapping algorithms have difficulties in coping with the massive amounts of data generated by NGS. We propose a new algorithm, FastHASH, which drastically improves the performance of the seed-and-extend type hash table based read mapping algorithms, while maintaining the high sensitivity and comprehensiveness of such methods. FastHASH is a generic algorithm compatible with all seed-and-extend class read mapping algorithms. It introduces two main techniques, namely Adjacency Filtering, and Cheap K-mer Selection. We implemented FastHASH and merged it into the codebase of the popular read mapping program, mrFAST. Depending on the edit distance cutoffs, we observed up to 19-fold speedup while still maintaining 100% sensitivity and high comprehensiveness. © 2013 Xin et al.
- Research Paper 
Showing items related by title, author, creator and subject.
Huddleston J.; Ranade, S.; Malig, M.; Antonacci F.; Chaisson, M.; Hon L.; Sudmant P.H.; Graves, T.A.; Alkan, C.; Dennis, M.Y.; Wilson, R.K.; Turner, S.W.; Korlach J.; Eichler, E.E. (Cold Spring Harbor Laboratory Press, 2014)Obtaining high-quality sequence continuity of complex regions of recent segmental duplication remains one of the major challenges of finishing genome assemblies. In the human and mouse genomes, this was achieved by targeting ...
Eslami Rasekh M.; Chiatante G.; Miroballo M.; Tang J.; Ventura M.; Amemiya C.T.; Eichler E.E.; Antonacci F.; Alkan C. (BioMed Central Ltd., 2017)Background: Although many algorithms are now available that aim to characterize different classes of structural variation, discovery of balanced rearrangements such as inversions remains an open problem. This is mainly due ...
Dalkic, E.; Kuscu, C.; Sucularli, C.; Aydin I.T.; Akcali, K.C.; Konu O. (2006)Robo2, a member of the robo gene family, functions as a repulsive axon guidance receptor as well as a regulator of cell migration and tissue morphogenesis in different taxa. In this study, a novel isoform of the zebrafish ...