Accelerating read mapping with FastHASH

Xin, H.; Lee, D.; Hormozdiari, F.; Yedkar, S.; Mutlu, O.; Alkan C.

Accelerating read mapping with FastHASH

dc.citation.epage	13	en_US
dc.citation.spage	1	en_US
dc.citation.volumeNumber	14	en_US
dc.contributor.author	Xin, H.	en_US
dc.contributor.author	Lee, D.	en_US
dc.contributor.author	Hormozdiari, F.	en_US
dc.contributor.author	Yedkar, S.	en_US
dc.contributor.author	Mutlu, O.	en_US
dc.contributor.author	Alkan C.	en_US
dc.date.accessioned	2016-02-08T09:42:26Z
dc.date.available	2016-02-08T09:42:26Z
dc.date.issued	2013	en_US
dc.department	Department of Computer Engineering	en_US
dc.description.abstract	With the introduction of next-generation sequencing (NGS) technologies, we are facing an exponential increase in the amount of genomic sequence data. The success of all medical and genetic applications of next-generation sequencing critically depends on the existence of computational techniques that can process and analyze the enormous amount of sequence data quickly and accurately. Unfortunately, the current read mapping algorithms have difficulties in coping with the massive amounts of data generated by NGS. We propose a new algorithm, FastHASH, which drastically improves the performance of the seed-and-extend type hash table based read mapping algorithms, while maintaining the high sensitivity and comprehensiveness of such methods. FastHASH is a generic algorithm compatible with all seed-and-extend class read mapping algorithms. It introduces two main techniques, namely Adjacency Filtering, and Cheap K-mer Selection. We implemented FastHASH and merged it into the codebase of the popular read mapping program, mrFAST. Depending on the edit distance cutoffs, we observed up to 19-fold speedup while still maintaining 100% sensitivity and high comprehensiveness. © 2013 Xin et al.	en_US
dc.identifier.doi	10.1186/1471-2164-14-S1-S13	en_US
dc.identifier.issn	1471-2164	en_US
dc.identifier.uri	http://hdl.handle.net/11693/21174	en_US
dc.language.iso	English	en_US
dc.publisher	BioMed Central Ltd.	en_US
dc.relation.isversionof	http://dx.doi.org/10.1186/1471-2164-14-S1-S13	en_US
dc.source.title	BMC Genomics	en_US
dc.subject	Reference genome	en_US
dc.subject	True location	en_US
dc.subject	Hash table	en_US
dc.subject	Dynamic programming algorithm	en_US
dc.subject	Edit distance	en_US
dc.title	Accelerating read mapping with FastHASH	en_US
dc.type	Article	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Accelerating read mapping with FastHASH.pdf
Size:: 1.46 MB
Format:: Adobe Portable Document Format
Description:: Full printable version

Download

Collections

Scholarly Publications - Computer Engineering