Accelerating read mapping with FastHASH

Xin, H.; Lee, D.; Hormozdiari, F.; Yedkar, S.; Mutlu, O.; Alkan C.

Accelerating read mapping with FastHASH

Files

Accelerating read mapping with FastHASH.pdf (1.46 MB)

Date

2013

Authors

Xin, H.

Lee, D.

Hormozdiari, F.

Yedkar, S.

Mutlu, O.

Alkan C.

BUIR Usage Stats

5
views

24
downloads

Citation Stats

Attention Stats

Abstract

With the introduction of next-generation sequencing (NGS) technologies, we are facing an exponential increase in the amount of genomic sequence data. The success of all medical and genetic applications of next-generation sequencing critically depends on the existence of computational techniques that can process and analyze the enormous amount of sequence data quickly and accurately. Unfortunately, the current read mapping algorithms have difficulties in coping with the massive amounts of data generated by NGS. We propose a new algorithm, FastHASH, which drastically improves the performance of the seed-and-extend type hash table based read mapping algorithms, while maintaining the high sensitivity and comprehensiveness of such methods. FastHASH is a generic algorithm compatible with all seed-and-extend class read mapping algorithms. It introduces two main techniques, namely Adjacency Filtering, and Cheap K-mer Selection. We implemented FastHASH and merged it into the codebase of the popular read mapping program, mrFAST. Depending on the edit distance cutoffs, we observed up to 19-fold speedup while still maintaining 100% sensitivity and high comprehensiveness. © 2013 Xin et al.

Source Title

BMC Genomics

Publisher

BioMed Central Ltd.

Keywords

Reference genome, True location, Hash table, Dynamic programming algorithm, Edit distance

Permalink

http://hdl.handle.net/11693/21174

Published Version (Please cite this version)

http://dx.doi.org/10.1186/1471-2164-14-S1-S13

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Article

Full item page

Accelerating read mapping with FastHASH

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Attention Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

Accelerating read mapping with FastHASH

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Attention Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type