Alser, M.Hassan, H.Xin, H.Ergin, O.Mutlu, O.Alkan C.2019-02-132019-02-132017-11-011367-4803http://hdl.handle.net/11693/49407High throughput DNA sequencing (HTS) technologies generate an excessive number of small DNA segments -called short reads- that cause significant computational burden. To analyze the entire genome, each of the billions of short reads must be mapped to a reference genome based on the similarity between a read and ‘candidate’ locations in that reference genome. The similarity measurement, called alignment, formulated as an approximate string matching problem, is the computational bottleneck because: (i) it is implemented using quadratic-time dynamic programming algorithms and (ii) the majority of candidate locations in the reference genome do not align with a given read due to high dissimilarity. Calculating the alignment of such incorrect candidate locations consumes an overwhelming majority of a modern read mapper’s execution time. Therefore, it is crucial to develop a fast and effective filter that can detect incorrect candidate locations and eliminate them before invoking computationally costly alignment algorithms.EnglishGateKeeper: a new hardware architecture for accelerating pre-alignment in DNA short read mappingArticle10.1093/bioinformatics/btx3421367-4811