MAGNET: understanding and improving the accuracy of genome pre-alignment filtering

dc.citation.epage10en_US
dc.citation.issueNumber2en_US
dc.citation.spage1en_US
dc.citation.volumeNumber13en_US
dc.contributor.authorAlser, M.en_US
dc.contributor.authorMutlu, O.en_US
dc.contributor.authorAlkan C.en_US
dc.date.accessioned2019-02-13T10:45:05Z
dc.date.available2019-02-13T10:45:05Z
dc.date.issued2017en_US
dc.departmentDepartment of Computer Engineeringen_US
dc.description.abstractIn the era of high throughput DNA sequencing (HTS) technologies, calculating the edit distance (i.e.,the minimum number of substitutions, insertions, and deletionsbetween a pair of sequences) forbillions of genomicsequences is the computational bottleneck intoday’s read mappers. The shifted Hamming distance (SHD) algorithm proposes afast filtering strategy that can rapidly filter out invalid mappings that have more edits than allowed. However, SHD shows high inaccuracy in its filtering by admitting invalid mappings to be marked as correct ones. This wastesthe execution time and imposesa large computational burden. In this work, we comprehensively investigate foursources that lead to the filtering inaccuracy. We propose MAGNET, anewfiltering strategy that maintains high accuracy across different edit distance thresholds and data sets. It significantly improvestheaccuracy of pre-alignment filtering by one to twoordersof magnitude.The MATLAB implementationsof MAGNETand SHDareopen source and available at:https://github.com/BilkentCompGen/MAGNET.en_US
dc.identifier.issn1820-4503en_US
dc.identifier.urihttp://hdl.handle.net/11693/49418en_US
dc.language.isoEnglishen_US
dc.publisherI P S Ien_US
dc.source.titleTransactions on Internet Researchen_US
dc.subjectHigh throughput DNA sequencingen_US
dc.subjectRead mappingen_US
dc.subjectRead alignmenten_US
dc.subjectFalse positivesen_US
dc.titleMAGNET: understanding and improving the accuracy of genome pre-alignment filteringen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
MAGNET understanding_and_improvingthe_accuracy_of_genome_pre-alignment_filtering.pdf
Size:
770.32 KB
Format:
Adobe Portable Document Format
Description:
Full printable version

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: