Large structural variation discovery using long reads with several degrees of error
buir.advisor | Alkan, Can | |
dc.contributor.author | Ebren, Ezgi | |
dc.date.accessioned | 2021-02-03T13:05:41Z | |
dc.date.available | 2021-02-03T13:05:41Z | |
dc.date.copyright | 2020-12 | |
dc.date.issued | 2020-12 | |
dc.date.submitted | 2021-02-01 | |
dc.description | Cataloged from PDF version of article. | en_US |
dc.description | Thesis (M.S.): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2020. | en_US |
dc.description | Includes bibliographical references (leaves 36-45). | en_US |
dc.description.abstract | Genomic structural variations (SVs) are briefly defined as large-scale alterations of DNA content, copy, and organization. Although significant progress has been made since the introduction of high throughput sequencing (HTS) in character-izing SVs, accurate detection of complex SVs and balanced rearrangements still remains elusive due to the sequence complexity at the breakpoints. Until very recently, the difficulty of read mapping in such regions when the reads were short and the high error rates of long read platforms kept the problem challenging. However, with the introduction of the Pacific Biosciences’ High Fidelity (HiFi) sequencing methodology, powerful SV detection and breakpoint resolution be-came possible as a result of its capability to produce highly accurate (> 99%) long reads (10 − 20 kbps). Here, we introduce DALEK, a novel algorithm that aims to use long-read tech-nologies to discover large structural variations with high break-point resolution. DALEK uses split read and read depth signatures from long read data to dis-cover large (≥ 10 kbps) deletions, inversions and segmental duplications. We also develop methods to detect large SVs in existing high-error Oxford Nanopore Technologies data. | en_US |
dc.description.provenance | Submitted by Betül Özen (ozen@bilkent.edu.tr) on 2021-02-03T13:05:41Z No. of bitstreams: 1 10377737.pdf: 1133300 bytes, checksum: 6abd15b670556584d5be4c4343e70784 (MD5) | en |
dc.description.provenance | Made available in DSpace on 2021-02-03T13:05:41Z (GMT). No. of bitstreams: 1 10377737.pdf: 1133300 bytes, checksum: 6abd15b670556584d5be4c4343e70784 (MD5) Previous issue date: 2021-01 | en |
dc.description.statementofresponsibility | by Ezgi Ebren | en_US |
dc.embargo.release | 2021-07-28 | |
dc.format.extent | xi, 47 leaves : charts ; 30 cm. | en_US |
dc.identifier.itemid | B155544 | |
dc.identifier.uri | http://hdl.handle.net/11693/54977 | |
dc.language.iso | English | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | Structural variation | en_US |
dc.subject | Deletion | en_US |
dc.subject | Inversion | en_US |
dc.subject | Segmental duplication | en_US |
dc.subject | HiFi reads | en_US |
dc.subject | Long reads | en_US |
dc.title | Large structural variation discovery using long reads with several degrees of error | en_US |
dc.title.alternative | Farklı hata oranlarına sahip uzun okumalar ile büyük yapısal varyasyon tespiti | en_US |
dc.type | Thesis | en_US |
thesis.degree.discipline | Computer Engineering | |
thesis.degree.grantor | Bilkent University | |
thesis.degree.level | Master's | |
thesis.degree.name | MS (Master of Science) |