Discovery of large genomic inversions using long range information

dc.citation.epage12en_US
dc.citation.issueNumber65en_US
dc.citation.spage1en_US
dc.citation.volumeNumber18en_US
dc.contributor.authorRasekh, M. E.en_US
dc.contributor.authorChiatante, G.en_US
dc.contributor.authorMiroballo, M.en_US
dc.contributor.authorTang, J.en_US
dc.contributor.authorVentura M.en_US
dc.contributor.authorAmemiya, C. T.en_US
dc.contributor.authorEichler, E. E.en_US
dc.contributor.authorAntonacci, F.en_US
dc.contributor.authorAlkan C.en_US
dc.date.accessioned2018-04-12T11:01:21Z
dc.date.available2018-04-12T11:01:21Z
dc.date.issued2017en_US
dc.departmentDepartment of Computer Engineeringen_US
dc.description.abstractAlthough many algorithms are now available that aim to characterize different classes of structural variation, discovery of balanced rearrangements such as inversions remains an open problem. This is mainly due to the fact that breakpoints of such events typically lie within segmental duplications or common repeats, which reduces the mappability of short reads. The algorithms developed within the 1000 Genomes Project to identify inversions are limited to relatively short inversions, and there are currently no available algorithms to discover large inversions using high throughput sequencing technologies. Results: Here we propose a novel algorithm, Valor, to discover large inversions using new sequencing methods that provide long range information such as 10X Genomics linked-read sequencing, pooled clone sequencing, or other similar technologies that we commonly refer to as long range sequencing. We demonstrate the utility of Valor using both pooled clone sequencing and 10X Genomics linked-read sequencing generated from the genome of an individual from the HapMap project (NA12878). We also provide a comprehensive comparison of Valor against several state-of-the-art structural variation discovery algorithms that use whole genome shotgun sequencing data. Conclusions: In this paper, we show that Valor is able to accurately discover all previously identified and experimentally validated large inversions in the same genome with a low false discovery rate. Using Valor, we also predicted a novel inversion, which we validated using fluorescent in situ hybridization. Valor is available at https://github.com/BilkentCompGen/Valor. © 2017 The Author(s).en_US
dc.description.provenanceMade available in DSpace on 2018-04-12T11:01:21Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 179475 bytes, checksum: ea0bedeb05ac9ccfb983c327e155f0c2 (MD5) Previous issue date: 2017en
dc.identifier.doi10.1186/s12864-016-3444-1en_US
dc.identifier.eissn1471-2164
dc.identifier.urihttp://hdl.handle.net/11693/37048
dc.language.isoEnglishen_US
dc.publisherBioMed Central Ltd.en_US
dc.relation.isversionofhttp://dx.doi.org/10.1186/s12864-016-3444-1en_US
dc.source.titleBMC Genomicsen_US
dc.subjectStructural variationen_US
dc.subjectLong range sequencingen_US
dc.subjectLinked-readsen_US
dc.subjectInversionen_US
dc.subjectRead cloudsen_US
dc.titleDiscovery of large genomic inversions using long range informationen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Discovery of large genomic inversions.pdf
Size:
3.04 MB
Format:
Adobe Portable Document Format
Description:
Full printable version