CONGA: Copy number variation genotyping in ancient genomes and low-coverage sequencing data

buir.contributor.authorAlkan, Can
buir.contributor.orcidAlkan, Can|0000-0002-5443-0706
dc.citation.epage32en_US
dc.citation.issueNumber12en_US
dc.citation.spage1en_US
dc.citation.volumeNumber18en_US
dc.contributor.authorSöylev, Arda
dc.contributor.authorÇokoglu, Sevim Seda
dc.contributor.authorKoptekin, Dilek
dc.contributor.authorAlkan, Can
dc.contributor.authorSomel, Mehmet
dc.date.accessioned2023-03-01T11:23:44Z
dc.date.available2023-03-01T11:23:44Z
dc.date.issued2022-12-14
dc.departmentDepartment of Computer Engineeringen_US
dc.description.abstractTo date, ancient genome analyses have been largely confined to the study of single nucleotide polymorphisms (SNPs). Copy number variants (CNVs) are a major contributor of disease and of evolutionary adaptation, but identifying CNVs in ancient shotgun-sequenced genomes is hampered by typical low genome coverage (<1×) and short fragments ([removed]1 kbps with F-scores >0.75 at ≥1×, and distinguish between heterozygous and homozygous states. We used CONGA to genotype 10,002 outgroup-ascertained deletions across a heterogenous set of 71 ancient human genomes spanning the last 50,000 years, produced using variable experimental protocols. A fraction of these (21/71) display divergent deletion profiles unrelated to their population origin, but attributable to technical factors such as coverage and read length. The majority of the sample (50/71), despite originating from nine different laboratories and having coverages ranging from 0.44×-26× (median 4×) and average read lengths 52-121 bps (median 69), exhibit coherent deletion frequencies. Across these 50 genomes, inter-individual genetic diversity measured using SNPs and CONGA-genotyped deletions are highly correlated. CONGA-genotyped deletions also display purifying selection signatures, as expected. CONGA thus paves the way for systematic CNV analyses in ancient genomes, despite the technical challenges posed by low and variable genome coverage. © 2022 Söylev et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.en_US
dc.description.provenanceSubmitted by Cem Çağatay Akgün (cem.akgun@bilkent.edu.tr) on 2023-03-01T11:23:44Z No. of bitstreams: 1 CONGA_Copy_number_variation_genotyping_in_ancient_genomes_and_low-coverage_sequencing_data.pdf: 2025441 bytes, checksum: ea13ea435b265413e875666ca728daf9 (MD5)en
dc.description.provenanceMade available in DSpace on 2023-03-01T11:23:44Z (GMT). No. of bitstreams: 1 CONGA_Copy_number_variation_genotyping_in_ancient_genomes_and_low-coverage_sequencing_data.pdf: 2025441 bytes, checksum: ea13ea435b265413e875666ca728daf9 (MD5) Previous issue date: 2022-12-14en
dc.identifier.doi10.1371/journal.pcbi.1010788en_US
dc.identifier.issn1553734Xen_US
dc.identifier.urihttp://hdl.handle.net/11693/111995en_US
dc.language.isoEnglishen_US
dc.publisherPublic Library of Scienceen_US
dc.relation.isversionofhttps://dx.doi.org/10.1371/journal.pcbi.1010788en_US
dc.source.titlePLoS Computational Biologyen_US
dc.subjectDNA Copy Number Variationsen_US
dc.subjectGenetics, Populationen_US
dc.subjectGenome, Humanen_US
dc.subjectGenomicsen_US
dc.subjectGenotypeen_US
dc.subjectHumansen_US
dc.subjectPolymorphism, Single Nucleotideen_US
dc.titleCONGA: Copy number variation genotyping in ancient genomes and low-coverage sequencing dataen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
CONGA_Copy_number_variation_genotyping_in_ancient_genomes_and_low-coverage_sequencing_data.pdf
Size:
1.93 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.69 KB
Format:
Item-specific license agreed upon to submission
Description: