Browsing by Subject "Gene mapping"
Now showing 1 - 6 of 6
- Results Per Page
- Sort Options
Item Open Access Early postzygotic mutations contribute to de novo variation in a healthy monozygotic twin pair(B M J Group, 2014) Dal, G. M.; Ergüner, B.; Saǧıroǧlu, M. S.; Yüksel, B.; Onat, O. E.; Alkan C.; Özçelik, T.Background: Human de novo single-nucleotide variation (SNV) rate is estimated to range between 0.82-1.70×10-8 mutations per base per generation. However, contribution of early postzygotic mutations to the overall human de novo SNV rate is unknown. Methods: We performed deep whole-genome sequencing (more than 30-fold coverage per individual) of the whole-blood-derived DNA samples of a healthy monozygotic twin pair and their parents. We examined the genotypes of each individual simultaneously for each of the SNVs and discovered de novo SNVs regarding the timing of mutagenesis. Putative de novo SNVs were validated using Sanger-based capillary sequencing. Results: We conservatively characterised 23 de novo SNVs shared by the twin pair, 8 de novo SNVs specific to twin I and 1 de novo SNV specific to twin II. Based on the number of de novo SNVs validated by Sanger sequencing and the number of callable bases of each twin, we calculated the overall de novo SNV rate of 1.31×10-8 and 1.01×10-8 for twin I and twin II, respectively. Of these, rates of the early postzygotic de novo SNVs were estimated to be 0.34×10-8 for twin I and 0.04×10-8 for twin II. Conclusions: Early postzygotic mutations constitute a substantial proportion of de novo mutations in humans. Therefore, genome mosaicism resulting from early mitotic events during embryogenesis is common and could substantially contribute to the development of diseases.Item Open Access Fast and accurate mapping of complete genomics reads(Academic Press, 2015) Lee, D.; Hormozdiari, F.; Xin, H.; Hach, F.; Mutlu, O.; Alkan C.Many recent advances in genomics and the expectations of personalized medicine are made possible thanks to power of high throughput sequencing (HTS) in sequencing large collections of human genomes. There are tens of different sequencing technologies currently available, and each HTS platform have different strengths and biases. This diversity both makes it possible to use different technologies to correct for shortcomings; but also requires to develop different algorithms for each platform due to the differences in data types and error models. The first problem to tackle in analyzing HTS data for resequencing applications is the read mapping stage, where many tools have been developed for the most popular HTS methods, but publicly available and open source aligners are still lacking for the Complete Genomics (CG) platform. Unfortunately, Burrows-Wheeler based methods are not practical for CG data due to the gapped nature of the reads generated by this method. Here we provide a sensitive read mapper (sirFAST) for the CG technology based on the seed-and-extend paradigm that can quickly map CG reads to a reference genome. We evaluate the performance and accuracy of sirFAST using both simulated and publicly available real data sets, showing high precision and recall rates.Item Open Access An integrated map of structural variation in 2,504 human genomes(Nature Publishing Group, 2015) Sudmant, P. H.; Rausch, T.; Gardner, E. J.; Handsaker, R. E.; Abyzov, A.; Huddleston, J.; Zhang, Y.; Ye, K.; Jun, G.; Fritz, M. Hsi-Yang; Konkel, M. K.; Malhotra, A.; Stütz, A. M.; Shi, X.; Casale, F. P.; Chen, J.; Hormozdiari, F.; Dayama, G.; Chen, K.; Malig, M.; Chaisson, M. J. P.; Walter, K.; Meiers, S.; Kashin, S.; Garrison, E.; Auton, A.; Lam, H. Y. K.; Mu, X. J.; Alkan, C.; Antaki, D.; Bae, T.; Cerveira, E.; Chines, P.; Chong, Z.; Clarke, L.; Dal, E.; Ding, L.; Emery, S.; Fan, X.; Gujral, M.; Kahveci, F.; Kidd, J. M.; Kong, Y.; Lameijer, Eric-Wubbo; McCarthy, S.; Flicek, P.; Gibbs, R. A.; Marth, G.; Mason, C. E.; Menelaou, A.; Muzny, D. M.; Nelson, B. J.; Noor, A.; Parrish, N. F.; Pendleton, M.; Quitadamo, A.; Raeder, B.; Schadt, E. E.; Romanovitch, M.; Schlattl, A.; Sebra, R.; Shabalin, A. A.; Untergasser, A.; Walker J. A.; Wang, M.; Yu, F.; Zhang, C.; Zhang, J.; Zheng-Bradley, X.; Zhou, W.; Zichner, T.; Sebat, J.; Batzer, M. A.; McCarroll, S. A.; Mills, R. E.; Gerstein, M. B.; Bashir, A.; Stegle, O.; Devine, S. E.; Lee, C.; Eichler, E. E.; Korbel, J. O.Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association.Item Open Access mrsFAST-Ultra: a compact, SNP-aware mapper for high performance sequencing applications(Oxford University Press, 2014) Hach, F.; Sarrafi, I.; Hormozdiari, F.; Alkan C.; Eichler, E. E.; Sahinalp, S. C.High throughput sequencing (HTS) platforms generate unprecedented amounts of data that introduce challenges for processing and downstream analysis. While tools that report the 'best' mapping location of each read provide a fast way to process HTS data, they are not suitable for many types of downstream analysis such as structural variation detection, where it is important to report multiple mapping loci for each read. For this purpose we introduce mrsFAST-Ultra, a fast, cache oblivious, SNP-aware aligner that can handle the multi-mapping of HTS reads very efficiently. mrsFAST-Ultra improves mrsFAST, our first cache oblivious read aligner capable of handling multi-mapping reads, through new and compact index structures that reduce not only the overall memory usage but also the number of CPU operations per alignment. In fact the size of the index generated by mrsFAST-Ultra is 10 times smaller than that of mrsFAST. As importantly, mrsFAST-Ultra introduces new features such as being able to (i) obtain the best mapping loci for each read, and (ii) return all reads that have at most n mapping loci (within an error threshold), together with these loci, for any user specified n. Furthermore, mrsFAST-Ultra is SNP-aware, i.e. it can map reads to reference genome while discounting the mismatches that occur at common SNP locations provided by db-SNP; this significantly increases the number of reads that can be mapped to the reference genome. Notice that all of the above features are implemented within the index structure and are not simple post-processing steps and thus are performed highly efficiently. Finally, mrsFAST-Ultra utilizes multiple available cores and processors and can be tuned for various memory settings. Our results show that mrsFAST-Ultra is roughly five times faster than its predecessor mrsFAST. In comparison to newly enhanced popular tools such as Bowtie2, it is more sensitive (it can report 10 times or more mappings per read) and much faster (six times or more) in the multi-mapping mode. Furthermore, mrsFAST-Ultra has an index size of 2GB for the entire human reference genome, which is roughly half of that of Bowtie2. mrsFAST-Ultra is open source and it can be accessed at http://mrsfast.sourceforge.net. © 2014 The Author(s).Item Open Access Multiplex systems for the amplification of short tandem repeat loci: evaluation of laser fluorescence detection(1997) Ricciardone, M. D.; Lins, A. M.; Schumm, J. W.; Holland, M. M.Short tandem repeat (STR) loci are ideal markers for personal identification and for genomic mapping. Two fluorescent multiplex systems, each designed for simultaneous PCR amplification of four polymorphic STR loci (HUMCSF1PO, HUMTPOX, HUMTH01 and HUMVWFA31, and HUMF13A01, HUMFESFPS, HUMBFXIII and HUMLIPOL), were evaluated on three laser fluorescence detection instruments. Concordant DNA typing results were obtained with all three detection methods. These fluorescent multiplex STR systems offer an accurate, reproducible and versatile method of DNA profiling that is well-suited for forensic identity testing and other genetic analyses.Item Open Access Novel VLDLR microdeletion identified in two Turkish siblings with pachygyria and pontocerebellar atrophy(Springer, 2010) Kolb, L. E.; Arlier, Z.; Yalcinkaya, C.; Ozturk, A. K.; Moliterno, J. A.; Erturk, O.; Bayrakli, F.; Korkmaz, B.; DiLuna, M. L.; Yasuno, K.; Bilguvar, K.; Ozcelik, T.; Tuysuz, B.; State, M. W.; Gunel, M.Congenital ataxia with cerebellar hypoplasia is a heterogeneous group of disorders that presents with motor disability, hypotonia, incoordination, and impaired motor development. Among these, disequilibrium syndrome describes a constellation of findings including non-progressive cerebellar ataxia, mental retardation, and cerebellar hypoplasia following an autosomal recessive pattern of inheritance and can be caused by mutations in the Very Low Density Lipoprotein Receptor (VLDLR). Interestingly, while the majority of patients with VLDL-associated cerebellar hypoplasia in the literature use bipedal gait, the previously reported patients of Turkish decent have demonstrated similar neurological sequelae, but rely on quadrupedal gait. We present a consanguinous Turkish family with two siblings with cerebellar atrophy, predominantly frontal pachygyria and ataxic bipedal gait, who were found to have a novel homozygous deletion in the VLDLR gene identified by using high-density single nucleotide polymorphism microarrays for homozygosity mapping and identification of CNVs within these regions. Discovery of disease causing homozygous deletions in the present Turkish family capable of maintaining bipedal movement exemplifies the phenotypic heterogeneity of VLDLR-associated cerebellar hypoplasia and ataxia.