Causal mutation discovery using next generation sequencing data: development and application of a pipeline to reduce false positive calls and to map regions of shared homozygosity and IBD

dc.contributor.authorGülsüner, Süleymanen_US
dc.contributor.authorWalsh, T.en_US
dc.contributor.authorWatts, A. C.en_US
dc.contributor.authorLee, M. K.en_US
dc.contributor.authorÖzçelik, Tayfunen_US
dc.contributor.authorKing, M. C.en_US
dc.coverage.spatialSan Francisco, CA, USAen_US
dc.date.accessioned2019-07-08T12:46:13Z
dc.date.available2019-07-08T12:46:13Z
dc.date.issued2012-11en_US
dc.departmentDepartment of Molecular Biology and Geneticsen_US
dc.descriptionConference Name: 62nd Annual Meeting of the American Society of Human Genetics, ASHG 2012en_US
dc.descriptionDate of Conference: 06-10 November 2012en_US
dc.description.abstractNext generation sequencing technologies have brought enormous successes for disease gene discovery but also challenges for data analysis, particularly in genomic regions with low or low quality sequence coverage. Errors in variant calling may lead to missing true variants or to calling many false positives. The false discovery rate can be reduced by optimizing variant calling thresholds such as quality of base pair identification, mapping, and alignment. However, such optimization strategies are often associated with the loss of true variants. We present and apply a pipeline for variant identification and verification using aligned sequences of related individuals. It is comprised of three modules: (1) an identification pipeline for de novo variants where data of parents and siblings are aligned in order to rule out false positive calls in children, false negative calls in parents, and indel artifacts; (2) a homozygosity mapping and IBD analysis module; and (3) a variant read depth module that reveals variants that may have been missed due to sequence coverage and quality issues. We applied module (1) to a large trio-based gene discovery project and reduced the number of variant calling errors by 74%, thereby significantly streamlining the experimental validation protocol for potential de novo variants. We also applied the pipeline to the discovery of the gene responsible for mega corpus callosum and microcephaly with developmental delay, and epilepsy in a brother and sister whose unaffected parents were first cousins. Our error correction pipeline significantly improved homozygosity mapping and IBD analysis and facilitated the rapid identification of the causal allele in this family.en_US
dc.description.provenanceSubmitted by Zeynep Aykut (zeynepay@bilkent.edu.tr) on 2019-07-08T12:46:13Z No. of bitstreams: 1 W-Progress.pdf: 179475 bytes, checksum: ea0bedeb05ac9ccfb983c327e155f0c2 (MD5)en
dc.description.provenanceMade available in DSpace on 2019-07-08T12:46:13Z (GMT). No. of bitstreams: 1 W-Progress.pdf: 179475 bytes, checksum: ea0bedeb05ac9ccfb983c327e155f0c2 (MD5) Previous issue date: 2012-11en
dc.identifier.urihttp://hdl.handle.net/11693/52149
dc.language.isoEnglishen_US
dc.publisherAmerican Society of Human Geneticsen_US
dc.source.title62nd Annual Meeting of the American Society of Human Genetics, ASHG 2012en_US
dc.titleCausal mutation discovery using next generation sequencing data: development and application of a pipeline to reduce false positive calls and to map regions of shared homozygosity and IBDen_US
dc.typeConference Paperen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
W-Progress.pdf
Size:
175.27 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: