Causal mutation discovery using next generation sequencing data: development and application of a pipeline to reduce false positive calls and to map regions of shared homozygosity and IBD

Gülsüner, Süleyman; Walsh, T.; Watts, A. C.; Lee, M. K.; Özçelik, Tayfun; King, M. C.

Causal mutation discovery using next generation sequencing data: development and application of a pipeline to reduce false positive calls and to map regions of shared homozygosity and IBD

dc.contributor.author	Gülsüner, Süleyman	en_US
dc.contributor.author	Walsh, T.	en_US
dc.contributor.author	Watts, A. C.	en_US
dc.contributor.author	Lee, M. K.	en_US
dc.contributor.author	Özçelik, Tayfun	en_US
dc.contributor.author	King, M. C.	en_US
dc.coverage.spatial	San Francisco, CA, USA	en_US
dc.date.accessioned	2019-07-08T12:46:13Z
dc.date.available	2019-07-08T12:46:13Z
dc.date.issued	2012-11	en_US
dc.department	Department of Molecular Biology and Genetics	en_US
dc.description	Conference Name: 62nd Annual Meeting of the American Society of Human Genetics, ASHG 2012	en_US
dc.description	Date of Conference: 06-10 November 2012	en_US
dc.description.abstract	Next generation sequencing technologies have brought enormous successes for disease gene discovery but also challenges for data analysis, particularly in genomic regions with low or low quality sequence coverage. Errors in variant calling may lead to missing true variants or to calling many false positives. The false discovery rate can be reduced by optimizing variant calling thresholds such as quality of base pair identification, mapping, and alignment. However, such optimization strategies are often associated with the loss of true variants. We present and apply a pipeline for variant identification and verification using aligned sequences of related individuals. It is comprised of three modules: (1) an identification pipeline for de novo variants where data of parents and siblings are aligned in order to rule out false positive calls in children, false negative calls in parents, and indel artifacts; (2) a homozygosity mapping and IBD analysis module; and (3) a variant read depth module that reveals variants that may have been missed due to sequence coverage and quality issues. We applied module (1) to a large trio-based gene discovery project and reduced the number of variant calling errors by 74%, thereby significantly streamlining the experimental validation protocol for potential de novo variants. We also applied the pipeline to the discovery of the gene responsible for mega corpus callosum and microcephaly with developmental delay, and epilepsy in a brother and sister whose unaffected parents were first cousins. Our error correction pipeline significantly improved homozygosity mapping and IBD analysis and facilitated the rapid identification of the causal allele in this family.	en_US
dc.identifier.uri	http://hdl.handle.net/11693/52149
dc.language.iso	English	en_US
dc.publisher	American Society of Human Genetics	en_US
dc.source.title	62nd Annual Meeting of the American Society of Human Genetics, ASHG 2012	en_US
dc.title	Causal mutation discovery using next generation sequencing data: development and application of a pipeline to reduce false positive calls and to map regions of shared homozygosity and IBD	en_US
dc.type	Conference Paper	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: W-Progress.pdf
Size:: 175.27 KB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Scholarly Publications - Work in Progress