Reconstructing complex regions of genomes using long-read sequencing technology

Huddleston, J.; Ranade, S.; Malig, M.; Antonacci, F.; Chaisson, M.; Hon, L.; Sudmant, P. H.; Alkan, C.; Eichler, E. E.; Graves, T. A.; Dennis, M. Y.; Wilson, R. K.; Turner, S. W.; Korlach, J.

Files

8295.pdf (299.1 KB)

Date

2014

Authors

Huddleston, J.

Ranade, S.

Malig, M.

Antonacci, F.

Chaisson, M.

Hon, L.

Sudmant, P. H.

Alkan, C.

Eichler, E. E.

Graves, T. A.

Abstract

Obtaining high-quality sequence continuity of complex regions of recent segmental duplication remains one of the major challenges of finishing genome assemblies. In the human and mouse genomes, this was achieved by targeting large-insert clones using costly and laborious capillary-based sequencing approaches. Sanger shotgun sequencing of clone inserts, however, has now been largely abandoned, leaving most of these regions unresolved in newer genome assemblies generated primarily by next-generation sequencing hybrid approaches. Here we show that it is possible to resolve regions that are complex in a genome-wide context but simple in isolation for a fraction of the time and cost of traditional methods using long-read single molecule, real-time (SMRT) sequencing and assembly technology from Pacific Biosciences (PacBio). We sequenced and assembled BAC clones corresponding to a 1.3-Mbp complex region of chromosome 17q21.31, demonstrating 99.994% identity to Sanger assemblies of the same clones. We targeted 44 differences using Illumina sequencing and find that PacBio and Sanger assemblies share a comparable number of validated variants, albeit with different sequence context biases. Finally, we targeted a poorly assembled 766-kbp duplicated region of the chimpanzee genome and resolved the structure and organization for a fraction of the cost and time of traditional finishing approaches. Our data suggest a straightforward path for upgrading genomes to a higher quality finished state.

Source Title

Genome Research

Publisher

Cold Spring Harbor Laboratory Press

Keywords

Segmental duplication, Assembly, PacBio, Sanger, Capillary, Assembling complex genomic regions with long reads

Permalink

http://hdl.handle.net/11693/12706

Published Version (Please cite this version)

http://dx.doi.org/10.1101/gr.168450.113

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Article
