Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes

De Maio, Nicola, Shaw, Liam P., Hubbard, Alasdair ORCID: https://orcid.org/0000-0001-6668-9179, George, Sophie, Sanderson, Nick, Swann, Jeremy, Wick, Ryan, AbuOun, Manal, Stubberfield, Emma, Hoosdally, Sarah J., Crook, Derrick W., Peto, Timothy E. A., Sheppard, Anna E., Bailey, Mark J., Read, Daniel S., Anjum, Muna F., Walker, A. Sarah and Stoesser, Nicole (2019) 'Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes'. Microbial Genomics, Vol September.

Files:
hubbard.docx - Accepted Version (139kB)
Supplementary-Figures.docx - Accepted Version (1MB)
mgen000294.pdf - Published Version (886kB)
All files available under License Creative Commons Attribution.

Abstract

Illumina sequencing allows rapid, cheap and accurate whole genome bacterial analyses, but short reads (<300 bp) do not usually enable complete genome assembly. Long-read sequencing greatly assists with resolving complex bacterial genomes, particularly when combined with short-read Illumina data (hybrid assembly). However, it is not clear how different long-read sequencing methods impact on assembly accuracy. Relative automation of the assembly process is also crucial to facilitating high-throughput complete bacterial genome reconstruction, avoiding multiple bespoke filtering and data manipulation steps. In this study, we compared hybrid assemblies for 20 bacterial isolates, including two reference strains, using Illumina sequencing and long reads from either Oxford Nanopore Technologies (ONT) or from SMRT Pacific Biosciences (PacBio) sequencing platforms. We chose isolates from the Enterobacteriaceae family, as these frequently have highly plastic, repetitive genetic structures and complete genome reconstruction for these species is relevant for a precise understanding of the epidemiology of antimicrobial resistance. We de novo assembled genomes using the hybrid assembler Unicycler and compared different read processing strategies, as well as comparing to long-read only assembly with Flye followed by short-read polishing with Pilon. Hybrid assembly with either PacBio or ONT reads facilitated high-quality genome reconstruction, and was superior to the long-read assembly and polishing approach evaluated with respect to accuracy and completeness. Combining ONT and Illumina reads fully resolved most genomes without additional manual steps, and at a lower consumables cost per isolate in our setting. Automated hybrid assembly is a powerful tool for complete and accurate bacterial genome assembly.

Item Type: Article
Subjects: QU Biochemistry > Genetics > QU 460 Genomics. Proteomics
QU Biochemistry > Genetics > QU 550 Genetic techniques. PCR. Chromosome mapping
Faculty: Department: Biological Sciences > Department of Tropical Disease Biology
Digital Object Identifier (DOI): https://doi.org/10.1099/mgen.0.000294
Depositing User: Cathy Waldron
Date Deposited: 16 Sep 2019 09:35
Last Modified: 16 Sep 2019 09:35
URI: https://archive.lstmed.ac.uk/id/eprint/11998
