LSTM Home > LSTM Research > LSTM Online Archive

rPinecone: Define sub-lineages of a clonal expansion via a phylogenetic tree

Wailan, Alexander M., Coll, Francesca, Heinz, Eva, Tonkin-Hill, Gerry, Corander, Jukka, Feasey, Nicholas ORCID: and Thomson, Nicholas (2019) 'rPinecone: Define sub-lineages of a clonal expansion via a phylogenetic tree'. Microbial Genomics, Vol 5, Issue 4.

E Heinz rPinecone Define sub-lineages.pdf - Published Version
Available under License Creative Commons Attribution.

Download (3MB) | Preview


The ability to distinguish different circulating pathogen clones from each other is a fundamental requirement to understand the epidemiology of infectious diseases. Phylogenetic analysis of genomic data can provide a powerful platform to identify lineages within bacterial populations, and thus inform outbreak investigation and transmission dynamics. However, resolving differences between pathogens associated with low-variant (LV) populations carrying low median pairwise single nucleotide variant (SNV) distances remains a major challenge. Here we present rPinecone, an R package designed to define sub-lineages within closely related LV populations. rPinecone uses a root-to-tip directional approach to define sub-lineages within a phylogenetic tree according to SNV distance from the ancestral node. The utility of this software was demonstrated using both simulated outbreaks and real genomic data of two LV populations: a hospital outbreak of methicillin-resistant Staphylococcus aureus and endemic Salmonella Typhi from rural Cambodia. rPinecone identified the transmission branches of the hospital outbreak and geographically confined lineages in Cambodia. Sub-lineages identified by rPinecone in both analyses were phylogenetically robust. It is anticipated that rPinecone can be used to discriminate between lineages of bacteria from LV populations where other methods fail, enabling a deeper understanding of infectious disease epidemiology for public health purposes.

Item Type: Article
Subjects: W General Medicine. Health Professions > W 83 Telemedicine (General)
W General Medicine. Health Professions > W 26.5 Informatics. Health informatics
QU Biochemistry > Genetics > QU 460 Genomics. Proteomics
Faculty: Department: Biological Sciences > Vector Biology Department
Digital Object Identifer (DOI):
Depositing User: Stacy Murtagh
Date Deposited: 10 Apr 2019 15:04
Last Modified: 31 May 2019 14:44


View details

Actions (login required)

Edit Item Edit Item