Data for: "A robust SNP barcode for typing Mycobacterium tuberculosis complex strains"

Coll, F, Mcnerney, R, Guerra-Assunção, JA, Glynn, JR, Perdigão, J, Viveiros, M, Portugal, I, Pain, A, Martin, N and Clark, TG (2014). Data for: "A robust SNP barcode for typing Mycobacterium tuberculosis complex strains". [Dataset]. London School of Hygiene & Tropical Medicine, London, United Kingdom. 10.17037/DATA.00000414.

Strain-specific genomic diversity in the Mycobacterium tuberculosis complex (MTBC) is an important factor in pathogenesis that may affect virulence, transmissibility, host response and emergence of drug resistance. Several systems have been proposed to classify MTBC samples into distinct lineages and families, using molecular genotypes (e.g. spoligotyping or MIRU-VNTR), regions of difference (RD) and single nucleotide polymorphisms (SNPs). However, these systems lack accuracy or resolution because they examine limited genetic variation or were built from a small number of samples. We identified ~90k SNPs across a global strain collection of 1,590 genomes. The SNP-based phylogeny was consistent with the gold-standard RD classification system and revealed fine-scale diversity. Of the ~7k strain-specific SNPs identified, a minimal set of 61 markers is proposed to discriminate known circulating strains accurately and robustly. This SNP barcode may be used to classify clinical isolates and to evaluate tools to control TB, including therapeutics and vaccines whose effectiveness may vary by strain type.

Additional Information

Dataset held for preservation purposes. The tab-delimited dataset may be obtained from http://pathogenseq.lshtm.ac.uk/downloads/Coll2014_LinSpeSNPs_final.csv.
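
As a rough illustration of how a tab-delimited table of lineage-specific SNPs could be used to type an isolate, the sketch below reads marker rows and reports every lineage whose marker allele matches the sample. The column names (lineage, position, allele), the example rows and the helper names load_barcode and classify are all hypothetical and are not taken from the dataset; check the actual file header before adapting it.

```python
import csv
import io

# Hypothetical stand-in for the real file: the column names and rows below
# are invented for illustration and are NOT taken from the dataset.
EXAMPLE_TAB = (
    "lineage\tposition\tallele\n"
    "lineageA\t1000\tA\n"
    "lineageB\t2000\tT\n"
    "lineageB.1\t3000\tG\n"
)


def load_barcode(handle):
    """Read tab-delimited marker rows into {position: (allele, lineage)}."""
    reader = csv.DictReader(handle, delimiter="\t")
    return {
        int(row["position"]): (row["allele"], row["lineage"])
        for row in reader
    }


def classify(sample_calls, barcode):
    """Return the sorted lineage labels whose marker allele matches the sample.

    sample_calls maps a genome position to the base called at that position.
    """
    hits = set()
    for position, (allele, lineage) in barcode.items():
        if sample_calls.get(position) == allele:
            hits.add(lineage)
    return sorted(hits)


barcode = load_barcode(io.StringIO(EXAMPLE_TAB))
sample = {1000: "G", 2000: "T", 3000: "G"}  # invented base calls
print(classify(sample, barcode))
```

To try this on the downloaded file, pass an open file handle to load_barcode in place of the in-memory example, after renaming the columns to match the real header.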

Keywords

Bacterial genetics, Genetic variation, Phylogenomics

Coll2014_LinSpeSNPs_final.tab
Content: Data
Type: Dataset
Format: text/plain
Size: 500kB
Available under Creative Commons: Attribution 3.0
