<?xml version='1.0' encoding='utf-8'?>
<eprints xmlns='http://eprints.org/ep2/data/2.0'>
  <eprint id='https://datacompass.lshtm.ac.uk/id/eprint/414'>
    <eprintid>414</eprintid>
    <rev_number>24</rev_number>
    <documents>
      <document id='https://datacompass.lshtm.ac.uk/id/document/10444'>
        <docid>10444</docid>
        <rev_number>5</rev_number>
        <files>
          <file id='https://datacompass.lshtm.ac.uk/id/file/48716'>
            <fileid>48716</fileid>
            <datasetid>document</datasetid>
            <objectid>10444</objectid>
            <filename>Coll2014_LinSpeSNPs_final.tab</filename>
            <mime_type>text/plain</mime_type>
            <hash>4e9ad8b00bec62faf1b915a3fdd680fe7d7a09f1f8d7bfcec4420fd2d5e5b8a9</hash>
            <hash_type>SHA256</hash_type>
            <filesize>500016</filesize>
            <mtime>2021-03-23 15:39:54</mtime>
            <url>https://datacompass.lshtm.ac.uk/id/eprint/414/1/Coll2014_LinSpeSNPs_final.tab</url>
          </file>
        </files>
        <eprintid>414</eprintid>
        <pos>1</pos>
        <placement>1</placement>
        <mime_type>text/plain</mime_type>
        <format>Text</format>
        <language>en</language>
        <security>public</security>
        <license>cc_by</license>
        <main>Coll2014_LinSpeSNPs_final.tab</main>
        <content>data</content>
        <retention_period>indefinite</retention_period>
        <formatdesc>Dataset</formatdesc>
      </document>
    </documents>
    <eprint_status>archive</eprint_status>
    <userid>3</userid>
    <dir>disk0/00/00/04/14</dir>
    <datestamp>2017-04-28 09:55:10</datestamp>
    <lastmod>2025-06-23 15:45:59</lastmod>
    <status_changed>2017-04-28 09:55:10</status_changed>
    <type>data_collection</type>
    <metadata_visibility>show</metadata_visibility>
    <creators>
      <item>
        <name>
          <family>Coll</family>
          <given>Francesc</given>
        </name>
        <orcid>0000-0002-7882-2325</orcid>
        <lshtm_flag>TRUE</lshtm_flag>
        <staffid>6c2b7e9afcb260630b636d81fd3bfaf0</staffid>
      </item>
      <item>
        <name>
          <family>McNerney</family>
          <given>Ruth</given>
        </name>
        <lshtm_flag>TRUE</lshtm_flag>
        <staffid>742e523ff9b133b222c41fb882843cdb</staffid>
      </item>
      <item>
        <name>
          <family>Guerra-Assunção</family>
          <given>José Afonso</given>
        </name>
        <lshtm_flag>FALSE</lshtm_flag>
      </item>
      <item>
        <name>
          <family>Glynn</family>
          <given>Judith R.</given>
        </name>
        <orcid>0000-0001-9325-4576</orcid>
        <lshtm_flag>TRUE</lshtm_flag>
        <staffid>780a51774911d12b019335185a7bde49</staffid>
      </item>
      <item>
        <name>
          <family>Perdigão</family>
          <given>João</given>
        </name>
        <lshtm_flag>FALSE</lshtm_flag>
      </item>
      <item>
        <name>
          <family>Viveiros</family>
          <given>Miguel</given>
        </name>
        <lshtm_flag>FALSE</lshtm_flag>
      </item>
      <item>
        <name>
          <family>Portugal</family>
          <given>Isabel</given>
        </name>
        <lshtm_flag>FALSE</lshtm_flag>
      </item>
      <item>
        <name>
          <family>Pain</family>
          <given>Arnab</given>
        </name>
        <lshtm_flag>FALSE</lshtm_flag>
      </item>
      <item>
        <name>
          <family>Martin</family>
          <given>Nigel</given>
        </name>
        <lshtm_flag>FALSE</lshtm_flag>
      </item>
      <item>
        <name>
          <family>Clark</family>
          <given>Taane G.</given>
        </name>
        <orcid>0000-0001-8985-9265</orcid>
        <lshtm_flag>TRUE</lshtm_flag>
        <staffid>d6c0a9ae313d576cb0129f8b148489e7</staffid>
      </item>
    </creators>
    <corp_creators>
      <item>Study consortium</item>
    </corp_creators>
    <title>Data for: &quot;A robust SNP barcode for typing Mycobacterium tuberculosis complex strains&quot;</title>
    <divisions>
      <item>EPID</item>
      <item>EPNC</item>
      <item>ITCR</item>
      <item>ITPM</item>
    </divisions>
    <keywords>
      <item>Bacterial genetics</item>
      <item>Genetic variation</item>
      <item>Phylogenomics</item>
    </keywords>
    <note>Dataset held for preservation purposes. The tab-delimited dataset may be obtained from http://pathogenseq.lshtm.ac.uk/downloads/Coll2014_LinSpeSNPs_final.csv.</note>
    <abstract>Strain-specific genomic diversity in the Mycobacterium tuberculosis complex (MTBC) is an important factor in pathogenesis that may affect virulence, transmissibility, host response and emergence of drug resistance. Several systems have been proposed to classify MTBC samples into distinct lineages and families, using molecular genotypes (e.g. spoligotyping or MIRU-VNTR), regions of difference (RD) and single nucleotide polymorphisms (SNPs). However, these systems lack accuracy or resolution due to the limited genetic variation studied or the small number of samples used in their construction. We identified ~90k SNPs across a global strain collection of 1,590 genomes. The SNP-based phylogeny was consistent with the gold-standard RD classification system and revealed fine-scale diversity. Of the ~7k strain-specific SNPs identified, a minimum set of 61 markers is proposed to discriminate known circulating strains accurately and robustly. This SNP barcode may be used to classify clinical isolates and evaluate tools to control TB, including therapeutics and vaccines whose effectiveness may vary by strain-type.</abstract>
    <date>2014</date>
    <date_type>published</date_type>
    <publisher>London School of Hygiene &amp; Tropical Medicine</publisher>
    <id_number>10.17037/DATA.00000414</id_number>
    <data_type>Dataset</data_type>
    <copyright_holders>
      <item>Study consortium</item>
    </copyright_holders>
    <collection_mode>
      <item>Experiment</item>
    </collection_mode>
    <full_text_status>public</full_text_status>
    <place_of_pub>London, United Kingdom</place_of_pub>
    <funders>
      <item>Bloomsbury Colleges PhD Studentship fund</item>
      <item>Fundação para a Ciência e Tecnologia Post-doctoral fellowship fund</item>
      <item>King Abdullah University of Science and Technology</item>
      <item>Wellcome Trust</item>
    </funders>
    <resourcetype>
      <resourcetypegeneral>Dataset</resourcetypegeneral>
      <resourcetype>Quantitative</resourcetype>
    </resourcetype>
    <language>
      <item>en</item>
    </language>
    <repo_link>
      <item>
        <title>A robust SNP barcode for typing Mycobacterium tuberculosis complex strains.</title>
        <link>http://researchonline.lshtm.ac.uk/id/eprint/1911930</link>
      </item>
    </repo_link>
    <funders_info>
      <item>
        <funder_name>Bloomsbury Colleges PhD Studentship fund</funder_name>
      </item>
      <item>
        <funder_name>Fundação para a Ciência e Tecnologia Post-doctoral fellowship fund</funder_name>
      </item>
      <item>
        <funder_name>King Abdullah University of Science and Technology</funder_name>
        <funder_id>http://dx.doi.org/10.13039/501100004052</funder_id>
      </item>
      <item>
        <funder_name>Wellcome Trust</funder_name>
        <funder_id>http://dx.doi.org/10.13039/100010269</funder_id>
      </item>
    </funders_info>
    <ukri_date_sub>2014</ukri_date_sub>
  </eprint>
</eprints>
