Casual and close contact data for buildings in South Africa and Zambia - User Guide

Permanent Identifier

Data Creators

Richard White & Peter Dodd

Data Description

The dataset contains data on buildings visited during a 24 hour period by 3211 adults from Zambia and Western Cape South Africa. Data includes basic demographic information, building function, visit duration, and number of adults/youths and children (5-12 years) present. Data were collected using a retrospective interviewer-administered questionnaire.

Data Collection Methods

The sampling frame for this study was adults (≥18 years) enrolled in the ZAMSTAR [18] final TB prevalence survey carried out in 2010 in 16 communities in Zambia and 8 communities in the Western Cape, South Africa. The 2010 TB prevalence survey recruited between 4000 and 5000 individuals per community by visiting all households in randomly selected standard enumeration areas (SEAs). This study consisted of a subsequent cross-sectional face-to-face interview survey of TB prevalence survey enrolees that took place in February and March 2011 in Zambia, and in May and July 2011 in Western Cape. Four SEAs from each ZAMSTAR community were randomly selected proportional to size, and within each SEA ten individuals were randomly selected from four age and gender strata: men aged 18-29 years, men aged ≥30 years, women aged 18-29 years, and women aged ≥30 years (160 per community). Individuals were not eligible if they had not spent the previous night in the SEA or did not provide informed consent. If an individual was ineligible or was not found after two visits, another individual was randomly selected from the same stratum in that SEA. Recruitment was planned to continue until 10 individuals per SEA were selected within each stratum.

Interviews were carried out in participants’ homes by trained field staff using a standardized questionnaire that was piloted in Zambia, following a qualitative survey in Zambia that rapidly gathered data on places of significance to TB transmission, children’s space and popular knowledge of TB transmission. Interviewees were asked to list buildings (other than their home) that they had entered the day before the interview (from midnight to midnight). Buildings were considered to be enclosed areas with walls and a roof, excluding transport. For each building they listed, they were asked:

Key dates

Data collected between February - July 2011.


Human population


Dataset has been anonymised prior to deposit


Ethical approval was awarded by the LSHTM Ethics Committee.


Social contacts, Tuberculosis, Infectious disease transmission, South Africa, Zambia

Language of written material


Associated Roles

Role Forename Surname Faculty / Dept Institution
Data Creator Richard White Epidemiology & Population Health > Infectious Disease Epidemiology London School of Hygiene & Tropical Medicine
Data Creator Peter Dodd School of Health and Related Research > Health Economics and Decision Science University of Sheffield

File Description

Title Filename File type Description
Buildings dataset buildings_dataset.csv Comma Separated Values Dataset of reported building visits
Individuals dataset individuals_dataset.csv   Comma Separated Values Dataset of individual demographic information
Dataset key Codebook.html Comma Separated Values Description of key variables