10.17037/DATA.00004571
Data from the ERASE-TB study, used to develop prediction models for Mtb infection among adolescent and adult tuberculosis household contacts and evaluate the risk of Mtb infection by HIV status. This is a cross-sectional dataset, with one row per person, indicating the participant interferon gamma release assay status, demographics and clinical history, and data on the index person with tuberculosis in the household (index case) and household-level variables. All candidate predictors for prediction model development are included.
The following variables are contained in the ERASE Prediction Model dataset.
Variable name | Variable label | Answer label | Answer code | Variable Type |
id | Participant ID number | Numeric | ||
Age | Participant age | Open ended | Numeric | |
Sex | Participant sex | String | ||
Female | ||||
Male | ||||
Education | Highest education level of participant | String | ||
Illiterate/No education completed | ||||
Any primary school | ||||
Any secondary school | ||||
Any college/Vocational | ||||
Smoking_Cat | Smoking status category of participant | String | ||
Never smoked | ||||
Former Smoker | ||||
Current smoker | ||||
HIV_Status | HIV status of participant | String | ||
Negative | ||||
Newly positive | ||||
Known Positive | ||||
HIV_ART | For participants who are HIV positive, ART status | String | ||
On ART | ||||
Not on ART | ||||
Not applicable | NA | |||
Previous_TB | Whether the participant has ever had TB in the past | String | ||
No | ||||
Yes | ||||
Don't know/Refuse to answer | ||||
IGRA_Result_Baseline | IGRA result at baseline visit | String | ||
IFN negative | ||||
IFN positive | ||||
NA | ||||
IGRA_Result_Overall | IGRA result overall (i.e. using follow up results to infer baseline, if baseline missing, as per described methods) | String | ||
IFN negative | ||||
IFN positive | ||||
Index_Xpert_Grade | Semi-quantitative Xpert grade of TB index person with TB at the time of TB diagnosis | String | ||
Medium | ||||
High | ||||
Index_Age_Cat | Age of the index person with TB (category) | String | ||
18–25 years | ||||
26–35 years | ||||
>35 years | ||||
Index_Sex | Sex of the index person with TB | String | ||
Female | ||||
Male | ||||
Index_Cough | Whether the index person with TB reported cough | String | ||
No | ||||
Yes | ||||
Index_SymptomDur_Days | Duration of index case symptoms prior to TB diagnosis (in days) | Integer | ||
Not applicable | NA | |||
Index_HIV_Status | HIV and ART status of the TB index case | String | ||
Negative | ||||
Positive, on ART | ||||
Positive, not on ART | ||||
Unknown | ||||
Index_Spouse | Is this participant the spouse of the index TB case? | String | ||
Spouse | No | |||
Other relative | Yes | |||
Index_Share_Meals | Did this participant share meals with the TB index case? | String | ||
No | ||||
Yes | ||||
Index_Share_Sleeping | Did this participant sleep in the same room as the TB index case? | String | ||
No | ||||
Yes | ||||
Don't know/Refuse to answer | ||||
Index_Share_Bed | Did this participant sleep in the same bed as the TB index case? | String | ||
No | ||||
Yes | ||||
Don't know/Refuse to answer | ||||
Not applicable | NA | |||
Index_Caring | Did this participant have caring responsibilities for the TB index case? | String | ||
No | ||||
Yes | ||||
Don't know/Refuse to answer | ||||
Index_Contact_Freq | Frequency of contact between the participant and the index person with TB | String | ||
Daily | ||||
4-6 days per week | ||||
1-3 days per week | ||||
<1 day per week | ||||
Don't know/Refuse to answer | ||||
Index_Contact_Dur | Duration of contact between the participant and the index person with TB each day | String | ||
Just a short time (<1 hr per day) | ||||
Less than half of the day (1-6 hrs per day) | ||||
Most of the day (>6 hrs per day) | ||||
Don't know/Refuse to answer | ||||
Proximity_Score | Proximity score to index case (0-10) - composite variable based on the 10 questions adapted from the Mandalakas score | Integer | ||
Hhold_CrowdingUN | Was this participant living in a household that is considered crowded according to the uN definition (≥3 people per habitable room) | String | ||
No | ||||
Yes | ||||
Hhold_Food_Insecurity | Household considered food insecure. This was defined as answering yes to 'Have there been days with insufficient food in the last 6 months?' | String | ||
No | ||||
Yes | ||||
Don't know/Refuse to answer | ||||
Hhold_ResidenceArea | Area of household residence | String | ||
Rural | ||||
Peri-Urban | ||||
Urban | ||||
NA | ||||
Hhold_Poverty | Household income <1.90 United States Dollars per person per day | String | ||
<1.90 USD | ||||
≥1.90 USD | ||||
NA | ||||
Income_Bracket | Household income in quintiles | String | ||
Lowest 20% | ||||
2nd lowest 20% | ||||
3rd lowest 20% | ||||
4th lowest 20% | ||||
Highest 20% | ||||
Not applicable | NA | |||
Baseline_TB | Whether participant was diagnosed with TB at the baseline study visit | String | ||
No | ||||
Yes | ||||
Site | Study site | String | ||
Mozambique | ||||
Tanzania | ||||
Zimbabwe |