ERASE-TB: prediction models for Mtb infection dataset – Data Codebook

Persistent identifier

10.17037/DATA.00004571

Description

Data from the ERASE-TB study, used to develop prediction models for Mtb infection among adolescent and adult tuberculosis household contacts and evaluate the risk of Mtb infection by HIV status. This is a cross-sectional dataset, with one row per person, indicating the participant interferon gamma release assay status, demographics and clinical history, and data on the index person with tuberculosis in the household (index case) and household-level variables. All candidate predictors for prediction model development are included.

Data codebook

The following variables are contained in the ERASE Prediction Model dataset.

Variable name Variable label Answer label Answer code Variable Type
id Participant ID number     Numeric
Age Participant age Open ended   Numeric
Sex Participant sex     String
      Female  
      Male  
Education Highest education level of participant     String
      Illiterate/No education completed  
      Any primary school  
      Any secondary school  
      Any college/Vocational  
Smoking_Cat Smoking status category of participant     String
      Never smoked  
      Former Smoker  
      Current smoker  
HIV_Status HIV status of participant     String
      Negative  
      Newly positive  
      Known Positive  
HIV_ART For participants who are HIV positive, ART status     String
      On ART  
      Not on ART  
    Not applicable NA  
Previous_TB Whether the participant has ever had TB in the past     String
      No  
      Yes  
      Don't know/Refuse to answer  
IGRA_Result_Baseline IGRA result at baseline visit     String
      IFN negative  
      IFN positive  
      NA  
IGRA_Result_Overall IGRA result overall (i.e. using follow up results to infer baseline, if baseline missing, as per described methods)     String
      IFN negative  
      IFN positive  
Index_Xpert_Grade Semi-quantitative Xpert grade of TB index person with TB at the time of TB diagnosis     String
      Medium  
      High  
Index_Age_Cat Age of the index person with TB (category)     String
      18–25 years  
      26–35 years  
      >35 years  
Index_Sex Sex of the index person with TB     String
      Female  
      Male  
Index_Cough Whether the index person with TB reported cough     String
      No  
      Yes  
Index_SymptomDur_Days Duration of index case symptoms prior to TB diagnosis (in days)     Integer
    Not applicable NA  
Index_HIV_Status HIV and ART status of the TB index case     String
      Negative  
      Positive, on ART  
      Positive, not on ART  
      Unknown  
Index_Spouse Is this participant the spouse of the index TB case?     String
    Spouse No  
    Other relative Yes  
Index_Share_Meals Did this participant share meals with the TB index case?     String
      No  
      Yes  
Index_Share_Sleeping Did this participant sleep in the same room as the TB index case?     String
      No  
      Yes  
      Don't know/Refuse to answer  
Index_Share_Bed Did this participant sleep in the same bed as the TB index case?     String
      No  
      Yes  
      Don't know/Refuse to answer  
    Not applicable NA  
Index_Caring Did this participant have caring responsibilities for the TB index case?     String
      No  
      Yes  
      Don't know/Refuse to answer  
Index_Contact_Freq Frequency of contact between the participant and the index person with TB     String
      Daily  
      4-6 days per week  
      1-3 days per week  
      <1 day per week  
      Don't know/Refuse to answer  
Index_Contact_Dur Duration of contact between the participant and the index person with TB each day     String
      Just a short time (<1 hr per day)  
      Less than half of the day (1-6 hrs per day)  
      Most of the day (>6 hrs per day)  
      Don't know/Refuse to answer  
Proximity_Score Proximity score to index case (0-10) - composite variable based on the 10 questions adapted from the Mandalakas score     Integer
Hhold_CrowdingUN Was this participant living in a household that is considered crowded according to the uN definition (≥3 people per habitable room)     String
      No  
      Yes  
Hhold_Food_Insecurity Household considered food insecure. This was defined as answering yes to 'Have there been days with insufficient food in the last 6 months?'     String
      No  
      Yes  
      Don't know/Refuse to answer  
Hhold_ResidenceArea Area of household residence     String
      Rural  
      Peri-Urban  
      Urban  
      NA  
Hhold_Poverty Household income <1.90 United States Dollars per person per day     String
      <1.90 USD  
      ≥1.90 USD  
      NA  
Income_Bracket Household income in quintiles     String
      Lowest 20%  
      2nd lowest 20%  
      3rd lowest 20%  
      4th lowest 20%  
      Highest 20%  
    Not applicable NA  
Baseline_TB Whether participant was diagnosed with TB at the baseline study visit     String
      No  
      Yes  
Site Study site     String
      Mozambique  
      Tanzania  
      Zimbabwe