10.17037/DATA.00001305
Three annual community-based cross-sectional surveys were conducted at baseline and following 12 and 24 months as part of a cluster-randomised, controlled trial - the TUMIKIA Trial - comparing annual or biannual community-wide treatment targeting all ages with annual school-based treatment targeting 2–14 year olds.
Household demographics, socioeconomic status, and water and sanitation conditions were collected through structured questionnaires. Structured observations were made of household sanitation facilities. Individuals aged two years and above were randomly selected during households surveys and requested to provide stool samples, which were assessed for presence and intensity of STH infection using the Kato-Katz thick smear method. Information on recent deworming, sanitation and hygiene behaviours, and shoe-wearing was also collected from individuals who provided stool samples.
This record contains the quantitative dataset and associated codebook.
| Variable Name | Variable Label | Answer Label | Answer Code | Variable Type |
| cu_id | Cluster ID | Numeric | ||
| village_id | Village ID | Numeric | ||
| corr_hh_id | Household ID | Numeric | ||
| corr_uniqid | Individual ID | Numeric | ||
| hk_inf | Hookworm infection status | Numeric | ||
| N/A | -9/. | |||
| Yes | 1 | |||
| No | 0 | |||
| as_inf | Ascaris lumbricoides infection status | Numeric | ||
| N/A | -9/. | |||
| Yes | 1 | |||
| No | 0 | |||
| tr_inf | ||||
| Trichuris trichiura infection status | Numeric | |||
| N/A | -9/. | |||
| Yes | 1 | |||
| No | 0 | |||
| hk_epg | Hookworm eggs per gram | Open ended | Numeric | |
| tr_epg | Trichuris trichiura eggs per gram | Numeric | ||
| as_epg | Ascaris lumbricoides eggs per gram | Numeric | ||
| hk_eggs | Hookworm total eggs on slides read | Numeric | ||
| tr_eggs | Trichuris trichiura total eggs on slides read | Numeric | ||
| as_eggs | Ascaris lumbricoides total eggs on slides read | Numeric | ||
| hk_grams | Grams of stool (1/12 if two slides, 1/24 if one slide) | Numeric | ||
| tr_grams | Grams of stool (1/12 if two slides, 1/24 if one slide) | Numeric | ||
| as_grams | Grams of stool (1/12 if two slides, 1/24 if one slide) | Numeric | ||
| hk_inten_cat | Hookworm intensity category based on WHO | Numeric | ||
| hk_inf==0 | No infection | 0 | ||
| hk_inten>=1 & hk_inten<2000 | Light | 1 | ||
| hk_inten>=2000 & hk_inten<4000 | Mod | 2 | ||
| hk_inten>=4000 & hk_inten!=. | Heavy | 3 | ||
| hk_modhi | Hookworm intensity recode | Numeric | ||
| hk_inten_cat==1 | light | 0 | ||
| hk_inten_cat==2|hk_inten_cat==3 | mod/heavy | 1 | ||
| tr_inten_cat | Trichuris trichiura intensity category based on WHO | Numeric | ||
| tr_inf==0 | No infection | 0 | ||
| tr_inten>=1 & tr_inten<1000 | Light | 1 | ||
| tr_inten>=1000 & tr_inten<10000 | Mod | 2 | ||
| tr_inten>=10000 & tr_inten!=. | Heavy | 3 | ||
| tr_modhi | Trichuris trichiura intensity recode | Numeric | ||
| tr_inten_cat==1 | Light | 0 | ||
| tr_inten_cat==2|tr_inten_cat==3 | Mod/Heavy | 1 | ||
| as_inten_cat | Ascaris lumbricoides intensity category based on WHO | Numeric | ||
| as_inf==0 | No infection | 0 | ||
| as_inten>=1 & as_inten<5000 | Light | 1 | ||
| as_inten>=5000 & as_inten<50000 | Mod | 2 | ||
| as_inten>=50000 & as_inten!=. | Heavy | 3 | ||
| as_modhi | Ascaris lumbricoides intensity recode | Numeric | ||
| as_inten_cat==1 | Light | 0 | ||
| as_inten_cat==2|as_inten_cat==3 | Mod/Heavy | 1 | ||
| any_sth | Any STH infection of any intensity | Numeric | ||
| Not applicable | -9/. | |||
| hk_inf==0 & tr_inf==0 & as_inf==0 | hk_inf==0 & tr_inf==. & as_inf==0 | hk_inf==0 & tr_inf==0 & as_inf==. | hk_inf==0 & tr_inf==. & as_inf==. | No | 0 | ||
|
hk_inf==1 & tr_inf==1 & as_inf==1 | hk_inf==1 & tr_inf==. & as_inf==. | hk_inf==1 & tr_inf==. & as_inf==0 | hk_inf==0 & tr_inf==0 & as_inf==. /// hk_inf==1 & tr_inf==1 & as_inf==0 | hk_inf==1 & tr_inf==1 & as_inf==. | hk_inf==1 & tr_inf==0 & as_inf==1 | hk_inf==1 & tr_inf==. & as_inf==1 /// hk_inf==1 & tr_inf==0 & as_inf==0 | hk_inf==1 & tr_inf==. & as_inf==. | hk_inf==1 & tr_inf==. & as_inf==0 | hk_inf==0 & tr_inf==0 & as_inf==. /// hk_inf==0 & tr_inf==1 & as_inf==1 | hk_inf==0 & tr_inf==1 & as_inf==0 | hk_inf==0 & tr_inf==1 & as_inf==. /// hk_inf==0 & tr_inf==0 & as_inf==1 | hk_inf==0 & tr_inf==. & as_inf==1 |
Yes | 1 | Numeric | |
| any_modhi | Any STH infection of moderate or high intensity | |||
| Not applicable | -9/. | |||
|
as_modhi==0 & tr_modhi==0 & hk_modhi==0 /// (as_modhi==. | tr_modhi==.) & hk_modhi==0 |
No | 0 | ||
| as_modhi==1 | tr_modhi==1 | hk_modhi==1 | Yes | 1 | ||
| mixed_inf | Any mixed STH infection | Numeric | ||
| No | 0 | |||
| hk_inf==1 & tr_inf==1 | hk_inf==1 & as_inf==1 | as_inf==1 & tr_inf==1 | Yes | 1 | ||
| hk_vlight | Any hookworm infection of very light intensity | Numeric | ||
| hk_inten>=1000 | No | 0 | ||
| hk_inten<1000 & hk_inten!=0 | Yes | 1 | ||
| hk_light | Any hookworm infection of light intensity | Numeric | ||
| hk_inten>=2000 | No | 0 | ||
| hk_inten<2000 & hk_inten!=0 | Yes | 1 | ||
| age | Participant age in years | Numeric | ||
| sex | Participant gender | Numeric | ||
| Male | 1 | |||
| Female | 2 | |||
| agecat | Participant age category | Numeric | ||
| <5 years | 1 | |||
| 5-14 years | 2 | |||
| 15+ | 3 | |||
| school_attend | Baseline only - Attend school? | Numeric | ||
| No | 0 | |||
| Yes | 1 | |||
| school_level | Baseline only - Type of school | Numeric | ||
| Primary | 1 | |||
| Secondary | 2 | |||
| bicycle | Own bicycle | Numeric | ||
| No | 0 | |||
| Yes | 1 | |||
| electricity | Has electricity | Numeric | ||
| No | 0 | |||
| Yes | 1 | |||
| nbr_residents | ||||
| hh_san_access | Household sanitation access (any toilet facility) (Binary variable generated from JMP S1. Sanitation facility ("What kind of toilet facility do members of this household usually use"?) | Numeric | ||
| No | 0 | |||
| Yes | 1 | |||
| hh_water_le30 | Household water access <=30 min (Binary variable generated from JMP W4.Time to collect water ("How long does it take to go there, get water, and come back?). | Numeric | ||
| No | 0 | |||
| Yes | 1 | |||
| hh_san_washslab | Does the latrine have a washable slab? | Numeric | ||
| No | 0 | |||
| Yes | 1 | |||
| hh_hwfac_access | Designated hand-washing facilities available | Numeric | ||
| Yes (tile or cement) | 1 | |||
| No (natural materials eg timber) | 2 | |||
| tx | Have you/they received treatment for worms in the last year? | Numeric | ||
| No | 0 | |||
| Yes | 1 | |||
| tx_where | Where did you/they receive treatment? | Numeric | ||
| School | 1 | |||
| Health Centre | 2 | |||
| Home | 3 | |||
| Community Programme | 4 | |||
| Shop | 5 | |||
| Workplace | 6 | |||
| shoe | Observe: What sort of shoes are they wearing? | Numeric | ||
| No shoes | 0 | |||
| Shoes | 1 | |||
| water_improved | Improved water source | Numeric | ||
| Non-improved | 0 | |||
| Improved | 1 | |||
| floor_bincat | Flooring material | Numeric | ||
| Flooring material | 1 | |||
| Earth / sand | 2 | |||
| remote | Remoteness of household based on GPS | Numeric | ||
| <4km majroad | 0 | |||
| >4km majroad | 1 | |||
| hh_san_access_shared | Household sanitation access and shared | Numeric | ||
| No reported access | 0 | |||
| Reported shared access | 1 | |||
| Reported private access | 2 | |||
| urban2015 | Household urbanicity category (2015 population estimate obtained from WorldPop [www.worldpop.org]) | Numeric | ||
| Urban | 2 | |||
| Periurban | 1 | |||
| Rural | 0 | |||
| ai | Aridity Index (<0.03 hyper arid; 0.03-0.2 arid; 0.2-0.5 arid; 0.5-0.65 dry sub-humid; >0.65 humid) | Numeric | ||
| dem30 | Elevation in meters | Numeric | ||
| sand | Soil sand content | Numeric | ||
| ph_kcl | Soil pH | Numeric | ||
| evi_2014_ann | Enhanced vegetation index | Numeric | ||
| distmajorroads | Distance from household to major road (meters) | Numeric | ||
| sand_cat | Tertiles of soil sand percentage (sand) | Numeric | ||
| Low | 0 | |||
| Medium | 1 | |||
| High | 2 | |||
| EVI_cat | Tertiles of 2014 enhanced vegetation index annual measure (evi_2014_ann) | Numeric | ||
| Low | 0 | |||
| Medium | 1 | |||
| High | 2 | |||
| ph_kcl_cat | Tertiles of soil pH (ph_kcl) | Numeric | ||
| Low | 0 | |||
| Medium | 1 | |||
| High | 2 | |||
| dem_cat | Tertiles of digital elevation model (dem30, elevation in meters) | Numeric | ||
| Low | 0 | |||
| Medium | 1 | |||
| High | 2 | |||
| road_cat | Tertiles of road access (distmajorroads) | Numeric | ||
| Low | 0 | |||
| Medium | 1 | |||
| High | 2 | |||
| arid_cat | Categorization of aridity measure | Numeric | ||
| >0.2 - <=0.5 | Semi-arid | 1 | ||
| >0.5 - <=0.65 | Dry sub-humid | 2 | ||
| >0.65 | Humid | 3 | ||
| SES_score | Household socioeconomic status score based on factor analysis | Numeric | ||
| SES_quint | 5 quantiles of SES_score | Numeric | ||
| ses_3 | Recode of SES_quint | Numeric | ||
| SES_quint=1 | Poorest | 1 | ||
| SES_quint=2,3,4 | Middle | 2 | ||
| SES_quint=5 | Least Poor | 3 | ||
| survey | Survey type | Numeric | ||
| Baseline | 1 | |||
| Midline | 2 | |||
| Endline | 3 | |||
| hh_status | What is the status of the household you are visiting? | Numeric | ||
| Household located | 1 | |||
| Cannot identify/locate household | 2 | |||
| Not a household | 3 | |||
| Household moved away | 4 | |||
| Household absent for extended period of time | 5 | |||
| Household is listed twice on sampling list | 6 | |||
| Other | 88 | |||
| consent | Was the study explained using the information sheet and written consent to continue? | Numeric | ||
| Yes | 1 | |||
| No, refused | 2 | |||
| No, suitable member not present | 3 | |||
| attend_school | Does participant attend school? | Numeric | ||
| Not in education | 1 | |||
| ECD | 2 | |||
| Primary school | 3 | |||
| Secondary school | 4 | |||
| College | 5 | |||
| Adult education | 6 | |||
| nbr_n | Number of individuals in household | Numeric | ||
| nbr_m | Number of male individuals in household | Numeric | ||
| nbr_f | Number of female individuals in household | Numeric | ||
| nbr_agecat1 | Number of pre-school age children in household | Numeric | ||
| nbr_agecat2 | Number of school age children in household | Numeric | ||
| nbr_agecat3 | Number of adults in household | Numeric | ||
| nbr_agecat1_m | Number of male pre-school age children in household | Numeric | ||
| nbr_agecat1_f | Number of female pre-school age children in household | Numeric | ||
| nbr_agecat2_m | Number of male school age children in household | Numeric | ||
| nbr_agecat2_f | Number of female school age children in household | Numeric | ||
| nbr_agecat3_m | Number of male adults (>=15) in household | Numeric | ||
| nbr_agecat3_f | Number of female adults (>=15) in household | Numeric | ||
| hk_cluster_prev_baseline | Cluster-level prevalence of hookworm infection at baseline | Numeric | ||
| tr_cluster_prev_baseline | Cluster-level prevalence of Trichuris trichiura at baseline | Numeric | ||
| as_cluster_prev_baseline | Cluster-level prevalence of Ascaris lumbricoides at baseline | Numeric | ||
| hk_cluster_epg_baseline | Cluster-level hookworm eggs per gram (EPG) at baseline | Numeric | ||
| tr_cluster_epg_baseline | Cluster-level Trichuris trichiura eggs per gram (EPG) at baseline | Numeric | ||
| as_cluster_epg_baseline | Cluster-level Ascaris lumbricoides eggs per gram (EPG) at baseline | Numeric | ||
| SES_score_basemean | Cluster-level mean average socioeconomic status (SES) at baseline | Numeric | ||
| urban2015_basemode | Cluster-level modal average urbanicity category at baseline | Numeric | ||
| Urban | 0 | |||
| Periurban | 1 | |||
| Rural | 2 | |||
| water_improved_basemean | Cluster-level mean average access to improved water source at baseline | Numeric | ||
| hh_san_access_shared_basemode | Cluster-level modal average access to sanitation at baseline | Numeric | ||
| None | 0 | |||
| Shared access | 1 | |||
| Private access | 2 | |||
| arid_cat_basemode | Cluster-level modal average aridity category (Consortium for Spatial Information, [http://www.cgiar-csi.org/.]) | Numeric | ||
| Semi-arid | 1 | |||
| Dry sub-humid | 2 | |||
| Humid | 3 | |||
| cu_hardtoreach | Hard to reach cluster (>75% of households <4 km from major road) | Numeric | ||
| Not hard to reach | 0 | |||
| Hard to reach | 1 | |||
| cu_size_rand | Cluster size | Numeric | ||
| <841 hhs | 0 | |||
| >841 hhs | 1 | |||
| cu_subco_rand | Subcounty ID | 1 to 4 | Numeric | |
| study_arm | Study arm | Numeric | ||
| Control | 1 | |||
| Annual | 2 | |||
| Biannual | 3 |