Codebook for dataset located at https:doi.org/10.17037/DATA.00000781
Variable | Data type | derived variable? | No of observations | Description | Variable contains blank/missing values | Reason for missing values | Value coding |
age_mcv | ordered categorical | yes | 269 | age category on day of blood sample analysed - used for derived variable of microcytic | No | Not applicable | 0 = "0-2yr" 1 = "2-5yr" 2 = "6-10" 3 = "11-14yr" 4 = "15-74 yr" |
agehplc | continuous | no | 232 | age on date of hplc assessment of HbF | Yes | Only available for those who had HbF measured by HPLC |   |
agetod_integer | integer | no | 269 | age on date of blood sample analysed in whole years | No | Not applicable |   |
agp_high | binary | yes | 266 | AGP >= 120 mg\dL | Yes | not able to be assessed in all samples | 0=<120 mg\dL
1=>=120mg\dL |
agp_mgdl | continuous | no | 266 | Serum AGP in mg\dL | Yes | not able to be assessed in all samples |   |
athal | ordered categorical | no | 238 | Alpha-Thalassemia genotype - no of copies of 3.7 deletion - | Yes | not available for all participants | 0 ="Wild Type" 1=" -/a a/a" 2= "-/a -/a" |
bilirubin_lysis | continuous | no | 248 | Indirect bilirubin (umol/L) | Yes | not able to be assessed in all samples |   |
bilirubind | continuous | no | 258 | Direct bilirubin (umol/L) | Yes | not able to be assessed in all samples |   |
bilirubint | continuous | no | 258 | Total bilirubin (umol/L) | Yes | not able to be assessed in all samples |   |
crp_high | binary | yes | 266 | CRP >= 5mg/L | Yes | not able to be assessed in all samples | 0=<5mg/L 1=>=5mg/L |
crp_mgL | continuous | no | 266 | Serum crp in mg/L | Yes | not able to be assessed in all samples |   |
datetod | date | no | 269 | date of sample analysed and of complete blood count data - including hgb, wbc, mcv mchc mch, reticulocyte where available and 02 saturation | no |   |   |
diffdays | integer | yes | 107 | Number of days between nearest recorded admission to Muhimbili National Hospital and sample date | yes | not all children had admissions |   |
epo | continuous | no | 241 | Serum erythriopoietin (uIU/mL) | yes | not able to be assessed in all samples |   |
epo_ln | continuous | yes | 241 | Natural log of erythropoeitin continuos variable | yes | not able to be assessed in all samples |   |
ferritin_ln | continuous | yes | 266 | Natural log of ferritin variable | yes |   |   |
ferritin_ugL | continuous | no | 266 | Serum ferritin in ug/L | yes | not able to be assessed in all samples |   |
g6pd | ordered categorical | no | 216 | G6PD status based on A- genotype [-202 & 376 SNPS} | yes | not available in all children | 0 = "Normal" 1 = "Homozygous Female" 2 = "Homozygous Male or Female" |
hbf_final | continuous | yes | 228 | Fetal haemoglobin (%), excluding those who had s (HbF at < age 5 years & hepcidin sample >age 5 years) |
yes | not available in all children or meeting criteria |   |
hep | continuous | no | 255 | Serum hepcidin (ng/mL) | yes | not able to be assessed in all samples |   |
hep_eii | binary | yes | 255 | hepcidin < 5.5 ng/ml | yes | not able to be assessed in all samples | 1 if <5.5 ng/mL "0" if >5.5 ng/mL |
hep_ln | continuous | yes | 255 | Natural log of hepcidin continuous variable | yes | not able to be assessed in all samples |   |
hgb | continuous | no | 264 | Haemoglobin on day of sample | yes | missing from source database |   |
hgbssv | continuous | yes | 269 | Averaged Haemoglobin (g\dL) at steady state routine clinic visits | no | Not applicable | |
hgbssv_OR | binary | yes | 269 | Categorical variable used in logistic regression highest or lowest quartile of averaged steady -state haemoglobin |
no | See associated publication Lee et al for more details | 0=highest quartile 1=lowest quartile |
hyperfe | binary | yes | 269 | Hyperferremia (>70 ug/L) if inflammation is normal ( if CRP<5g/L and AGP <120 mg\dL) | yes | not able to be assessed in all samples | 0=normal or low 1=high ferritin |
hypofe | binary | yes | 266 | Hypoferremia < 30g/L | yes | not able to be assessed in all samples | 0 =Ferritin>=30g/L 1 =Ferritin< 30g/L |
id | integer | no | 269 | unique participant identifier - linked through separate linkage file to demographic_id, the unique study ID of all participants in Muhimbili Sickle Cohort and associated databases | no |   |   |
inflammation | binary | yes | 266 | presence of inflammation: CRP>5g/L or AGP>120 mg\dL | yes | not able to be assessed in all samples | 0=CRP<5g/L & AGP<120mg\dL 1=CRP>=5mg/L OR AGP>=120mg\dL |
iron_umoll | continuous | no | 264 | serum iron in ummol/L | yes | not able to be assessed in all samples |   |
ldh | continuous | no | 190 | Lactate Dehydrogenase (IU/L) | yes | not available for all participants |   |
mch | continuous | no | 264 | Mean cell haemoglobin concentration (g\dL) | yes | missing from source database |   |
mchc | continuous | no | 264 | Mean cell haemoglobin concentration (g\dL) | yes | missing from source database |   |
mcv | continuous | yes | 260 | Mean cell volume (fL/cell) | yes | missing from source database |   |
microcytic | binary | yes | 264 | microcytic Yes/No with age-specific MCV cut-offs  Cut-offs for MCV (fL) were MCV <75 if age 1&ndash3 years,, MCV b<79 if aged 3&ndash 5 years, MCV <80 if age 6&ndash11 years, MCV < 82 if age 11-14 years, MCV <85 if aged 15&ndash74 years |
yes | source data missing from source database | 0= normal 1=microcytic |
reticulocyte | continuous | no | 151 | Reticuloctye percentage on day of smple analysis | yes | missing from source database (technique not performed on all samples at all times) |   |
sat | continuous | no | 217 | Oxygen saturation (Room Air) pulse oximetry on day of sample analysed | yes | missing from source database |   |
sex | binary |   | 269 | Sex of participant | no | Not applicable | 0= Female 1=Male |
stfr_F_def | binary | yes | 259 |   possible Iron deficient by sTfR-Ferritin index [Phiri et al 2009 J Clinical Pathology] |
yes | not able to be assessed in all samples | 0 = "sTfR-F index <5.6" 1 = "sTfR-F index => 5.6" |
stfr_F_index | continuous | yes | 259 | Calculated by the function [sTfR[ug/mL]/ln(ferritin)] | yes | not able to be assessed in all samples |   |
stfr_ugml | continuous | no | 259 | Serum transferrin receptor (ug/mL) | yes | not able to be assessed in all samples |   |
tfr_mgdl | continuous | no |   | Transferrin (mg\dL) |   | not able to be assessed in all samples |   |
tibc | continuous | no | 266 | Total Iron Binding Capacity (umol/L) | yes | not able to be assessed in all samples |   |
transferrin_gl | continuous | no |   | Serum transferrin in g/L |   | not able to be assessed in all samples |   |
trf_sat | continuous | no |   | Transferrin saturation calculated as [(serum iron conc/TIBC)*100] |   | not able to be assessed in all samples |   |
trf_sat_def | binary | yes |   | Categorical variable of Transferrin saturation |   | not able to be assessed in all samples | 0 = ">15%" 1 "=< 16%" |
visitno | integer | no |   |   sample analysed per participant, ordered by date of sample collection, range 1-3 with oldest to most recent . E.g. some patients had more than one sample analysed collected at different visit dates. |
  |   |   |
wbc | continuous | no | 264 | Total white blood cell count (x10^9/L) on day of sample analysed |   | missing from source database |   |