Predictive Modelling for Stillbirths and Neonatal Deaths in Sub-Saharan Africa

Akuze, JORCID logo; Victor, AORCID logo; Bourke, M; Wanduru, PORCID logo; Blencowe, HORCID logo and Ohuma, EOORCID logo (2026). Predictive Modelling for Stillbirths and Neonatal Deaths in Sub-Saharan Africa. [Dataset]. Zenodo. https://doi.org/10.5281/zenodo.19037183
Copy

Initial release of the reproducible analytical pipeline for data harmonisation and predictive modelling of stillbirths and neonatal deaths in sub-Saharan Africa (SSA), integrating seven contributing studies: the Action Leveraging Evidence to Reduce Perinatal Mortality and Morbidity trial (ALERT), the Every Newborn-INDEPTH study (EN-INDEPTH), the Preterm Birth Initiative (PTBi), the Pregnancy Care Integrating Translational Science Everywhere cohort (PRECISE), the WHO Multi-Country Survey on Maternal and Newborn Health (WHOMCS), the Neonatal Care Outcomes Project Study (NCOPS), and Demographic and Health Surveys (DHS). The unified dataset contains 5,996,390 birth records from 66 countries. The pipeline implements a structured five-stage harmonisation framework: (1) ethical data acquisition and governance; (2) variable mapping across 13 harmonised domains using standardised domain-prefix naming conventions; (3) value standardisation and recoding using structured case-when logic with regular-expression pattern matching; (4) linkage of environmental and climate data from ERA5, CHIRPS, SRTM, ACAG and MODIS sources at 99.3% completeness; and (5) quality assurance including range validation, cross-tabulations and logical consistency checking. The modelling pipeline benchmarks classical statistical methods (logistic regression, generalised estimating equations), ensemble machine learning (Random Forest, XGBoost, LightGBM, CatBoost) and exploratory deep learning (multilayer perceptrons) across four prediction scenarios and two primary outcomes. Interpretability analysis uses SHapley Additive exPlanations (SHAP) values throughout. The methodology is documented in Data Harmonisation Documentation Version 6.1 and Statistical Analysis Plan Version 1 (February 2026), both publicly deposited on the Open Science Framework prior to commencement of model development analyses.

Keywords

stillbirth; neonatal death; sub-Saharan Africa; predictive modelling; data harmonization; machine learning; perinatal health; LMIC; low- and middle-income countries; ALERT; DHS; EN-INDEPTH; PRECISE; PTBi; WHOMCS; NCOPS; PRECISE study

No files available. Please consult associated links.


Atom BibTeX OpenURL ContextObject in Span Multiline CSV OpenURL ContextObject Dublin Core (with Type as Type) MPEG-21 DIDL Data Cite XML EndNote HTML Citation JSON METS MODS RDF+N3 RDF+N-Triples RDF+XML Reference Manager Refer Simple Metadata ASCII Citation EP3 XML
Export