Predictive modelling of linear growth faltering among pediatric patients with Diarrhea in Rural Western Kenya: an explainable machine learning approach

Ogwel, Billy; Mzazi, Vincent H.; Awuor, Alex O.; Okonji, Caleb; Anyango, Raphael O.; Oreso, Caren; Ochieng, John B.; Munga, Stephen; Nasrin, Dilruba; Tickell, Kirkby D.; Pavlinac, Patricia B.; Kotloff, Karen L.; Omore, Richard

doi:10.1186/s12911-024-02779-7

Research
Open access
Published: 02 December 2024

Predictive modelling of linear growth faltering among pediatric patients with Diarrhea in Rural Western Kenya: an explainable machine learning approach

Billy Ogwel^1,2,
Vincent H. Mzazi²,
Alex O. Awuor¹,
Caleb Okonji¹,
Raphael O. Anyango¹,
Caren Oreso¹,
John B. Ochieng¹,
Stephen Munga¹,
Dilruba Nasrin³,
Kirkby D. Tickell⁴,
Patricia B. Pavlinac⁴,
Karen L. Kotloff³ &
…
Richard Omore¹

BMC Medical Informatics and Decision Making volume 24, Article number: 368 (2024) Cite this article

812 Accesses
Metrics details

Abstract

Introduction

Stunting affects one-fifth of children globally with diarrhea accounting for an estimated 13.5% of stunting. Identifying risk factors for its precursor, linear growth faltering (LGF), is critical to designing interventions. Moreover, developing new predictive models for LGF using more recent data offers opportunity to enhance model accuracy, interpretability and capture new insights. We employed machine learning (ML) to derive and validate a predictive model for LGF among children enrolled with diarrhea in the Vaccine Impact on Diarrhea in Africa (VIDA) study and the Enterics for Global Heath (EFGH) ― Shigella study in rural western Kenya.

Methods

We used 7 diverse ML algorithms to retrospectively build prognostic models for the prediction of LGF (≥ 0.5 decrease in height/length for age z-score [HAZ]) among children 6–35 months. We used de-identified data from the VIDA study (n = 1,106) combined with synthetic data (n = 8,894) in model development, which entailed split-sampling and K-fold cross-validation with over-sampling technique, and data from EFGH-Shigella study (n = 655) for temporal validation. Potential predictors (n = 65) included demographic, household-level characteristics, illness history, anthropometric and clinical data were identified using boruta feature selection with an explanatory model analysis used to enhance interpretability.

Results

The prevalence of LGF in the development and temporal validation cohorts was 187 (16.9%) and 147 (22.4%), respectively. Feature selection identified the following 6 variables used in model development, ranked by importance: age (16.6%), temperature (6.0%), respiratory rate (4.1%), SAM (3.4%), rotavirus vaccination (3.3%), and skin turgor (2.1%). While all models showed good prediction capability, the gradient boosting model achieved the best performance (area under the curve % [95% Confidence Interval]: 83.5 [81.6–85.4] and 65.6 [60.8–70.4]) on the development and temporal validation datasets, respectively.

Conclusion

Our findings accentuate the enduring relevance of established predictors of LGF whilst demonstrating the practical utility of ML algorithms for rapid identification of at-risk children.

Peer Review reports

Introduction

Diarrhea, a global public health problem with greatest burden in low- and middle-income countries (LMICs) [1], is a leading etiology of malnutrition among children in LMICs, in part due to anorexia, decreased absorptive function, mucosal damage, catabolism and nutrient exhaustion [1, 2]. It has been reported that the cumulative burden of diarrhea days directly correlates with the degree of nutritional failure among children during early childhood and that catch-up growth does not appear to make up for the deficit [3]. Linear growth faltering (LGF), a precursor to stunting (height-for-age z-score [HAZ] <−2), is one form of malnutrition that results from protracted nutritional deprivation [4]. Stunting affects one-fifth of children globally and one-third of children in LMICs [5]. Globally, 13.5% of stunting cases are attributed to diarrhea [6]. Additionally, a vicious cycle of diarrhea and malnutrition can occur as malnutrition weakens the body’s defense against future diarrheal episodes resulting in more frequent and longer diarrheal illnesses. While LGF is a precursor to stunting, it also independently affects child development and health outcomes, such as cognitive delays, increased susceptibility to infections, and the risk of relapse into wasting after recovery [7, 8]. Furthermore, the effects of stunting can be severe and protracted, with reduced cognitive development, persistent poor health, and elevated risk of mortality [9]. Long term complications can include an increased risk of cardiovascular disease, type 2 diabetes, and obesity in adulthood [10, 11].

The timely and accurate identification of children at increased risk of LGF is crucial for early nutritional and healthcare interventions as well as efficient allocation of public health resources, efforts that could help to avert the associated negative outcomes. Data-driven predictive models could be leveraged to this end and a number of research efforts exist in the prediction of LGF among children with diarrhea [12, 13]. These studies utilized clinical and sociodemographic data from the Global Enteric Multicenter Study (GEMS), conducted between 2007 and 2011, to develop predictive models. The models lacked explainable methodologies to improve interpretability and demonstrated moderate discrimination, with areas under the ROC curve (AUC) of 67.0% for Branders et al. [12] and 75.0% for Ahmed et al. [13], respectively. While the existing models provide a valuable starting point, shifts in the study population over time may affect the predictive performance of these models [14, 15]. Moreover, development of new models using more recent and pertinent data offers the opportunity to improve model accuracy, enhance interpretability of models and capture new perspectives and insights into this public health problem. We used machine learning (ML), which has been adopted in public health and clinical practice to rapidly develop data-driven clinical prediction models, to develop and temporally validate predictive models for LGF among children aged < 5 years with diarrhea in rural Western Kenya.

Methods

Data sources

This retrospective study used data collected from the Kenyan site (in Siaya County) of two related diarrheal studies: The Vaccine Impact on Diarrhea in Africa (VIDA) study [16] for model development and evaluation; and the Enteric for Global Health (EFGH) Shigella surveillance study [17, 18] for temporal validation.

Development cohort

VIDA was designed to assess the population-based incidence, etiologies, and adverse clinical consequences of diarrhea following rotavirus vaccine introduction in children aged 0–59 months residing in censused populations in 3 African countries. Moderate-to-severe diarrhea (MSD) cases, defined as children in 3 age strata (0–11, 12–23, and 24–59 months) presenting with diarrhea (defined as ≥ 3 looser-than-normal stools within 24 h) that began within the past 7 days after ≥ 7 diarrhea-free days and had ≥ 1 of the following: sunken eyes, poor skin turgor, dysentery, intravenous rehydration, or required hospitalization, were enrolled from sentinel health centers (SHCs) serving the health and demographic surveillance systems population at each site. The aim was to enroll 8–9 MSD cases in each age stratum per fortnight. 1–3 diarrhea-free controls matched by age, gender and geographical location were enrolled within 14 days of case enrolment. Follow-ups were conducted between 49 and 91 days after enrolment. We utilized data collected from cases enrolled at the VIDA Kenya site over a 36 months period from May 2015 and July 2018 restricting to children aged 6–35 months to make the development and temporal validation cohorts comparable. The study design, clinical and epidemiological methods for VIDA have been described elsewhere [16, 19].

In addition to the VIDA data (n = 1,106), we generated a synthetic dataset (n = 8,894) based on the VIDA data using the synthpop package [20] to increase the sample size and to enable the algorithms to generate more stable and reliable predictions that are less sensitive to noise in the data. The variables of the synthetic dataset were compared to the original training dataset with the synthetic dataset demonstrating high similarity to the original dataset (Fig S1). The combined dataset (N = 10,000) was used for training and internal validation with a split-sampling conducted in the ratio 3:1 to partition the development data into training and test sets [21].

Temporal validation cohort

The EFGH study set out to establish incidence and consequences of Shigella medically attended diarrhea (MAD) within 7 country sites in Africa, Asia, and Latin America using cross-sectional and longitudinal study designs. MAD cases defined as children aged 6–35 months presenting with diarrhea (defined as ≥ 3 looser-than-normal stools within 24 h) that began within the past 7 days after ≥ 2 diarrhea-free days were enrolled from SHCs in the study catchment area [17]. Additional eligibility criteria included: residing within the pre-defined study catchment area; primary caregiver and child plan to remain at their current residence for at least the next 4 months; legal guardian consenting to child’s participation in the study as well willingness to be followed-up for 3 months post-enrolment; child is not being referred to a non-EFGH facility at the time of screening; and site enrollment cap has not been met. Follow-ups were conducted at week-4 (24–67 days) and month-3 (84–127 days). Our study utilized data from children enrolled in Kenya between 01 August, 2022 and 31 July, 2023 to temporally validate the champion model.

Information on demographic, socio-demographic, epidemiological and clinical characteristics were collected at enrollment by study personnel in both studies [18].

Target variable

Consistent with previous studies [12, 13], we defined the target variable, LGF, as decrease of 0.5 HAZ or more (Δ HAZ ≥ − 0.5) within 49–91 days of enrollment in VIDA, or within 84–127 days in EFGH. We also computed change in HAZ per month of follow-up and categorized a negative change as LGF in our sensitivity analysis, similar to the definition used by Nasrin et al. [22]. We excluded children with implausible HAZ values (HAZ > 6 or < − 6 and change in (Δ) HAZ > 3; or length values that were > 1.5 cm lower at follow-up than at enrollment.

Predictive variables and feature selection

A total of 68 potential candidate predictors collected at enrollment during both studies were considered, including demographic, household-level characteristics, illness history, anthropometric and clinical characteristics collected at enrolment. Missingness patterns were assessed among the features and the missing data points imputed using the Multiple Imputation by Chained Equations (MICE) package [23]. Furthermore, we conducted feature selection to reduce dimensionality, optimize performance, reduce computational complexity and enhance model interpretability. The feature selection was implemented using the Boruta package [24] an all relevant feature selection wrapper around the random forest algorithm that selects relevant features by comparing original attributes’ importance (contribution of each variable to the model’s predictive accuracy) with importance achievable at random using their permuted copies. Features that were either confirmed or tentative from the feature selection process were then used in model development. Moreover, among the confirmed and tentative features, we excluded variables that were not collected in both studies (breastfeeding).

Statistical analysis

We compared patient characteristics of children with LGF versus those without. Proportions were reported for categorical variables and either chi-square or Fisher`s exact test were performed as appropriate. Wilcoxon rank sum tests were used to compare continuous variables as appropriate. We also compared the prevalence of LGF between the 2 studies.

Model development and internal validation

To derive the LGF prediction model, we utilized 7 ML algorithms including: Random Forest (RF), Gradient Boosting (GBM), Naive Bayes (NB), Logistic regression (LR), Support vector machine (SVM), K-nearest neighbors (KNN) and Artificial Neural Networks (ANN). The predictive models were developed in the training dataset using 10-fold cross-validation [25], a valuable step in model development helping to obviate under-fitting or overfitting of the model and ensure robust and well-performing models. Due to the moderate class imbalance in our target variable (LGF), we employed sub-sampling techniques (over-sampling) within the resampling procedure to mitigate the negative impact of class disparity on model fitting [26]. We then conducted internal validation of the models on the test data evaluating performance using the following metrics: sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and F1-score. Receiver operating characteristic (ROC) curves were constructed and the area under the curve (AUC) and the precision-recall area under the curve (PRAUC) for each model were computed using the precrec package [27]. The ROC AUC is a threshold-independent metric that summarizes a model’s overall performance in discriminating between two classes. It represents the area under the ROC curve, which plots the true positive rate (sensitivity) against the false positive rate (1-specificity) at various classification thresholds. The PRAUC is a threshold-independent metric, particularly well-suited for imbalanced datasets. It summarizes model performance by capturing the area under the precision-recall curve, which illustrates the tradeoff between precision (the proportion of true positives among predicted positives) and recall (the proportion of true positives among all actual positives) across various thresholds. We determined the champion model as the model with the best AUC. We also assessed calibration in the developed models using Brier scores (the mean squared error between the actual outcome and the estimated probabilities), Spiegelhalter’s z-test (a formal measurement that serves as a proxy for calibration calculated from the decomposition of Brier score) and its accompanying p-value [28]. We used Platt scaling approach, in which model estimates are transformed by passing the estimates through a trained sigmoid function, to calibrate the champion model [28]. To enhance model interpretability, trust and fairness, we conducted explanatory model analysis (EMA) for the top two models using a model agnostic procedure to estimate SHapley Additive exPlanations (SHAPs) attributions, showing the magnitude and direction of association, drawing on the DALEX package [29].

Temporal Validation and Business Value Evaluation

We further conducted temporal validation on the champion model to assess the robustness and generalizability of the model’s performance over time [30]. To evaluate the business value of the predictive model, modelplotr package [31] was used to build valuable evaluation plots (cumulative gains, cumulative lift, response and cumulative response plots). Descriptive analysis, predictive modelling for LGF and plotting were all performed in R version 4.2.2 [32].

Results

A total of 1,554 and 706 children were enrolled in the development and temporal validation cohorts, respectively. Among children aged 6–35 months enrolled, 1,106 (71.2%) and 655 (92.7%) had HAZ data that were plausible, respectively. Among those that had plausible HAZ data, 187 (16.9%) and 147 (22.4%) had LGF in the development and temporal validation cohorts, respectively (Fig. 1).

The median [interquartile range] ΔHAZ between enrollment and follow-up was − 0.21 [−0.42- −0.01] and − 0.24 [−0.48- −0.02] in the development and temporal validation cohorts, respectively. In the sensitivity analysis using the cut-off of negative change in HAZ, the prevalence of LGF was 1,051 (28.7%). Additionally, the constructed synthetic dataset had 8,527 observations and it closely replicated the propensity score distribution of the original development data (VIDA) as evidenced by the comprehensive descriptive analysis that compared each variable (Table S1).

The characteristics of VIDA participants at enrolment stratified by LGF status are shown in Table 1. Children who had LGF were younger than those who did not (Median age in months [IQR]: 11 [8,9,10,11,12,13,14] vs. 17 [11,12,13,14,15,16,17,18,19,20,21,22,23,24], p < 0.001). Furthermore, compared with those who did not have LGF, those with LGF had a higher respiratory rate (Median [IQR]: 38.5 [34.0–42.5] vs. 36.0[31.5–39.5], p < 0.001), a higher temperature (Median [IQR]: 37.1 [36.6–37.8] vs. 36.8 [36.4–37.5], p < 0.001) and more severe disease (Median Vesikari score [IQR]: 11 [9,10,11,12] vs. 10 [8,9,10,11,12], p < 0.001). Additionally, caretaker education, vomiting, wrinkled skin, restless, admission, and intravenous rehydration were significantly associated with LGF (Table 1).

Table 1 Characteristics of children aged < 5 years seeking care for moderate-to-severe diarrhea in Kenya stratified by Linear Growth faltering Status, 2015–2018

Full size table

From the feature selection analysis, the confirmed variables in order of importance were age (16.6%), temperature (6.0%), and respiratory rate (4.1%) SAM (3.4%), rotavirus vaccination (3.3%), and skin turgor (2.1%) were tentative features (Fig. 2).

In addition to age, respiratory rate, and temperature, the following features were selected: confirmed (stunting at baseline [5.2%], vomit [4.0%], Vesikari score (3.7%) and sunken eyes [3.6%]) and tentative (bacterial infection diagnosis [2.5%]) in the sensitivity analysis using a cut-off of negative change in HAZ (Figure S2).

Model performance

We evaluated seven ML algorithms in the prediction of LGF. From the developed models, sensitivity was highest in the RF model (80.7%), followed by the ANN (79.5%), SVM (77.3%), NB (76.5%), GBM (75.6%), LR (75.4%) and lowest in the KNN model (72.4%). The specificity ranged from 58.2 to 71.8%. Specifically, the specificity of the GBM model was the highest (71.8%), followed by RF (70.1%), LR (61.9%), NB and SVM (61.6%), KNN (61.4%) and lowest.

in the ANN model (58.2%). The PPV ranged between 27.4 − 34.9% while the NPV ranged between 92.3 − 94.8%. The AUC of the models ranged from 73.4 to 83.5% with the GBM model having the highest AUC (83.5%, 95% Confidence Interval [95% CI]: 81.6–85.4) (Table 2).

Table 2 Model performance of linear growth faltering prediction^β models using combined data (original and synthetic data)

Full size table

The GBM model was chosen as the champion model. The receiver operating characteristic (ROC) curves for LGF prediction models are shown in Figure S3. Moreover, in the sensitivity analysis using only the VIDA data in development, the model performance ranged between 63.0 and 82.6%, 55.9–78.6%, 27.3–33.7%, 91.0–94.2%, 40.3–44.3%, 68.0–75.5%, and 90.6–94.4% for sensitivity, specificity, PPV, NPV, F1-score, AUC and PRAUC, respectively (Table 3). All models showed a decline in predictive performance during sensitivity analysis except for the SVM model, which had a marginal increase.

Table 3 Model performance of linear growth faltering prediction ^β models using original training data only

Full size table

In the sensitivity analysis using the second definition of LGF (negative change in HAZ), the model performance ranged between 45.8 and 73.1%, 53.2–76.6%, 79.0–90.5%, 28.6–48.5%, 58.3–80.9%, 58.0–82.4%, and 29.0–62.6% for sensitivity, specificity, PPV, NPV, F1-score, AUC and PRAUC, respectively (Table S2). In this scenario, all models exhibited a drop in predictive performance except for the SVM model, which had a marginal increase and the RF model which registered same performance as in the primary analysis.

Overall the Brier scores were relatively high and ranged between 0.19 and 2.50 (Table 4).The Spiegelhalter’s p-value showed that all the models were not properly calibrated (p < 0.05). The performance of the calibrated GBM model was largely similar to its uncalibrated form with the model having an AUC of 83.7%.

Table 4 Calibration results of linear growth faltering prediction models

Full size table

Explanatory model analysis

The EMA results for the top 2 models in the primary analysis were similar though the degree of importance varied across models with no SAM, no skin turgor, no rotavirus vaccine, age, elevated temperature and respiratory rate being predictive of LGF (Fig. 3). Similarly, in the sensitivity analysis using the second definition of LGF, the direction of association was similar between the two models although the magnitude of importance varied. In addition to age, respiratory rate and temperature, the following factors were also identified to be predictive of LGF: severity of disease, no vomiting, stunting at baseline, bacterial infection and lack of sunken eyes (Fig. 3).

Business value evaluation of Champion Model

From the business value evaluation of our champion model (GBM), the cumulative gains plot shows that the model is able to select ~ 60% of the target class (LGF) if we select the top-20% cases based on our model. Additionally, from the cumulative lift plot, our champion model is able to identify ~ 3 times higher number of the target class compared to a random selection if we pick the top-20% observations based on model probability. Lastly, from the cumulative response plot, 48% of observations in the top-20% cases based on model probability belong to the target class (Fig. 4).

Temporal validation

We observed a decline in model performance on the temporal validation dataset with the AUC dropping by ~ 18%. Additionally, all metrics dropped in temporal validation with the GBM model achieving 53.7%, 67.7%, 32.5%, 83.5%, 40.5%, 65.6% and 86.4% for sensitivity, specificity, PPV, NPV, F1-score, AUC and PRAUC, respectively (Fig. 5).

Discussion

The study findings illuminate a comprehensive exploration into the prediction of LGF among pediatric patients presenting with diarrhea, employing a robust ML framework. The study involved the development and temporal validation of predictive models using diverse cohorts, revealing distinct prevalence rates and influencing factors associated with LGF. Key features linked to this outcome, such as age, rotavirus vaccination, respiratory rate, temperature and SAM, were identified through extensive feature selection and their impact on risk prediction was estimated using SHAP attribution. The ML algorithms exhibited varying performance with GBM model emerging as the champion model, demonstrating promising business value. However, the temporal validation uncovered a notable decline in model performance, emphasizing the dynamic nature of health data and the need for ongoing model evaluation and adaptation.

Despite the impact of rotavirus vaccine introduction on the epidemiology of diarrhea and pathogen landscape, we identified similar predictors, in addition to rotavirus vaccination, to previous modelling efforts [12, 13] that used data collected pre-vaccine introduction─ age,, respiratory rate, temperature, absence SAM and stunting at baseline. This finding underscores the enduring importance of these risk factors and the need for comprehensive, sustained, and adaptable public health strategies to combat LGF. Furthermore, we observed that rotavirus vaccination was inversely associated with LGF a finding that is consistent with those of Loli and Carcamo who studied the impact of vaccination on HAZ in Peruvian children aged 6–60 months [33]. This finding could be due to rotavirus vaccination substantially reducing the incidence and severity of rotavirus infections, curbing the immediate impact of diarrheal diseases on nutrient absorption and consequently diarrhea-mediated growth faltering [33]. Bolstering rotavirus vaccination is a possible strategy that could be leveraged by policy makers and public health experts to reduce stunting in such settings. Moreover, from a modelling perspective, this finding on predictors generates confidence in the relevance and stability of these variables in different contexts and epidemiological periods, enhancing model transferability and generalizability.

These variables have been documented as risk factors for LGF in previous studies. Specifically, age is a significant determinant of LGF among pediatric populations following a diarrheal episode [12, 13, 34]. Infants and very young children face heightened vulnerability to nutritional and health challenges due to their ongoing physiological maturation, which is exacerbated during diarrheal illness leading to pronounced impacts on nutrient absorption and utilization cumulatively contributing to a heightened prevalence of LGF among younger children. Stunting has been shown to be irreversible to a large extent after reaching 24 months of age [35]. Therefore, the timely identification of at-risk children (infants and toddlers) facilitates the implementation of effective preventive strategies during this critical window of opportunity in early childhood. Contrary to existing evidence [36, 37], we observed children without SAM to be at increased risk of LGF. Despite majority of factors predisposing children to SAM and stunting being similar, we observed a discordant relationship between the two and this may require further investigation to gain insights into this finding. Elevated baseline temperature and respiratory rate signal are markers of disease severity, and particularly those affecting the gastrointestinal tract, may lead to nutritional deficiencies and hinder linear growth [12, 13]. Additionally, elevated respiratory rate and temperature may indicate increased energy expenditure, potentially due to the body’s efforts to combat infections or inflammation. This increased energy demand can divert resources away from growth-related processes, impacting linear growth.

Tree-based ensembles showed good predictive performance with the GBM model narrowly outperforming the RF model in the prediction of LGF. Our champion model outperformed existing models by Brander et al. (AUC = 67.0%) [12] and (AUC = 75.0%) Ahmed et al. [13]. The improvement in model performance could be attributed to the robust modelling approach employed. Moreover, the predictive prowess of tree-based ensembles may have also contributed to this improvement. This strong discriminatory ability of the champion model has significant public health implications as it reinforces the feasibility and efficacy of ML algorithms in timely identification of children, at increased risk of LGF, for early nutritional and healthcare interventions. The model can enhance the efficiency of resource allocation by facilitating targeted screening as well as providing healthcare providers with a valuable tool for informed decision-making, enabling tailored interventions based on individual children risk profiles. The champion model could be implemented as web-based application using platforms like R-shiny or Plumber [38, 39], or integrated directly into electronic health record systems [40] to ensure alignment with clinical workflows. Deploying the model through these straightforward and adaptable methods would enable quick adoption in clinical settings, supporting clinicians in promptly identifying at-risk patients and enhancing clinical decision-making. However, the decline in model performance during temporal validation while consistent with findings from Ahmed et al. [13] raises important considerations. Spectral differences in the severity of diarrhea among children in the development and validation cohorts, coupled with potential shifts in the study population over time, highlight challenges in maintaining consistent predictive accuracy. This finding highlights the need for monitoring and periodic retraining of the model in order to maintain its predictive performance.

Our primary analysis that used combined data (VIDA and synthetic data) in model development had better performance than the sensitivity analysis that only used VIDA data. This result emphasizes the importance of synthetic data in addressing challenges associated with imbalanced, limited, or privacy-sensitive real-world datasets, providing a means to augment and diversify the data pool [41, 42]. This approach overcomes issues of data scarcity, facilitates more comprehensive model training, and enhances generalization. It contributes to overcoming biases, ensuring model fairness, and accommodating the complexity of risk factors influencing a health outcome. Ultimately, the strategic use of synthetic data strengthens the reliability, generalizability, and ethical integrity of predictive models, offering a pathway for more effective and personalized healthcare interventions. However, synthetic data may advance bias propagation since any biases in the primary data will be reflected in the generated data and this may perpetuate and even exacerbate healthcare disparities if they exist [43]. In addition to the quality of synthetic data being largely dependent on the underlying primary data, synthetic datasets may fail to encompass the complete range of variations and intricacies found in real-world data. Furthermore, in the second sensitivity analysis using a cutoff of any negative change in HAZ, we observed a substantial decline in model performance compared to using a cutoff of a decrease of 0.5 HAZ or more. These results imply that using a specific cutoff criteria for defining LGF can significantly impact the performance of the predictive model. Different cutoff criteria may be more appropriate in different contexts, and the choice should be informed by clinical expertise and relevance considering the specific context of the healthcare setting, study population (varying age categories), and the clinical significance of HAZ changes. It also underscores the dynamic nature of model performance, necessitating ongoing evaluation and adaptation to maintain optimal cutoff criteria.

Our study, while commendable, has limitations, notably the exclusion of pathogen data during model development to maintain practical applicability, despite its influence on LGF. Future research should address this gap, as well as focus on the acceptability and impact of ML models on clinical practice and patient outcomes. The cost-effectiveness of deploying these models is also crucial for practical implementation in diverse healthcare settings. Exploring these facets will contribute significantly to enhancing understanding and ensuring the effective use of ML models in healthcare.

Conclusion

The study’s findings emphasize the enduring relevance of established predictors of LGF. Addressing multifaceted challenges in pediatric LGF requires sustained efforts with adaptive interventions for these risk factors. The study demonstrates the practical use of ML algorithms for rapid identification of at-risk children for early nutritional and healthcare interventions. The model can enhance the efficiency of resource allocation by facilitating targeted screening as well as providing healthcare providers with a valuable tool for informed decision-making, enabling tailored interventions based on individual children risk profiles. However, a decline in model performance during temporal validation highlights the dynamic nature of health data, necessitating continuous evaluation and adaptation. Additionally, the study shows the viability of integrating synthetic data to enhance model robustness, providing a pathway for more comprehensive and ethical predictive modeling in healthcare.

Data availability

The data used for the modelling in this study belongs to KEMRI and restrictions apply to the availability of these data.

References

World Health Organization. Diarrhoeal disease. 2017. https://www.who.int/news-room/fact-sheets/detail/diarrhoeal-disease. Accessed 19 February 2022.
Ferdous F, Das SK, Ahmed S, et al. Severity of Diarrhea and Malnutrition among under five-year-old children in Rural Bangladesh. Am J Trop Med Hyg. 2013;89:223–8.
Article PubMed PubMed Central Google Scholar
Checkley W, Buckley G, Gilman RH, et al. Multi-country analysis of the effects of diarrhoea on childhood stunting. Int J Epidemiol. 2008;37:816.
Article PubMed PubMed Central Google Scholar
Lenters L, Wazny K, Bhutta ZA. Management of Severe and Moderate Acute Malnutrition in Children. In: Black RE, Laxminarayan R, Temmerman M, Walker N, eds. Reproductive, Maternal, Newborn, and Child Health: Disease Control Priorities, Third Edition (Volume 2). Washington (DC): The International Bank for Reconstruction and Development / The World Bank, 2016. http://www.ncbi.nlm.nih.gov/books/NBK361900/. Accessed 27 November 2020.
WHO, UNICEF, World Bank., Levels and trends in child malnutrition: UNICEF/WHO/World Bank Group joint child malnutrition estimates: key findings of the 2023 edition. 2023. https://www.who.int/publications/i/item/9789240073791. Accessed 3 November 2024.
Danaei G, Andrews KG, Sudfeld CR, et al. Risk factors for Childhood Stunting in 137 developing countries: a comparative risk Assessment Analysis at Global, Regional, and country levels. PLoS Med. 2016;13:e1002164.
Article PubMed PubMed Central Google Scholar
Stobaugh HC, Rogers BL, Rosenberg IH, et al. Children with poor Linear Growth are at risk for repeated relapse to Wasting after Recovery from Moderate Acute Malnutrition. J Nutr. 2018;148:974–9.
Article PubMed Google Scholar
Mertens A, Benjamin-Chung J, Colford JM, et al. Causes and consequences of child growth faltering in low-resource settings. Nature. 2023;621:568–76.
Article CAS PubMed PubMed Central Google Scholar
Wierzba TF, Muhib F. Exploring the broader consequences of diarrhoeal diseases on child health. Lancet Global Health. 2018;6:e230–1.
Article PubMed Google Scholar
Guerrant RL, DeBoer MD, Moore SR, Scharf RJ, Lima AAM. The impoverished gut–a triple burden of diarrhoea, stunting and chronic disease. Nat Rev Gastroenterol Hepatol. 2013;10:220–9.
Article PubMed Google Scholar
ROTA Council. The broader impact of early childhood diarrhea. 2019. https://preventrotavirus.org/wp-content/uploads/2019/05/ROTA-Brief6-LastingImpact-SP-1-3.pdf. Accessed 1 November 2022.
Brander RL, Pavlinac PB, Walson JL, et al. Determinants of linear growth faltering among children with moderate-to-severe diarrhea in the global enteric Multicenter Study. BMC Med. 2019;17:214.
Article PubMed PubMed Central Google Scholar
Ahmed SM, Brintz BJ, Pavlinac PB, et al. Derivation and external validation of clinical prediction rules identifying children at risk of linear growth faltering. Elife. 2023;12:e78491.
Article CAS PubMed PubMed Central Google Scholar
Rahmani K, Thapa R, Tsou P, et al. Assessing the effects of data drift on the performance of machine learning models used in clinical sepsis prediction. Int J Med Informatics. 2023;173:104930.
Article Google Scholar
Sahiner B, Chen W, Samala RK, Petrick N. Data drift in medical machine learning: implications and potential remedies. Br J Radiol. 2023;96(1150):20220878.
Powell H, Liang Y, Neuzil KM, et al. A description of the statistical methods for the Vaccine Impact on Diarrhea in Africa (VIDA) Study. Clin Infect Dis. 2023;76:S5–11.
Article PubMed PubMed Central Google Scholar
Atlas HE, Conteh B, Islam MT, et al. Diarrhea Case Surveillance in the Enterics for Global Health Shigella Surveillance Study: epidemiologic methods. Open Forum Infect Dis. 2024;11:S6–16.
Article PubMed PubMed Central Google Scholar
Feutz E, Biswas PK, Ndeketa L, et al. Data Management in Multicountry Consortium studies: the Enterics for Global Health (EFGH) Shigella Surveillance Study Example. Open Forum Infect Dis. 2024;11:S48–57.
Article PubMed PubMed Central Google Scholar
Nasrin D, Liang Y, Powell H, et al. Moderate-to-severe diarrhea and stunting among children younger than 5 years: findings from the Vaccine Impact on Diarrhea in Africa (VIDA) Study. Clin Infect Dis. 2023;76:S41–8.
Article PubMed PubMed Central Google Scholar
Nowok B, Raab GM, Dibben C. synthpop: Bespoke Creation of Synthetic Data in R. J Stat Soft. 2016; 74. http://www.jstatsoft.org/v74/i11/. Accessed 23 September 2023.
Joseph VR. Optimal ratio for data splitting. Statistical Analysis and Data Mining: The ASA Data Science Journal. 2022;15:531–8.
Article Google Scholar
Nasrin D, Blackwelder WC, Sommerfelt H, et al. Pathogens Associated with Linear Growth Faltering in Children with Diarrhea and Impact of Antibiotic Treatment: The Global Enteric Multicenter Study. J Infect Dis. 2021;224:S848–55.
Article PubMed PubMed Central Google Scholar
van Buuren S, Groothuis-Oudshoorn K, Vink G et al. Package ‘mice’. 2021. https://cran.r-project.org/web/packages/mice/mice.pdf. Accessed 31 May 2021.
Kursa MB, Rudnicki WR. Package ‘Boruta’. 2020. https://cran.r-project.org/web/packages/Boruta/Boruta.pdf. Accessed 31 May 2021.
Refaeilzadeh P, Tang L, Liu H. Cross-Validation. In: LIU L, ÖZSU MT,Encyclopedia of Database Systems. Boston, Springer MA. US, 2009: 532–538. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/978-0-387-39940-9_565. Accessed 17 October 2023.
Bach M, Werner A, Żywiec J, Pluskiewicz W. The study of under- and over-sampling methods’ utility in analysis of highly imbalanced data on osteoporosis. Inf Sci. 2017;384:174–90.
Article Google Scholar
Saito T, Rehmsmeier M. precrec: Calculate Accurate Precision-Recall and ROC (Receiver Operator Characteristics) Curves. 2023; https://CRAN.R-project.org/package=precrec. Accessed 10 February 2023.
Huang Y, Li W, Macheret F, Gabriel RA, Ohno-Machado L. A tutorial on calibration measurements and calibration models for clinical prediction models. J Am Med Inform Assoc. 2020;27:621–33.
Article PubMed PubMed Central Google Scholar
Biecek P, Maksymiuk S, Baniecki H, DALEX. : moDel Agnostic Language for Exploration and eXplanation. 2023; https://CRAN.R-project.org/package=DALEX. Accessed 10 February 2023.
Cowley LE, Farewell DM, Maguire S, Kemp AM. Methodological standards for the development and evaluation of clinical prediction rules: a review of the literature. Diagn Prognostic Res. 2019;3:16.
Article Google Scholar
Nagelkerke J. modelplotr: Plots to evaluate the business value of predictive models. 2020; https://cran.r-project.org/web/packages/modelplotr/vignettes/modelplotr.html. Accessed 19 November 2022.
R Core Team. R: The R Project for Statistical Computing. 2021. https://www.r-project.org/. Accessed 3 December 2021.
Loli S, Carcamo CP. Rotavirus vaccination and stunting: Secondary Data Analysis from the Peruvian Demographic and Health Survey. Vaccine. 2020;38:8010–5.
Article PubMed Google Scholar
Benjamin-Chung J, Mertens A, Colford JM, et al. Early-childhood linear growth faltering in low- and middle-income countries. Nature. 2023;621:550–7.
Article CAS PubMed PubMed Central Google Scholar
Victora CG, de Onis M, Hallal PC, Blössner M, Shrimpton R. Worldwide timing of growth faltering: revisiting implications for interventions. Pediatrics. 2010;125:e473–480.
Article PubMed Google Scholar
Ngari MM, Iversen PO, Thitiri J, et al. Linear growth following complicated severe malnutrition: 1-year follow-up cohort of Kenyan children. Arch Dis Child. 2019;104:229–35.
Article PubMed Google Scholar
Garenne M, Myatt M, Khara T, Dolan C, Briend A. Concurrent wasting and stunting among under-five children in Niakhar, Senegal. Matern Child Nutr. 2018;15:e12736.
Article PubMed PubMed Central Google Scholar
Eddington HS, Trickey AW, Shah V, Harris AHS. Tutorial: implementing and visualizing machine learning (ML) clinical prediction models into web-accessible calculators using Shiny R. Ann Transl Med. 2022;10:1414.
Article PubMed PubMed Central Google Scholar
Murphree DH, Quest DJ, Allen RM, Ngufor C, Storlie CB. Deploying Predictive Models In A Healthcare Environment - An Open Source Approach. In: 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). 2018: 6112–6116. https://ieeexplore.ieee.org/document/8513689. Accessed 6 October 2023.
Khalilia M, Choi M, Henderson A, Iyengar S, Braunstein M, Sun J. Clinical Predictive Modeling Development and Deployment through FHIR Web Services. AMIA Annu Symp Proc. 2015;2015:717–26.
PubMed PubMed Central Google Scholar
Giuffrè M, Shung DL. Harnessing the power of synthetic data in healthcare: innovation, application, and privacy. Npj Digit Med. 2023;6:1–8.
Article Google Scholar
Gonzales A, Guruswamy G, Smith SR. Synthetic data in health care: a narrative review. PLOS Digit Health. 2023;2:e0000082.
Article PubMed PubMed Central Google Scholar
Marwala T, Fournier-Tombs E, Stinckwich S. The use of synthetic data to train AI models: opportunities and risks for sustainable development. ArXiv abs/2309.00652. 2023: n. pag.

Download references

Acknowledgements

We appreciate the contributions and efforts of KEMRI-CGHR staff involved in the data collection, data management, and laboratory testing of samples in the two studies. We also wish to thank the study participants and the ministry of health staff for supporting both studies. Moreover, we would like to acknowledge the use of artificial intelligence (AI) technology for grammar checking and proofreading of this manuscript.

Funding

This work was supported by the Bill & Melinda Gates Foundation (grant INV-045988). The funders did not play any role in the study and interpretation of its outcome.

Author information

Authors and Affiliations

Kenya Medical Research Institute- Center for Global Health Research (KEMRI-CGHR), P.O Box 1578-40100, Kisumu, Kenya
Billy Ogwel, Alex O. Awuor, Caleb Okonji, Raphael O. Anyango, Caren Oreso, John B. Ochieng, Stephen Munga & Richard Omore
Department of Information Systems, University of South Africa, Pretoria, South Africa
Billy Ogwel & Vincent H. Mzazi
Department of Medicine, Center for Vaccine Development and Global Health, University of Maryland School of Medicine, Baltimore, MD, USA
Dilruba Nasrin & Karen L. Kotloff
Department of Global Health, University of Washington, Seattle, USA
Kirkby D. Tickell & Patricia B. Pavlinac

Authors

Billy Ogwel
View author publications
You can also search for this author inPubMed Google Scholar
Vincent H. Mzazi
View author publications
You can also search for this author inPubMed Google Scholar
Alex O. Awuor
View author publications
You can also search for this author inPubMed Google Scholar
Caleb Okonji
View author publications
You can also search for this author inPubMed Google Scholar
Raphael O. Anyango
View author publications
You can also search for this author inPubMed Google Scholar
Caren Oreso
View author publications
You can also search for this author inPubMed Google Scholar
John B. Ochieng
View author publications
You can also search for this author inPubMed Google Scholar
Stephen Munga
View author publications
You can also search for this author inPubMed Google Scholar
Dilruba Nasrin
View author publications
You can also search for this author inPubMed Google Scholar
Kirkby D. Tickell
View author publications
You can also search for this author inPubMed Google Scholar
Patricia B. Pavlinac
View author publications
You can also search for this author inPubMed Google Scholar
Karen L. Kotloff
View author publications
You can also search for this author inPubMed Google Scholar
Richard Omore
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

BO, VHM, KDT, PBP and RO conceived the study and contributed to study design and implementation. BO, VHM and KDT analyzed and interpreted the data. BO drafted the manuscript and all authors critically reviewed the manuscript for intellectual content and approved the final manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Billy Ogwel.

Ethics declarations

Ethics approval and consent to participate

The VIDA protocol was approved by the Institutional Review Board of the University of Maryland School of Medicine, Baltimore, MD, USA (UMB Protocol #: HM-HP-00062472) and the Kenya Medical Research Institute (KEMRI) Scientific and Ethical Review Unit (SERU) (SERU#2996). The EFGH protocol was approved by the KEMRI SERU (SERU#4362). Written informed consent was sought from caregivers in both studies before initiation of study procedures. Additionally, ethical approval for undertaking the current study was sought from the health research ethics committee of the University of South Africa, College of Agricultural Sciences (2023/CAES_HREC/2192).

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Material 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ogwel, B., Mzazi, V.H., Awuor, A.O. et al. Predictive modelling of linear growth faltering among pediatric patients with Diarrhea in Rural Western Kenya: an explainable machine learning approach. BMC Med Inform Decis Mak 24, 368 (2024). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12911-024-02779-7

Download citation

Received: 08 March 2024
Accepted: 22 November 2024
Published: 02 December 2024
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12911-024-02779-7

Predictive modelling of linear growth faltering among pediatric patients with Diarrhea in Rural Western Kenya: an explainable machine learning approach

Abstract

Introduction

Methods

Results

Conclusion

Introduction

Methods

Data sources

Development cohort

Temporal validation cohort

Target variable

Predictive variables and feature selection

Statistical analysis

Model development and internal validation

Temporal Validation and Business Value Evaluation

Results

Model performance

Explanatory model analysis

Business value evaluation of Champion Model

Temporal validation

Discussion

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Supplementary Information

Supplementary Material 1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Medical Informatics and Decision Making

Contact us