All published articles of this journal are available on ScienceDirect.

RESEARCH ARTICLE

A Machine Learning Approach to the Prediction of Malaria in Under-five Children: Analysis of the 2021 Nigerian Malaria Indicator Survey

The Open Public Health Journal 12 June 2025 RESEARCH ARTICLE DOI: 10.2174/0118749445396163250604103305

Abstract

Background

Malaria remains a major cause of illness and death among children under five in Nigeria, despite efforts to control transmission. Accurate and reliable prediction of malaria outbreaks is crucial for health authorities to take timely measures. This study aims to identify the most robust machine learning classification algorithms for predicting the status of malaria in children under five (0-59 months).

Methods

The 2021 Nigeria Malaria Indicator Survey (NMIS) included 10,655 children under five who were tested for malaria using the Rapid Diagnostic Test (RDT). Various machine learning models were explored, including Decision Trees, K-Nearest Neighbor, Naïve Bayes, Random Forest, Support Vector Machines, and Survey Logistic Regression, and their performance was evaluated through metrics such as accuracy, AUC, balanced accuracy, F1-Score, negative predictive value, precision, sensitivity, and specificity.

Results

Random Forest (RF) is the most robust and balanced classification model due to its superior accuracy (79%), precision (77%), recall (62%), F1-score (69%), and AUC (80%). Support Vector Machine (SVM) also demonstrated strong performance, particularly in accuracy (74%) and AUC (80%). Survey Logistic Regression (SLR) and Decision Tree (DT) offered moderate results but fell short compared to RF and SVM, indicating the need for further optimization. Naive Bayes (NB) and K-Nearest Neighbors (KNN) had limitations, making them less reliable for this task.

Conclusion

In conclusion, the study reveals that RF and SVM are the best classification models for predicting malaria status in children under five years old. RF is reliable and balanced, while SVM is preferred for recall. SLR and DT show potential but require optimization. NB and KNN have significant performance gaps, making them less suitable. These findings will help policymakers and malaria intervention programs address key factors, enabling more targeted public health interventions to reduce the malaria burden on young children and improve the well-being of vulnerable populations in Nigeria.

Keywords: Malaria, Machine learning, Classification algorithms, Performance metrics, Rapid diagnostic test, K-Nearest neighbors.
Fulltext HTML PDF ePub
1800
1801
1802
1803
1804