Applications of Machine Learning Algorithm for Community-Based Health Insurance Classification in Ethiopia

Seyifemickael Amare Yilema1, *, Yegnanew A. Shiferaw2, Yikeber Abebaw Moyehodie1, Setegn Muche Fenta1, Denekew Bitew Belay3, Haile Mekonnen Fenta3
1 Department of Statistics, College of Natural and Computational Science, Debre Tabor University, Debre Tabor, Ethiopia
2 Department of Statistics, University of Johannesburg, Johannesburg, South Africa
3 Department of Statistics, College of Science, Bahir Dar University, Bahir Dar, Ethiopia



Adopting different machine learning algorithms to the Ethiopian health insurance data.


This study employed the 2019 Ethiopian mini demographic and health survey (EMDHS) data. The community-based health insurance (CBHI) data is used for machine learning (ML) comparisons.


The study's objective is to identify the ML algorithm for predicting health insurance and the important predictors.


Data splitting involves partitioning the data into an explicit training dataset to prepare the model and an unseen test dataset to evaluate the models' performance. ML algorithms such as linear discriminant analysis (LDA), support vector machine with radial basis function (SVM), k-nearest neighbors (KNN), classification and regression trees (CART), and random forest (RF) were employed for predicting health insurance.


The RF model algorithm is the best predicting power with large accuracy of CBHI. Many covariates are employed in the study, but the most important predictors are identified by using RF algorithms. The most important variables are age, wealth, household members and land usage were selected to be the top four important variables for predicting health insurance.


This study will be helpful for health insurance professionals and policy makers to develop a system for adopting necessary interventions. Further research will be conducted for ML-based classifiers and deep learning to predict health insurance.

Keywords: Machine learning, Health insurance, Random forest, Accuracy.

Abstract Information

Identifiers and Pagination:

Year: 2023
Volume: 16
DOI: 10.2174/18749445-v16-e230720-2023-30

Article History:

Electronic publication date: 20/07/2023
Collection year: 2023

© 2023 Seyifemickael et al.

open-access license: This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International Public License (CC-BY 4.0), a copy of which is available at: This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

* Address correspondence to this author at the Department of Statistics, College of Natural and Computational Science, Debre Tabor University, Debre Tabor, P.O.Box: 272, Ethiopia; E-mail: