A Comparison of Machine Learning Models for Stroke Risk Prediction
DOI:
https://doi.org/10.63318/waujpasv4i2_01Keywords:
Classification, GP, Machine Learning, Models, Stroke Prediction, Unbalanced datasetAbstract
Stroke is a major cause of death and permanent disability globally. Effective preventive measures depend on early stroke risk prediction. This, study investigates a form of machine learning classifiers for predicting the risk of stroke and evaluates their performance in comparison to more conventional models, including Artificial Neural Networks (ANN), Logistic Regression (LR), Decision Trees (DT), Support Vector Machines (SVM), Naive Bayes (NB), K-Nearest Neighbors (KNN), AdaBoost, Gradient Boosting, and Genetic Programming (GP). A public dataset from DataHack comprising 4981 participants was used, with class imbalance addressed by using Synthetic Minority Over-sampling Techniques (SMOTE). The dataset was split into a training set with 80% and a testing set with 20%. According to the experimental results, all the proposed approaches performed well in all the presented categories. Moreover, the GP approach achieved the highest results among the proposed approaches, reaching 95.6% accuracy, a significant improvement over the 90% state-of-the-art accuracy. The GP approach delivers prediction accuracy by creating symbolic expressions that illustrate correlations between the probability of stroke and risk factors. This study highlights the potential of machine learning approaches in predictive healthcare applications while also emphasizing the need for improvements through hybrid approaches, parameter modifications, and practical medical integration.
Downloads
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Mona Aljero

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
This journal uses Creative Commons Attribution-Noncommerical 4.0 International License (CC BY-NC 4.0), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc/4.0/.
Copyright of articles
Authors retain copyright of their articles published in this journal.
