AI-Based Cerebral Vascular Accident (CVA) Analysis and Prediction


  • Hala Shaheen IT Department, Faculty of Engineering and Information Technology, Palestine Ahliya University (Palestine)
  • Mutaz Rasmi Abu Sara IT Department, Faculty of Engineering and Information Technology, Palestine Ahliya University (Palestine)
  • Khaled Sabarna Nursing Department, Faculty of Allied Medical Sciences, Palestine Ahliya University (Palestine)
  • Labib Arafeh IT Department, Faculty of Engineering and Information Technology, Palestine Ahliya University (Palestine)



Early stroke detection significantly increases the prognosis for both survival and rehabilitation. Patients are more likely to receive appropriate therapy that minimizes brain damage and lowers the risk of consequences if a stroke is detected early on. Researchers are motivated to investigate the possibilities of artificial intelligence and machine learning technologies in creating new categorization systems that can identify and detect strokes more quickly and accurately due to their rapid development. This could potentially enhance the likelihood of surviving and recuperating. The support-vector machine (SVM), logistic regression, decision tree, random forest, Bayes nets, and K-nearest neighbor (KNN) algorithms are employed in this study's CRISP model technique. To enhance the final quality, the dataset was balanced using an oversampling technique, and the algorithms employed were subjected to principal components analysis (PCA). With an accuracy rate of 99%, the Random Forest algorithm is regarded as the optimum for prediction. Our study illustrates that the random forest classification model using the data balancing strategy outperforms the other strategies investigated, with a 99% classification accuracy and a 98% F1 score. The study also shows that the outcomes are unaffected by preprocessing with the PCA technique. The next objectives of the study are to use a larger dataset, various preprocessing methods, and machine learning models to enhance the framework models.


Cerebral Vascular Accident (CVA), Stroke Machine Learning (ML), Principal Component Analysis (PCA), CRISP-DM, Data Balancing, Algorithms


Pikula A, Howard BV, Seshadri S. Stroke and Diabetes. In: Cowie CC, Casagrande SS, Menke A, Cissell MA, Eberhardt MS, Meigs JB, Gregg EW, Knowler WC, Barrett-Connor E, Becker DJ, Brancati FL, Boyko EJ, Herman WH, Howard BV, Narayan KMV, Rewers M, Fradkin JE, editors. Diabetes in America. 3rd ed. Bethesda (MD): National Institute of Diabetes and Digestive and Kidney Diseases (US); 2018 Aug. CHAPTER 19. PMID: 33651535. Available:

World Health Organization (WHO), " Stroke, Cerebrovascular accident". Available:

K. Mridha, S. Ghimire, J. Shin, A. Aran, M. M. Uddin and M. F. Mridha, "Automated Stroke Prediction Using Machine Learning: An Explainable and Exploratory Study With a Web Application for Early Intervention," in IEEE Access, vol. 11, pp. 52288-52308, 2023, doi: 10.1109/ACCESS.2023.3278273.

Tazin, T., Alam, M. N., Dola, N. N., Bari, M. S., Bourouis, S., & Khan, M. M. (2021). Stroke disease detection and prediction using robust learning approaches. Journal of Healthcare Engineering, 2021, 7633381.

Ivanov, I. G., Kumchev, Y., & Hooper, V. J., An optimization precise model of stroke data to improve stroke prediction. Algorithms, 16(9), 417, 2023.

Gangavarapu Sailasya and Gorli L Aruna Kumari, “Analyzing the Performance of Stroke Prediction using ML Classification Algorithms” International Journal of Advanced Computer Science and Applications(IJACSA), 12(6), 2021.

Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C., & Wirth, R. (1999). CRISP-DM 1.0: Step-by-step data mining guide. Available:

Quantum, Data Science project management methodologies, Medium, 2019, August 20. Available:

Hotz, N., What is CRISPR-DM? Data Science Process Alliance, 10 September 2018. Available:

J. Ma, Y. Ding, J. C. P. Cheng, Y. Tan, V. J. L. Gan and J. Zhang, "Analyzing the Leading Causes of Traffic Fatalities Using XGBoost and Grid-Based Analysis: A City Management Perspective," in IEEE Access, vol. 7, pp. 148059-148072, 2019, doi: 10.1109/ACCESS.2019.2946401.

Elangovan, Viswa Priya Subramaniyam; Devarajan, Rajeswari; Khalaf, Osamah I.; Sharif, Mhd Saeed; and Elmedany, Wael (2024) "Analysing an imbalanced stroke prediction dataset using machine learning techniques," Karbala International Journal of Modern Science: Vol. 10: Iss. 2, Article 8. doi:

Association of Incident Stroke Risk With an IL-18-Centered Inflammatory Network Biomarker Composite Richard A. Martirosian, Crystal D. Wiedner, Jasmin Sanchez, Katherine T. Mun, Kiran Marla, Cristina Teran, Marissa Thirion, David S. Liebeskind, Emer R. McGrath, originally published May 2024. doi: 2024; 55:1601–1608




How to Cite

Shaheen, H. ., Abu Sara, M. R. ., Sabarna, K. ., & Arafeh, L. . (2024). AI-Based Cerebral Vascular Accident (CVA) Analysis and Prediction. Ahliya Journal of Allied Medico-Technology Science, 1(1), 15–19.




Most read articles by the same author(s)