Predictive Model of cardiovascular diseases based on Bayesian Networks

Authors

DOI:

https://doi.org/10.31637/epsir-2024-1804

Keywords:

bayesian networks, cardiovascular disease, prevention, predictive models, BSEJ, KDB, naive Bayes, machine learning

Abstract

Introduction: This study presents an analysis and comparison of several Bayesian network models for cardiovascular disease prediction, using clinical data to identify the most effective models. Methodology: The algorithms evaluated include Naive Bayes, TAN_cl, TAN_hcsp, FSSJ, BSEJ and KDB, which were trained and validated to measure their performance. A clinical dataset of patients, combining five public databases, was used to evaluate performance. The metrics used were accuracy, sensitivity, specificity, F1 Score and also a cross validation to ensure consistency of results. Results: The BSEJ model presented the best performance in all the metrics evaluated, standing out for its ability to eliminate irrelevant dependencies, thus maintaining an optimal balance between complexity and accuracy. Discussions: Bayesian networks offer a powerful tool for CVD prediction, as they not only provide accurate predictions, but also facilitate the interpretation of relationships between risk factors, which is crucial in the clinical setting. Conclusions: Bayesian networks, and in particular the BSEJ and KDB models, stand out for their effectiveness in predicting CVD, providing informed clinical decision support.

Downloads

Download data is not yet available.

Author Biographies

Dayron Rumbaut Rangel, Universidad Bolivariana del Ecuador

PhD candidate in Applied Computer Science, Master’s in Artificial Intelligence, and Master’s in Educational Technology and Innovation. With a strong professional background in teaching and research at various educational institutions, he currently serves as a lecturer and researcher at the Bolivarian University of Ecuador, teaching undergraduate and postgraduate programs. His main professional strengths lie in the fields of artificial intelligence, learning analytics, and educational technology. Recipient of the 2023 METARED TIC International Honor Mention for the SMART EDUCATION TREE project. He has expertise in managing virtual platforms and training trainers, as well as publishing in academic journals. He also coordinates research projects and supervises master’s theses.

Milton Rafael Maridueña Arroyave, Universidad Bolivariana del Ecuador

Computer Engineer, Master’s in University Teaching and Educational Research, Master’s in Mathematical Research, Master’s in Innovation and Digital Transformation, PhD in Pedagogical Sciences, and PhD in Technical Sciences. Accredited researcher by SENESCYT. Faculty member of the Faculty of Mathematical and Physical Sciences at the University of Guayaquil (UG), the Faculty of Natural and Mathematical Sciences at ESPOL, and the Bolivarian University Institute of Technology (ITB). Postgraduate professor at UTEG-UPSE-UNEMI. Author of books and articles published in indexed journals. He has served as General Director of Scientific Research at UG and as Rector of the “Vicente Rocafuerte” Higher Technological Institute and the “Ana Paredes de Alfaro” Higher Technological Institute. Currently, he is a Peer Reviewer and Academic Advisor for CACES. 

References

Bravo, F. P., Del Barrio García, A. A., Gago Veiga, A. B., Gallego de la Sacristana, M. M., Pinero, M. R., Peral, A. G., Dzeroski, S. y Ayala, J. L. (2019). SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments. IEEE Access, 7, 92598–92614. https://doi.org/10.1109/ACCESS.2019.2927429 DOI: https://doi.org/10.1109/ACCESS.2019.2927429

Bravo, F. P., García, A. A. D. B., Russo, L. M. S. y Ayala, J. L. (2020). SOFIA: Selection of Medical Features by Induced Alterations in Numeric Labels. Electronics, 9(9), 1492. https://doi.org/10.3390/electronics9091492 DOI: https://doi.org/10.3390/electronics9091492

Cardiovascular diseases (CVDs). (n.d.). Cardiovascular diseases. https://lc.cx/nFw6WA

Chicco, D. y Jurman, G. (2020). The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genomics, 21(1), 6. https://doi.org/10.1186/s12864-019-6413-7 DOI: https://doi.org/10.1186/s12864-019-6413-7

Ecuador acumula pacientes con enfermedades cardiovasculares | CEAP :: Centro de Estudios Asia-Pacífico (n.d.). https://lc.cx/x5lAR1

Enfermedades cardiovasculares - OPS/OMS | Organización Panamericana de la Salud. (n.d.). https://www.paho.org/es/temas/enfermedades-cardiovasculares

Heart Failure Prediction Dataset (n.d.). https://lc.cx/D2AHi-

Jahirul, M. (2010). Investigating the Performance of Naive- Bayes Classifiers and K- Nearest Neighbor Classifiers. International Conference on Convergence Information Technology (ICCIT 2007), 1541-1546. https://doi.org/10.1109/ICCIT.2007.148 DOI: https://doi.org/10.1109/ICCIT.2007.148

Kumar, S. y Sahoo, G. (2015). Classification of Heart Disease Using Naïve Bayes and Genetic Algorithm (pp. 269-282). https://doi.org/10.1007/978-81-322-2208-8_25 DOI: https://doi.org/10.1007/978-81-322-2208-8_25

La Carga de Enfermedades Cardiovasculares - OPS/OMS | Organización Panamericana de la Salud. (n.d.). https://www.paho.org/es/enlace/carga-enfermedades-cardiovasculares

Madden, M. G. (2009). On the classification performance of TAN and general Bayesian networks. Knowledge-Based Systems, 22(7), 489-495. https://doi.org/10.1016/j.knosys.2008.10.006 DOI: https://doi.org/10.1016/j.knosys.2008.10.006

Moscatelli, M., Manconi, A., Pessina, M., Fellegara, G., Rampoldi, S., Milanesi, L., Casasco, A. y Gnocchi, M. (2018). An infrastructure for precision medicine through analysis of big data. BMC Bioinformatics, 19(S10), 351. https://doi.org/10.1186/s12859-018-2300-5 DOI: https://doi.org/10.1186/s12859-018-2300-5

Nagavelli, U., Samanta, D. y Thomas, B. (2023). ML-Based Prediction Model for Cardiovascular Disease (pp. 91-98). https://doi.org/10.1007/978-981-19-4052-1_11 DOI: https://doi.org/10.1007/978-981-19-4052-1_11

Tovey, C. A. (1985). Hill Climbing with Multiple Local Optima. SIAM Journal on Algebraic Discrete Methods, 6(3), 384-393. https://doi.org/10.1137/0606040 DOI: https://doi.org/10.1137/0606040

Wang, X., Wang, F., Hu, J. y Sorrentino, R. (2014). Exploring joint disease risk prediction, 1180-1187.

Zhong, C., Pedrycz, W., Wang, D., Li, L. y Li, Z. (2016). Granular data imputation: A framework of Granular Computing. Applied Soft Computing, 46, 307-316. https://doi.org/10.1016/j.asoc.2016.05.006 DOI: https://doi.org/10.1016/j.asoc.2016.05.006

Published

2024-12-27

How to Cite

Rumbaut Rangel, D., & Maridueña Arroyave, M. R. (2024). Predictive Model of cardiovascular diseases based on Bayesian Networks. European Public & Social Innovation Review, 9, 1–22. https://doi.org/10.31637/epsir-2024-1804

Issue

Section

INNOVATING IN HEALTH AND HEALTHCARE