| Issue |
ITM Web Conf.
Volume 84, 2026
2026 International Conference on Advent Trends in Computational Intelligence and Data Science (ATCIDS 2026)
|
|
|---|---|---|
| Article Number | 01007 | |
| Number of page(s) | 11 | |
| Section | Intelligent Computing in Healthcare and Bioinformatics | |
| DOI | https://doi.org/10.1051/itmconf/20268401007 | |
| Published online | 06 April 2026 | |
Interpretable Ensemble Learning for Cardiovascular Disease Prediction: A Large-Scale SHAP-Based Analysis
School of Computer Science and Technology, Tongji University, Shanghai 200092, China
* Corresponding author’s email: This email address is being protected from spambots. You need JavaScript enabled to view it.
Abstract
Cardiovascular disease is now the leading cause of death globally; however, there remains a significant barrier to the use of machine learning models to predict cardiovascular disease due to their lack of generalizability from smaller databases. The aim of this study was to develop an interpretable ensemble learning model based on a large database of over 70,000 patients. Specifically, this research created three advanced Gradient Boosting Models and enhanced these by Synthetic Minority Over-sampling Technique (SMOTE) for improving the data balance as well as Bayesian Hyperparameter Tuning for optimizing the models. The top performing Voting Ensemble had a test accuracy of 73.61% and an Area Under the Curve of 0.8022. SHAP results demonstrated that systolic blood pressure was the most important feature and that meaningful clinical thresholds existed at 120 mmHg for blood pressure and 48 years for age that aligned with established medical standards. Although the model’s accuracy was somewhat lower than the benchmark of 78.42%, the increased interpretability of the model makes it a promising tool for clinical decision support. Therefore, this research demonstrates that by combining powerful ensemble methods with strong explanations for decisions, more trustworthy AI systems in the field of cardiovascular medicine can be created.
© The Authors, published by EDP Sciences, 2026
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.

