Paper
20 June 2023 Interpretable prediction of heart disease based on random forest and SHAP
Lin Wu
Author Affiliations +
Proceedings Volume 12715, Eighth International Conference on Electronic Technology and Information Science (ICETIS 2023); 127151F (2023) https://doi.org/10.1117/12.2682322
Event: Eighth International Conference on Electronic Technology and Information Science (ICETIS 2023), 2023, Dalian, China
Abstract
In order to improve the accuracy of heart disease prediction models and address the lack of interpretability in traditional machine learning models, this paper proposes a heart disease prediction method based on random forests and SHAP value. This method first preprocesses the dataset by encoding the data, filling in missing values, and removing outliers. It then uses recursive feature elimination and cross-validation to remove irrelevant features and select relevant features for further model training. The results, compared with other methods using accuracy, precision, recall, and F1 score, show that the proposed method outperforms other models. The interpretable model constructed based on SHAP value reflects the effect of feature values on prediction model results and provides a ranking of feature importance. The experimental results show that the method can effectively improve the accuracy of heart disease prediction, and provide a clear interpretation of the model prediction results. It can be an aid in the treatment and prevention of heart disease.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Lin Wu "Interpretable prediction of heart disease based on random forest and SHAP", Proc. SPIE 12715, Eighth International Conference on Electronic Technology and Information Science (ICETIS 2023), 127151F (20 June 2023); https://doi.org/10.1117/12.2682322
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Cardiovascular disorders

Random forests

Data modeling

Feature selection

Machine learning

RELATED CONTENT


Back to Top