FedCVD: Towards a Scalable, Privacy-Preserving Federated Learning Model for Cardiovascular Diseases Prediction

Funding Sponsor

Carnegie Mellon University

Third Author's Department

Computer Science & Engineering Department

Find in your Library

https://doi.org/10.1145/3647750.3647752

All Authors

Abdelrhman Gaber, Hassan Abdeltwab, Tamer Elbatt

Document Type

Research Article

Publication Title

ACM International Conference Proceeding Series

Publication Date

1-26-2024

doi

10.1145/3647750.3647752

Abstract

This paper presents FedCVD, a federated learning model designed for predicting cardiovascular disease (CVD) by employing logistic regression and Support Vector Machine (SVM) algorithms. FedCVD utilizes the privacy and scalability advantages offered by federated learning to facilitate collaborative model training using decentralized patient data, ensuring confidentiality. To evaluate the effectiveness of the proposed model, experiments were conducted using the 10-year risk of coronary heart disease Kaggle dataset. To address data imbalance challenges, three techniques - Random Over Sampling, Random Under Sampling, and Synthetic Minority Oversampling Technique (SMOTE) - were explored. The study demonstrates promising performance,For the federated logistic regression with SMOTE achieving an AUC value of 0.7048. Comparative analysis with a centralized logistic regression model shows competitive results, with an AUC value of 0.7081 using Random Over Sampling. For the federated SVM model, an AUC value of 0.7340 is achieved using Random Under Sampling. In comparison, a centralized machine learning approach utilizing SVM and Random Over Sampling achieves an AUC value of 0.6962. These findings highlight the effectiveness of the proposed federated learning approach, surpassing the performance of centralized machine learning models for CVD prediction.

First Page

7

Last Page

11

Comments

Conference Paper. Record derived from SCOPUS.

Share

COinS