Hybrid PSO Feature Selection Correlation and Support Vector Machine Model for Heart Disease Detection
DOI:
https://doi.org/10.32664/j-intech.v14i01.2251Keywords:
correlation, Heart, machine learning, particle swarm optimization, Support vector machineAbstract
Heart disease remains a major health problem worldwide. The World Health Organization (WHO) reports that in 2022, approximately 19.8 million people died from heart disease, highlighting the need for the implementation of an appropriate early detection model. This study proposes a hybrid SVM–PSO model with correlation-based feature selection, duplicate data handling, and a multi-metric fitness function to enhance classification performance. PSO is employed to optimize the C parameter and RBF kernel of SVM, producing a more robust and balanced model compared to existing approaches. This study uses a heart disease dataset consisting of 1,025 rows with 13 attributes and 1 target variable obtained from the Kaggle repository and republished on the Zenodo platform in 2024. The research stages include Pre-Processing, Standardization, Feature Selection based on Correlation, and evaluation using the 10-Fold Cross Validation technique with Accuracy, precision, recall, and F1-score metrics. The results show that Support Vector Machine (SVM) achieved an Accuracy of 82.80%, Precision of 79.31%, Recall of 91.70%, and an F1-score of 84.88%. After optimization using PSO, the performance improved to an accuracy of 84.46%, precision of 80.54%, recall of 92.72%, and an F1-score of 86.04%. The experimental results indicate performance improvements of 2.00% in accuracy, 1.55% in precision, 1.11% in recall, and 1.37% in F1-score after PSO optimization. These results prove that the applied hybrid approach successfully improved the ability to detect heart disease. Therefore, this study contributes by demonstrating that PSO-based hyperparameter optimization can effectively enhance SVM classification performance for heart disease detection. The proposed model also has practical implications as a decision support tool for early heart disease detection that can assist medical practitioners in improving diagnostic accuracy and supporting preventive treatment strategies.
References
[1] WHO, “Cardiovascular diseases (CVDs),” Sep. 14, 2025, World Health Organization.
[2] Irfan Sazali Nasution, Arini Dwi Rahmadani, W. Audina, D. P. Sari, and N. D. Sari, “Systematic Review: Pengaruh Gaya Hidup dan Pengetahuan Masyarakat terhadap Risiko Penyakit Jantung Koroner,” Sehat Rakyat: Jurnal Kesehatan Masyarakat, vol. 4, no. 2, pp. 287–298, 2025, doi: 10.54259/sehatrakyat.v4i2.4337.
[3] D. Saraswati, “Inovasi Pelayanan Kesehatan : Deteksi Dini Penyakit Jantung Koroner melalui Posbindu PTM,” Kesehatan dan Kebidanan Nusantara, vol. 2, pp. 10–16, 2024, doi: 10.69688/jkn.v2i1.81.
[4] M. Mirbabaie, S. Stieglitz, and N. R. J. Frick, “Artificial intelligence in disease diagnostics: A critical review and classification on the current state of research guiding future direction,” Health and Technology, vol. 11, no. 4, pp. 693–731, 2021, doi: 10.1007/s12553-021-00555-5.
[5] J. E. Black, J. K. Kueper, and T. S. Williamson, “An introduction to machine learning for classification and prediction,” Family Practice, vol. 40, no. 1, pp. 200–204, 2023, doi: 10.1093/fampra/cmac104.
[6] F. Reynaldi Valerian, M. Syarief, and D. Abdul Fatah, “Klasifikasi Tingkat Obesitas Menggunakan Metode Gbm Dan Confusion Matrix,” JATI (Jurnal Mahasiswa Teknik Informatika), vol. 9, no. 2, pp. 2242–2249, 2025, doi: 10.36040/jati.v9i2.13062.
[7] I. Akbar, F. Supriadi, and D. Indra Junaedi, “Pemanfaatan Machine Learning Di Bidang Kesehatan,” JATI (Jurnal Mahasiswa Teknik Informatika), vol. 9, no. 1, pp. 1744–1749, 2025, doi: 10.36040/jati.v9i1.12663.
[8] S. Sharma, G. Singh, and P. Negi, “Heart disease Prediction Using Data Mining Techniques,” 2024 IEEE 1st Karachi Section Humanitarian Technology Conference, Khi-HTC 2024, vol. 10, no. 02, pp. 281–286, 2024, doi: 10.1109/KHI-HTC60760.2024.10482141.
[9] Y. Rimal, N. Sharma, S. Paudel, A. Alsadoon, M. P. Koirala, and S. Gill, “Comparative analysis of heart disease prediction using logistic regression, SVM, KNN, and random forest with cross-validation for improved accuracy,” Scientific Reports, vol. 15, no. 1, pp. 1–14, 2025, doi: 10.1038/s41598-025-93675-1.
[10] L. N. Farida and S. Bahri, “Klasifikasi Gagal Jantung menggunakan Metode SVM (Support Vector Machine),” Komputika : Jurnal Sistem Komputer, vol. 13, no. 2, pp. 149–156, 2024, doi: 10.34010/komputika.v13i2.11330.
[11] D. H. Depari, Y. Widiastiwi, and M. M. Santoni, “Perbandingan Model Decision Tree, Naive Bayes dan Random Forest untuk Prediksi Klasifikasi Penyakit Jantung,” Informatik : Jurnal Ilmu Komputer, vol. 18, no. 3, p. 239, 2022, doi: 10.52958/iftk.v18i3.4694.
[12] R. Resmiati and T. Arifin, “Klasifikasi Pasien Kanker Payudara Menggunakan Metode Support Vector Machine dengan Backward Elimination,” Jurnal Sistem Informasi, vol. 10, no. 2, pp. 381–393, 2021, doi: 10.32520/stmsi.v10i2.1238.
[13] M. F. R. Angga Aditya Permana, “Implementasi Metode K-Nearest Neighbors (KNN) untuk Klasifikasi Penyakit Jantung,” G-Tech : Jurnal Teknologi Terapan, vol. 8, no. 1, pp. 186–195, 2024, doi: 10.33379/gtech.v8i3.4495.
[14] R. Hidayat, Y. S. Sy, T. Sujana, M. Husnah, and H. T. Saputra, “Implementation of Machine Learning for Heart Disease Prediction Using Support Vector Machine Algorithm,” Teknologi Informasi dan Rekayasa Komputer, vol. 5, no. 2, pp. 161–168, 2024, doi: 10.37148/bios.v5i2.152.
[15] M. Hasan, M. A. Sahid, M. P. Uddin, M. A. Marjan, S. Kadry, and J. Kim, “Performance discrepancy mitigation in heart disease prediction for multisensory inter-datasets,” PeerJ Computer Science, vol. 10, pp. 1–51, 2024, doi: 10.7717/peerj-cs.1917.
[16] D. Irawan, E. B. Perkasa, Y. Yurindra, D. Wahyuningsih, and E. Helmud, “Perbandingan Klassifikasi SMS Berbasis Support Vector Machine, Naive Bayes Classifier, Random Forest dan Bagging Classifier,” Jurnal Sisfokom (Sistem Informasi dan Komputer), vol. 10, no. 3, pp. 432–437, 2021, doi: 10.32736/sisfokom.v10i3.1302.
[17] A. Arifuddin, G. S. Buana, R. A. Vinarti, and A. Djunaidy, “Performance Comparison of Decision Tree and Support Vector Machine Algorithms for Heart Failure Prediction,” Procedia Computer Science, vol. 234, pp. 628–636, 2024, doi: 10.1016/j.procs.2024.03.048.
[18] A. Ashwini, V. Chirchi, S. Balasubramaniam, and M. A. Shah, “Bio inspired optimization techniques for disease detection in deep learning systems,” Scientific Reports, vol. 15, no. 1, pp. 1–30, 2025, doi: 10.1038/s41598-025-02846-7.
[19] H. Setiani, A. Sunyoto, and A. Nasiri, “Metode Naïve Bayes dan Particle Swarm Optimization untuk Klasifikasi Penyakit Jantung,” Explore, vol. 12, no. 2, p. 6, 2022, doi: 10.35200/explore.v12i2.566.
[20] M. G. El-Shafiey, A. Hagag, E. S. A. El-Dahshan, and M. A. Ismail, “A hybrid GA and PSO optimized approach for heart-disease prediction based on random forest,” Multimedia Tools and Applications, vol. 81, no. 13, pp. 18155–18179, 2022, doi: 10.1007/s11042-022-12425-x.
[21] A. Al Bataineh and S. Manacek, “MLP-PSO Hybrid Algorithm for Heart Disease Prediction,” Journal of Personalized Medicine, vol. 12, no. 8, 2022, doi: 10.3390/jpm12081208.
[22] A. Agung, G. Agung, and I. M. Widiartha, “Optimasi Metode Support Vector Machine ( SVM ) Mengunakan Particle Swarm Optimization pada Permasalahan Klasifikasi Diabetes,” JNATIA, vol. 3, pp. 879–888, 2025, doi: 10.24843/JNATIA.2025.v03.i04.p18.
[23] R. Hidayat, “Dataset Heart Disease,” 2024, Zenodo. doi: 10.5281/zenodo.13208473.
[24] P. Martins, F. Cardoso, P. Váz, J. Silva, and M. Abbasi, “Performance and Scalability of Data Cleaning and Preprocessing Tools: A Benchmark on Large Real-World Datasets,” Data, vol. 10, no. 5, pp. 1–22, 2025, doi: 10.3390/data10050068.
[25] C. Fan, M. Chen, X. Wang, J. Wang, and B. Huang, “A Review on Data Preprocessing Techniques Toward Ef fi cient and Reliable Knowledge Discovery From Building Operational Data,” vol. 9, no. March, pp. 1–17, 2021, doi: 10.3389/fenrg.2021.652801.
[26] M. Anita et al., “Klasifikasi Faktor Risiko Penyakit Jantung Menggunakan Machine Learning,” Teknologi Informasi, vol. 16, pp. 68–78, 2025, doi: 10.52972/hoaq.vol16no1.
[27] P. A. Mahasiswa, “Optimasi correlation-based feature selection untuk perbaikan akurasi random forest classifier dalam prediksi performa akademik mahasiswa,” Informatika dan Komputer, vol. 6, no. 2, pp. 251–260, 2022, doi: 10.26798/jiko.v6i2.651.
[28] E. S. Alomari et al., “Malware Detection Using Deep Learning and Correlation-Based Feature Selection,” Symmetry, vol. 15, no. 1, pp. 1–21, 2023, doi: 10.3390/sym15010123.
[29] I. K. Nti, “Performance of Machine Learning Algorithms with Different K Values in K-fold Cross- Validation,” I.J. Information Technology and Compuer Science, vol. 6, pp. 61–71, 2021, doi: 10.5815/ijitcs.2021.06.05.
[30] R. Syaputra, T. A. Y. Siswa, and W. J. Pranoto, “Model Optimasi SVM Dengan PSO-GA dan SMOTE Dalam Menangani High Dimensional dan Imbalance Data Banjir,” Teknika, vol. 13, no. 2, pp. 273–282, 2024, doi: 10.34148/teknika.v13i2.876.
[31] D. D. Saputra et al., “Analisis Sentimen Terhadap Twitter Direktorat Jenderal Bea dan Cukai Menggunakan komparasi Algoritma Naïve Bayes dan Support Vector Machine,” J-INTECH (Journal of Information and Technology), no. 204, pp. 285–296, 2024, doi: 10.32664/j-intech.v12i02.1274.
[32] T. M. Shami, A. A. El-saleh, and S. Member, “Particle Swarm Optimization : A Comprehensive Survey,” Access, vol. 10, pp. 10031–10061, 2022, doi: 10.1109/ACCESS.2022.3142859.
[33] I. Taufiq, T. A. Y. Siswa, and W. J. Pranoto, “Model Optimasi Random Forest dengan PSO-CHI-SM dalam Mengatasi High Dimensional dan Imbalanced Data Banjir Kota Samarinda,” Jurnal Teknologi Sistem Informasi dan Aplikasi, vol. 7, no. 3, pp. 1267–1279, 2024, doi: 10.32493/jtsi.v7i3.41632.
[34] A. Damuri, U. Riyanto, H. Rusdianto, and M. Aminudin, “Implementasi Data Mining dengan Algoritma Naïve Bayes Untuk Klasifikasi Kelayakan Penerima Bantuan Sembako,” JURIKOM (Jurnal Riset Komputer), vol. 8, no. 6, p. 219, 2021, doi: 10.30865/jurikom.v8i6.3655.
[35] L. Techniques, “Enhancing Heart Disease Prediction Accuracy through Machine Learning Techniques and Optimization,” Processe, vol. 11, no. 4, p. 1210, 2023, doi: 10.3390/pr11041210.
[36] J. C. Obi, “A Comparative Study of Several Classification Metrics and Their Performances on Data,” WJAETS, vol. 08, no. 01, pp. 308–314, 2023, doi: 10.30574/wjaets.2023.8.1.0054.
[37] A. Churcher et al., “An experimental analysis of attack classification using machine learning in IoT networks,” Sensors (Switzerland), vol. 21, no. 2, pp. 1–32, 2021, doi: 10.3390/s21020446.
[38] Y. Ayuningtyas and I. M. Suartana, “Klasifikasi Penyakit Stroke Menggunakan Support Vector Machine ( SVM ) dan Particle Swarm Optimization ( PSO ),” Informatics and Computer Science, vol. 04, no. 2022, pp. 452–457, 2023, doi: 10.26740/jinacs.v4n04.p451-457.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 J-INTECH

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

