Comparative Analysis of Machine Learning Algorithm Performance for Student Mental Health Classification

Authors

DOI:

https://doi.org/10.32664/smatika.v16i01.2080

Keywords:

Classification, Machine Learning, Mental Health, RapidMiner.

Abstract

Mental health issues among college students are a critical issue that requires data-driven approaches to detect early treatment needs. This study aims to analyze and compare the performance of three machine learning algorithms: Naive Bayes, K-Nearest Neighbor (K-NN), and Decision Tree in classifying college students' mental health treatment needs based on an open survey dataset. The study was conducted systematically using RapidMiner software, with data preprocessing, model training, testing, and performance evaluation using accuracy, precision, and recall metrics. The test results showed that the Naive Bayes algorithm produced an accuracy of 78.85%, a precision of 75.96%, and a recall of 72.84%. K-NN performed better with an accuracy of 82.62%, a precision of 80.83%, and a recall of 77.37%. Meanwhile, the Decision Tree algorithm performed best with an accuracy of 88.32%, a precision of 86.77%, and a recall of 85.80%. In addition to its high performance, Decision Tree also offers advantages in interpreting results through its decision tree structure, which illustrates the role of variables such as employment status (self_employed), family history (family_history), survey completion time (timestamp), and care options (care_options) in the classification process. Decision Tree can be concluded as the most effective classification model for detecting student mental health needs in this data context. These findings are expected to serve as a reference in the development of machine learning-based early detection systems to support mental health policies and interventions in higher education settings.

References

[1] S. Gill, B. Sharma, R. Phogat, dan A. Kirar, “Machine and Deep Learning-Based Prediction Modelling for Assessment of Stress, Anxiety, and Depression Among Students,” dalam Organizational Sociology in the Era of AI, IGI Global, 2025. doi: 10.4018/978-1-6684-9480-7.ch009.

[2] J. Homolak dkk., “A hacked kitchen scale-based system for quantification of grip strength in rodents,” Comput Biol Med, vol. 144, hlm. 105391, Mei 2022, doi: 10.1016/j.compbiomed.2022.105391.

[3] A. Hossen, R. U. Rafin, M. Mahtab, dan S. I. Khan, “A Machine Learning Approach to Assess Psychological State Among University Students Throughout the COVID-19 Pandemic: Bangladesh Perspective,” Asian Journal of Convergence Technology, 2024, [Daring]. Tersedia pada: http://asianssr.org/index.php/ajct/article/view/1356

[4] M. J. Hasan, A. Das, J. Matubber, S. H. Shifat, dan M. K. Morol, “Enhanced Classification of Anxiety, Depression, and Stress Levels: A Comparative Analysis of DASS21 Questionnaire Data Augmentation and Classification Algorithms,” dalam Proceedings of the 3rd International Conference on Computing Advancements, New York, NY, USA: ACM, Okt 2024, hlm. 435–442. doi: 10.1145/3723178.3723236.

[5] M. Salahuddin dan S. A. Siddiqui, “Classification of Mental Health Status Using Supervised Machine Learning Techniques with DASS-21,” Artif Intell Rev, 2023, doi: 10.1007/s10462-023-10561-2.

[6] T. Debbarma dan S. Sahu, “Mental health prediction among students using machine learning techniques,” dalam Frontiers of Intelligent Computing: Theory and Applications, Springer, 2022. [Daring]. Tersedia pada: https://link.springer.com/chapter/10.1007/978-981-19-7513-4_46

[7] P. P. Shinde, V. P. Desai, dan K. S. Oza, “A Data Driven Mental Health Analysis Using Machine Learning Techniques,” dalam International Conference on Intelligent Systems and IoT, IEEE, 2024. [Daring]. Tersedia pada: https://ieeexplore.ieee.org/document/10696584

[8] I. K. A. Wiraguna, E. Setyati, dan E. Pramana, “Prediksi Anak Stunting Berdasarkan Kondisi Orang Tua Dengan Metode Support Vector Machine Dengan Study Kasus Di Kabupaten Tabanan-Bali,” SMATIKA: STIKI Informatika Jurnal, vol. 11, no. 2, hlm. 123–132, 2022, [Daring]. Tersedia pada: https://jurnal.stiki.ac.id/SMATIKA/article/view/662

[9] H. Darwis, R. Puspitasari, dan D. Widyawati, “Comparative Analysis of Anxiety Disorder Classification Using Algorithm Naïve Bayes, Decision Tree and K-NN,” dalam 2025 IEEE International Conference on Management and Digital Systems, IEEE, 2025. [Daring]. Tersedia pada: https://ieeexplore.ieee.org/document/10857485

[10] S. Aulia dan H. Wibowo, “Implementasi Data Cleaning dan Transformasi untuk Proses Data Mining pada Dataset Kesehatan,” Jurnal Informatika: Jurnal Pengembangan IT, vol. 7, no. 3, hlm. 111–118, 2022, [Daring]. Tersedia pada: https://jurnal.unej.ac.id/index.php/J-PTIIK/article/view/32491

[11] M. D. D. A. Putra, A. Widodo, dan M. Rizki, “Implementasi Preprocessing Data dalam Analisis Data Sekunder untuk Prediksi Keberhasilan Studi Mahasiswa,” Jurnal Teknologi dan Sistem Komputer, vol. 9, no. 2, hlm. 245–251, 2021, [Daring]. Tersedia pada: https://jtsiskom.undip.ac.id/index.php/jtsiskom/article/view/2541

[12] R. Fauzan dan A. Mulyana, “Perbandingan Teknik Feature Selection pada Proses Preprocessing Data dalam Klasifikasi Sentimen Media Sosial,” Jurnal RESTI, vol. 5, no. 1, hlm. 101–108, 2021, [Daring]. Tersedia pada: https://ejurnal.itenas.ac.id/index.php/resti/article/view/1361

[13] M. Harahap dan R. A. Siregar, “Implementasi Algoritma Naive Bayes untuk Deteksi Emosi Mahasiswa Berdasarkan Ulasan Daring,” Jurnal Teknologi dan Sistem Komputer, vol. 10, no. 1, hlm. 77–84, 2022, [Daring]. Tersedia pada: https://jtsiskom.undip.ac.id/index.php/jtsiskom/article/view/3027

[14] D. Mulyadi dan H. Siregar, “Model Klasifikasi Kesehatan Mental Remaja Berdasarkan Data Media Sosial Menggunakan K-NN,” Jurnal Sistem Informasi, vol. 15, no. 1, hlm. 50–57, 2023, [Daring]. Tersedia pada: https://jurnal.unmer.ac.id/index.php/jsi/article/view/4902

[15] A. Kurniawan dan H. Prasetyo, “Penggunaan Decision Tree untuk Klasifikasi Masalah Kesehatan Mental Mahasiswa dari Formulir Konseling,” Jurnal Informatika Mulawarman, vol. 16, no. 2, hlm. 85–92, 2021, [Daring]. Tersedia pada: https://journal.unmul.ac.id/index.php/JIM/article/view/2641

[16] A. D. S. Depari, C. C. Kirana, dan C. N. Oktariana, “Prediksi Risiko Diabetes dengan Metode Naive Bayes: Identifikasi Faktor Risiko Utama dan Evaluasi Akurasi Model,” JATI: Jurnal Aplikasi Teknik dan Informasi, 2025, [Daring]. Tersedia pada: https://www.ejournal.itn.ac.id/index.php/jati/article/view/14078

[17] N. B. Binna, T. Rohana, dan H. Y. Novita, “Klasifikasi Jenis Buah Tomat Menggunakan Algoritma K-Nearest Neighbor dan Support Vector Machine,” JINTEKS: Jurnal Informatika Teknologi dan Sains, 2025, [Daring]. Tersedia pada: https://www.jurnal.uts.ac.id/index.php/JINTEKS/article/view/5743

Published

2026-03-13