Implementation of Web Scraping and Data Mining for Performance Evaluation of PT Ceria Multimedia Services

Authors

  • Ravel Yanuartha Universitas Teknologi Digital Indonesia Yogyakarta
  • Dini Fakta Sari Universitas Teknologi Digital Indonesia Yogyakarta

DOI:

https://doi.org/10.32664/j-intech.v14i01.2192

Keywords:

Application, Information, Software, System

Abstract

This research aims to implement web scraping techniques to collect testimonial data from the CeriaMultimedia website and perform sentiment analysis to evaluate service quality. The collected data consists of a limited number of testimonials, which are then processed through text preprocessing stages including case folding, tokenizing, filtering, and stemming. The sentiment classification process is conducted using machine learning methods based on TF-IDF weighting and classification algorithms. Due to the limited dataset, the analysis results are used primarily to demonstrate the implementation process rather than to draw generalized conclusions. The results show that the sentiment categories obtained include positive, negative, and neutral sentiments, although not all categories consistently appear in the testing phase. This research highlights the effectiveness of web scraping and text processing techniques while also indicating the need for a larger dataset to improve evaluation accuracy in future studies.

References

[1] A. K. Qorita and F. Rahma, “Analisis Sentimen Berdasarkan Aspek pada Tempat Wisata di Daerah Istimewa Yogyakarta,” AUTOMATA, vol. 3, no. 1, 2022, [Online]. Available: https://journal.uii.ac.id/AUTOMATA/article/view/21906

[2] P. Pandunata, C. K. Ananta, and Y. Nurdiansyah, “Analisis Sentimen Opini Publik Terhadap Pekan Olahraga Nasional Pada Instagram Menggunakan Metode Naïve Bayes Classifier,” INFORMAL Informatics Journal, vol. 7, no. 2, pp. 146–156, 2022, doi: 10.19184/isj.v7i2.33928.

[3] R. Sulastiyono, A. Setiawan, and S. Nugroho, “Sentimen Analisis Pembatalan Indonesia Menjadi Tuan Rumah Piala Dunia U-20 Menggunakan Metode Naïve Bayes,” Journal of Information System Research, vol. 4, no. 4, pp. 1387–1394, 2023, doi: 10.47065/josh.v4i4.3737.

[4] F. P. Herlambang and D. Avianto, “Analisis Sentimen Opini Pengguna Twitter Terhadap Tragedi Kanjuruhan Malang dengan Metode Support Vector Machine,” Jurnal Media Informatika Budidarma, vol. 7, no. 4, pp. 1727–1739, 2023, doi: 10.30865/mib.v7i4.6332.

[5] M. Kholilullah, M. Martanto, and U. Hayati, “Analisis Sentimen Pengguna Twitter (X) Tentang Piala Dunia Usia 17 Menggunakan Metode Naive Bayes,” JATI, vol. 8, no. 1, pp. 392–398, 2024, doi: 10.36040/jati.v8i1.8378.

[6] R. Rasiban and S. Riyadi, “Analisis Sentimen Opini Masyarakat Terhadap Stadion Jakarta International Stadium (JIS) Pada Twitter Dengan Perbandingan Metode Naive Bayes dan Support Vector Machine,” Jurnal Sains dan Teknologi, vol. 5, no. 3, pp. 1010–1017, 2024, doi: 10.55338/saintek.v5i3.2790.

[7] C. A. Cholik, “Perkembangan Teknologi Informasi Komunikasi (ICT) dalam Berbagai Bidang,” Jurnal Fakultas Teknik UNISA Kuningan, vol. 2, no. 2, pp. 39–46, 2021.

[8] C. A. Cholik, “Perkembangan Teknologi Informasi Komunikasi (ICT) dalam Berbagai Bidang,” Jurnal Fakultas Teknik UNISA Kuningan, vol. 2, no. 2, pp. 39–46, 2021.

[9] A. Purwansyah, A. Afriyudi, and S. Suyanto, “Perancangan dan Implementasi Sistem Informasi Pelaporan Masyarakat untuk Kerusakan Jalan di Palembang Menggunakan Google Maps API,” Jurnal Nasional Ilmu Komputer, vol. 1, no. 4, pp. 175–182, 2020, doi: 10.47747/jurnalnik.v1i4.164.

[10] S. Khairunnisa, “Pengaruh Text Preprocessing terhadap Analisis Sentimen Komentar Twitter,” Jurnal Media Informatika Budidarma, vol. 5, no. 2, pp. 406–414, 2021.

[11] K. M. Mahendra, M. H. Murdiansyah, D. T. Lhaksmana, “Analisis Sentimen Tweet COVID-19 Menggunakan K-Nearest Neighbors dengan TF-IDF dan CountVectorizer,” DIKE: Jurnal Ilmu Multidisiplin, vol. 1, no. 2, pp. 37–43, 2023.

[12] A. K. Qorita and F. Rahma, “Analisis Sentimen Berdasarkan Aspek pada Tempat Wisata di Daerah Istimewa Yogyakarta,” AUTOMATA, vol. 3, no. 1, 2022.

[13] A. Pebdika, R. Herdiana, and D. Solihudin, “Klasifikasi Menggunakan Metode Naive Bayes untuk Menentukan Calon Penerima PIP,” JATI, vol. 7, no. 1, pp. 452–458, 2023.

[14] F. P. Herlambang and D. Avianto, “Analisis Sentimen Opini Pengguna Twitter Terhadap Tragedi Kanjuruhan Malang dengan Metode Support Vector Machine,” Jurnal Media Informatika Budidarma, vol. 7, no. 4, pp. 1727–1739, 2023.

[15] A. H. Ayatullah, “Analisis Sentimen Penilaian Masyarakat terhadap Pelayanan Rumah Sakit Muhammadiyah Lamongan Menggunakan TF-IDF dan Naive Bayes,” Jurnal Informatika Medis, vol. 2, no. 1, pp. 27–23, 2024.

[16] K. M. Mahendra, M. H. Murdiansyah, and D. T. Lhaksmana, “Analisis Sentimen Tweet COVID-19 Menggunakan K-Nearest Neighbors dengan TF-IDF dan CountVectorizer,” DIKE: Jurnal Ilmu Multidisiplin, vol. 1, no. 2, pp. 37–43, 2023, doi: 10.69688/dike.v1i2.35.

[17] A. H. Ayatullah, “Analisis Sentimen Penilaian Masyarakat terhadap Pelayanan Rumah Sakit Muhammadiyah Lamongan Menggunakan TF-IDF dan Naive Bayes,” Jurnal Informatika Medis, vol. 2, no. 1, pp. 27–33, 2024, doi: 10.52060/im.v2i1.2198.

[18] T. I. Alfawas, “Penerapan TF-IDF untuk Analisis Sentimen Ulasan Game Bus Simulator Indonesia,” Innovative Journal of Social Science Research, vol. 4, no. 5, pp. 3177–3193, 2024.

[19] A. Munawaroh, “Sentiment Analysis dengan Naive Bayes terhadap Risiko Pembangunan IKN,” JATI, 2024, doi: 10.36040/jati.v8i1.8454.

[20] E. Suryati, Styawati, and A. A. Aldino, “Analisis Sentimen Transportasi Online Menggunakan Word2Vec dan SVM,” Jurnal Teknologi dan Sistem Informasi, vol. 4, no. 1, pp. 96–106, 2023.

[21] N. P. Husain, “Analisis Sentimen Ulasan TikTok Berbasis TF-IDF dan SVM,” JSCE, vol. 5, no. 1, pp. 91–102, 2024, doi: 10.61628/jsce.v5i1.1105.

[22] H. C. Husada and A. S. Paramita, “Analisis Sentimen Maskapai Penerbangan di Twitter Menggunakan SVM,” Teknika, vol. 10, no. 1, pp. 18–26, 2021, doi: 10.34148/teknika.v10i1.311.

[23] H. Apriyani and K. Kurniati, “Perbandingan Metode Naive Bayes dan SVM dalam Klasifikasi Penyakit Diabetes,” Jurnal Information Technology Ampera, vol. 1, no. 3, pp. 133–143

Downloads

Published

2026-03-30