Klasifikasi Ulasan Fasilitas Publik Menggunakan Metode Naïve Bayes dengan Seleksi Fitur Chi-Square
DOI:
https://doi.org/10.14421/jiska.2023.8.2.112-124Keywords:
Sentiment Analysis, Public Facility, Google Maps Reviews, Naïve Bayes, Chi-Square Feature SelectionAbstract
Government builds public facilities to support the needs of the community. The use of these public facilities needs to be re-evaluated, and one way to do it is through community response. Google Maps is one platform that receives the most responses from the community about location. Google Maps Reviews allow us to see how the public reacts to a location. Naïve Bayes method is used for classification in this study because it is one of the simple methods in machine learning that can be easily applied to several experiments conducted by the author. In the classification process, reviews produce many features that will be calculated based on their class. More features generated, more features processed too in the system. Chi-Square feature selection will be used to reduce features that have low dependence on the system. In this study, performance values will be calculated based on the experimental use of feature ratios of 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, and 100%. The results show that the use of 10% Chi-Square features produces the best performance, with an accuracy rate of 86.94%, precision of 80.42%, recall of 80.42%, and f-measure of 80.42%.
References
Aziz, K. E., Crysdian, C., & Imamudin, M. (2021). Identification of Student Academic Performance in Computer Science Based on Naive Bayes. MATICS, 13(1), 33–41. https://doi.org/10.18860/mat.v13i1.9726
Baskoro, S. E., Suhartono, S., Chamidy, T., & Zaman, S. (2022). Pengujian akurasi model regresi logistik multinomial untuk memprediksi keberhasilan mahasiswa di perguruan tinggi menggunakan r. Fair Value: Jurnal Ilmiah Akuntansi Dan Keuangan, 5(3), 1551–1565. https://doi.org/10.32670/fairvalue.v5i3.2472
Dulhare, U. N. (2018). Prediction system for heart disease using Naive Bayes and particle swarm optimization. Biomedical Research (India), 29(12), 2646–2649. https://doi.org/10.4066/BIOMEDICALRESEARCH.29-18-620
Faisal, M., Nugroho, F., Sulthan, M. M. El, Amini, F., Hariyadi, M. A., & Sedayu, A. (2020). Plagiarism Detection Using Manber and Winnowing Algorithm. International Journal of Advanced Science and Technology, 29(6s), 2130–2136. http://sersc.org/journals/index.php/IJAST/article/view/10924
Han, J., & Kamber, M. (2001). Data Mining: Concepts and Techniques (3rd ed.). Morgan Kaufmann.
Harahap, F., Harahap, A. Y. N., Ekadiansyah, E., Sari, R. N., Adawiyah, R., & Harahap, C. B. (2018). Implementation of Naïve Bayes Classification Method for Predicting Purchase. 2018 6th International Conference on Cyber and IT Service Management (CITSM), 1–5. https://doi.org/10.1109/CITSM.2018.8674324
Irvantoro, D. (2019). Feature Selection Chi-Square N-Gram Naïve Bayes Classifier Review [Universitas Muhammadiyah Jember]. http://repository.unmuhjember.ac.id/7128/
Jurafsky, D., & Martin, J. H. (2023). Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition (3rd ed.). Prentice Hall. https://web.stanford.edu/~jurafsky/slp3/ed3book.pdf
Linawati, S., Nurdiani, S., Handayani, K., & Latifah. (2020). Prediksi Prestasi Akademik Mahasiswa Menggunakan Algoritma Random Forest dan C4.5. 8(1), 6–13. https://doi.org/10.31294/jki.v8i1.7827
Liu, B. (2012). Sentiment Analysis and Opinion Mining. Synthesis Lectures on Human Language Technologies, 5(1), 1–167. https://doi.org/10.2200/S00416ED1V01Y201204HLT016
Huan Liu, & Setiono, R. (1995). Chi2: feature selection and discretization of numeric attributes. Proceedings of 7th IEEE International Conference on Tools with Artificial Intelligence, 388–391. https://doi.org/10.1109/TAI.1995.479783
Pratama, N. D., Sari, Y. A., & Adikara, P. P. (2018). Analisis Sentimen pada Review Konsumen Menggunakan Metode Naive Bayes dengan Seleksi Fitur Chi Square untuk Rekomendasi Lokasi Makanan Tradisional. Jurnal Pengembangan Teknologi Informasi Dan Ilmu Komputer, 2(9), 2982–2988. https://j-ptiik.ub.ac.id/index.php/j-ptiik/article/view/2494
Putri, L. A. A. R. (2018). Seleksi Fitur dalam Klasifikasi Genre Musik. Jurnal Ilmu Komputer, 10(1), 19–26. https://ojs.unud.ac.id/index.php/jik/article/view/39772
Russell, S., & Norvig, P. (1995). Artificial intelligence—a modern approach. In The Knowledge Engineering Review (Issue 1). Pearson Education. https://www.cambridge.org/core/product/identifier/S0269888900007724/type/journal_article
Ruz, G. A., Henríquez, P. A., & Mascareño, A. (2020). Sentiment analysis of Twitter data during critical events through Bayesian networks classifiers. Future Generation Computer Systems, 106, 92–104. https://doi.org/10.1016/j.future.2020.01.005
Sanders, R. (1987). THE PARETO PRINCIPLE: ITS USE AND ABUSE. Journal of Services Marketing, 1(2), 37–40. https://doi.org/10.1108/eb024706
Singh, G., Kumar, B., Gaur, L., & Tyagi, A. (2019). Comparison between Multinomial and Bernoulli Naïve Bayes for Text Classification. 2019 International Conference on Automation, Computational and Technology Management (ICACTM), 593–596. https://doi.org/10.1109/ICACTM.2019.8776800
Xu, G., Meng, Y., Qiu, X., Yu, Z., & Wu, X. (2019). Sentiment Analysis of Comment Texts Based on BiLSTM. IEEE Access, 7, 51522–51532. https://doi.org/10.1109/ACCESS.2019.2909919
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Adhitya Prayoga Permana, Totok Chamidy, Cahyo Crysdian
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Authors who publish with this journal agree to the following terms as stated in http://creativecommons.org/licenses/by-nc/4.0
a. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
b. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
c. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.