Journal Classification Based on Abstract Using Cosine Similarity and Support Vector Machine
DOI:
https://doi.org/10.14421/jiska.2020.43-06Abstract
One of the problems related to journal publishing is the process of categorizing entry into journals according to the field of science. A large number of journal documents included in a journal editorial makes it difficult to categorize so that the process of plotting to reviewers requires a long process. The review process in a journal must be done planning according to the expertise of the reviewer, to produce a quality journal. This study aims to create a classification model that can classify journals automatically using the Cosine Similarity algorithm and Support Vector Machine in the classification process and using the TF-IDF weighting method. The object of this research is abstract in scientific journals. The journals will be classified according to the reviewer's field of expertise. Based on the experimental results, the Support Vector Machine method produces better performance accuracy than the Cosine Similarity method. The results of the calculation of the value of precision, recall, and f-score are known that the Support Vector Machine method produces better amounts, in line with the accuracy value.
References
Ahmed, H., Razzaq, M. A., & Qamar, A. M. (2013). Prediction of popular tweets using Similarity Learning. ICET 2013 - 2013 IEEE 9th International Conference on Emerging Technologies. https://doi.org/10.1109/ICET.2013.6743524
Cahyo, P. W. (2017). Model Monitoring Sebaran Penyakit Demam Berdarah di Indonesia Berdasarkan Analisis Pesan Twitter. Universitas Gadjah Mada Yogyakarta.
Dermawan, R. (2016). Klasifikasi Tweet dan Pengenalan Entitas Bernama pada Tweet Bencana Dengan Support Vector Machine. Universitas Gadjah Mada.
Fushiki, T. (2011). Estimation of prediction error by using K-fold cross-validation. Statistics and Computing, 21(2), 137–146. https://doi.org/10.1007/s11222-009-9153-8
Habibi, Muhamad, & Cahyo, P. W. (2019). Clustering User Characteristics Based on the influence of Hashtags on the Instagram Platform. IJCCS (Indonesian Journal of Computing and Cybernetics Systems), 13(4), 399–408. https://doi.org/10.22146/ijccs.50574
Habibi, Muhammad. (2017). Analisis Sentimen dan Klasifikasi Komentar Mahasiswa pada Sistem Evaluasi Pembelajaran Menggunakan Kombinasi KNN Berbasis Cosine Similarity dan Supervised Model. Departemen Ilmu Komputer dan Elektronika, Fakultas Matematika dan Ilmu Pengetahuan Alam. Universitas Gadjah Mada.
Habibi, Muhammad. (2018). Analisis Konten Jejaring Sosial Twitter dalam Kasus Pemilihan Gubernur DKI 2017. Teknomatika, 11(1), 31–40.
Habibi, Muhammad. & Sumarsono. (2018). Implementation of Cosine Similarity in an automatic classifier for comments. Jiska (Jurnal Informatika Sunan Kalijaga), 3(2), 38–46.
Haddi, E., Liu, X., & Shi, Y. (2013). The Role of Text Pre-processing in Sentiment Analysis. Procedia Computer Science, 17, 26–32. https://doi.org/10.1016/j.procs.2013.05.005
Hidayatullah, A. F., & Maarif, M. R. (2016). Penerapan Text Mining dalam Klasifikasi Judul Skripsi. In Seminar Nasional Aplikasi Teknologi Informasi (SNATi) Agustus (pp. 1907–5022). Yogyakarta.
Jayakodi, K., Bandara, M., & Meedeniya, D. (2016). An automatic classifier for exam questions with WordNet and Cosine similarity. 2nd International Moratuwa Engineering Research Conference, MERCon 2016, 12–17. https://doi.org/10.1109/MERCon.2016.7480108
Kadhim, A. I., Cheah, Y. N., Ahamed, N. H., & Salman, L. A. (2014). Feature extraction for co-occurrence-based cosine similarity score of text documents. 2014 IEEE Student Conference on Research and Development, SCOReD 2014, 2–5. https://doi.org/10.1109/SCORED.2014.7072954
Manning, C. D., Raghavan, P., & Schutze, H. (2009). An Introduction to Information Retrieval. Cambridge, England: Cambridge University Press. https://doi.org/10.1109/LPT.2009.2020494
Saipech, P., & Seresangtakul, P. (2018). Automatic Thai Subjective Examination using Cosine Similarity. ICAICTA 2018 - 5th International Conference on Advanced Informatics: Concepts Theory and Applications, 214–218. https://doi.org/10.1109/ICAICTA.2018.8541276
Siqueira, H., & Barros, F. (2010). A Feature Extraction Process for Sentiment Analysis of Opinions on Services. Proceedings of the III International Workshop on Web and Text Intelligence (WTI).
Downloads
Published
How to Cite
Issue
Section
License
Authors who publish with this journal agree to the following terms as stated in http://creativecommons.org/licenses/by-nc/4.0
a. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
b. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
c. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.