Journal Classification Based on Abstract Using Cosine Similarity and Support Vector Machine

Authors

  • Muhammad Habibi Universitas Jenderal Achmad Yani Yogyakarta http://orcid.org/0000-0001-7039-5827
  • Puji Winar Cahyo Universitas Jenderal Achmad Yani Yogyakarta

DOI:

https://doi.org/10.14421/jiska.2020.43-06

Abstract

One of the problems related to journal publishing is the process of categorizing entry into journals according to the field of science. A large number of journal documents included in a journal editorial makes it difficult to categorize so that the process of plotting to reviewers requires a long process. The review process in a journal must be done planning according to the expertise of the reviewer, to produce a quality journal. This study aims to create a classification model that can classify journals automatically using the Cosine Similarity algorithm and Support Vector Machine in the classification process and using the TF-IDF weighting method. The object of this research is abstract in scientific journals. The journals will be classified according to the reviewer's field of expertise. Based on the experimental results, the Support Vector Machine method produces better performance accuracy than the Cosine Similarity method. The results of the calculation of the value of precision, recall, and f-score are known that the Support Vector Machine method produces better amounts, in line with the accuracy value.

Author Biographies

Muhammad Habibi, Universitas Jenderal Achmad Yani Yogyakarta

Program Studi Informatika

Puji Winar Cahyo, Universitas Jenderal Achmad Yani Yogyakarta

Program Studi Informatika

References

Ahmed, H., Razzaq, M. A., & Qamar, A. M. (2013). Prediction of popular tweets using Similarity Learning. ICET 2013 - 2013 IEEE 9th International Conference on Emerging Technologies. https://doi.org/10.1109/ICET.2013.6743524

Cahyo, P. W. (2017). Model Monitoring Sebaran Penyakit Demam Berdarah di Indonesia Berdasarkan Analisis Pesan Twitter. Universitas Gadjah Mada Yogyakarta.

Dermawan, R. (2016). Klasifikasi Tweet dan Pengenalan Entitas Bernama pada Tweet Bencana Dengan Support Vector Machine. Universitas Gadjah Mada.

Fushiki, T. (2011). Estimation of prediction error by using K-fold cross-validation. Statistics and Computing, 21(2), 137–146. https://doi.org/10.1007/s11222-009-9153-8

Habibi, Muhamad, & Cahyo, P. W. (2019). Clustering User Characteristics Based on the influence of Hashtags on the Instagram Platform. IJCCS (Indonesian Journal of Computing and Cybernetics Systems), 13(4), 399–408. https://doi.org/10.22146/ijccs.50574

Habibi, Muhammad. (2017). Analisis Sentimen dan Klasifikasi Komentar Mahasiswa pada Sistem Evaluasi Pembelajaran Menggunakan Kombinasi KNN Berbasis Cosine Similarity dan Supervised Model. Departemen Ilmu Komputer dan Elektronika, Fakultas Matematika dan Ilmu Pengetahuan Alam. Universitas Gadjah Mada.

Habibi, Muhammad. (2018). Analisis Konten Jejaring Sosial Twitter dalam Kasus Pemilihan Gubernur DKI 2017. Teknomatika, 11(1), 31–40.

Habibi, Muhammad. & Sumarsono. (2018). Implementation of Cosine Similarity in an automatic classifier for comments. Jiska (Jurnal Informatika Sunan Kalijaga), 3(2), 38–46.

Haddi, E., Liu, X., & Shi, Y. (2013). The Role of Text Pre-processing in Sentiment Analysis. Procedia Computer Science, 17, 26–32. https://doi.org/10.1016/j.procs.2013.05.005

Hidayatullah, A. F., & Maarif, M. R. (2016). Penerapan Text Mining dalam Klasifikasi Judul Skripsi. In Seminar Nasional Aplikasi Teknologi Informasi (SNATi) Agustus (pp. 1907–5022). Yogyakarta.

Jayakodi, K., Bandara, M., & Meedeniya, D. (2016). An automatic classifier for exam questions with WordNet and Cosine similarity. 2nd International Moratuwa Engineering Research Conference, MERCon 2016, 12–17. https://doi.org/10.1109/MERCon.2016.7480108

Kadhim, A. I., Cheah, Y. N., Ahamed, N. H., & Salman, L. A. (2014). Feature extraction for co-occurrence-based cosine similarity score of text documents. 2014 IEEE Student Conference on Research and Development, SCOReD 2014, 2–5. https://doi.org/10.1109/SCORED.2014.7072954

Manning, C. D., Raghavan, P., & Schutze, H. (2009). An Introduction to Information Retrieval. Cambridge, England: Cambridge University Press. https://doi.org/10.1109/LPT.2009.2020494

Saipech, P., & Seresangtakul, P. (2018). Automatic Thai Subjective Examination using Cosine Similarity. ICAICTA 2018 - 5th International Conference on Advanced Informatics: Concepts Theory and Applications, 214–218. https://doi.org/10.1109/ICAICTA.2018.8541276

Siqueira, H., & Barros, F. (2010). A Feature Extraction Process for Sentiment Analysis of Opinions on Services. Proceedings of the III International Workshop on Web and Text Intelligence (WTI).

Downloads

Published

2020-02-21

How to Cite

Habibi, M., & Cahyo, P. W. (2020). Journal Classification Based on Abstract Using Cosine Similarity and Support Vector Machine. JISKA (Jurnal Informatika Sunan Kalijaga), 4(3), 185–192. https://doi.org/10.14421/jiska.2020.43-06