Ensemble Learning for E-Commerce Product Categorization Using Boosting Techniques

Authors

  • Genta Dwigi Sepbriant, Universitas Dian Nuswantoro
  • Danang Wahyu Utomo, Universitas Dian Nuswantoro

DOI:

https://doi.org/10.14421/jiska.2024.9.2.123-133

Keywords:

Product Categorization, E-Commerce, Ensemble Learning, XGBoost, Boosting

Abstract

E-commerce has contributed significantly to technological advancement, especially for businesses that adopt it. Its user base has grown substantially, reaching 196.47 million users in 2023. E-commerce platforms offer a wide range of product variations, which can cause errors or confusion when users select products. Product categorization is therefore crucial for helping users navigate efficiently, but manual categorization is time-consuming and less effective. This study examines the factors that affect grouping when the K-Nearest Neighbors (KNN) algorithm is used for product categorization on an e-commerce platform, and considers whether its novelty lies in the implemented algorithm, the variables used, or the grouping parameters applied. It then applies the XGBoost algorithm, an ensemble learning approach, to improve the effectiveness of product categorization. The findings indicate that boosting algorithms such as XGBoost outperform individual algorithms such as KNN in classification accuracy, which suggests that ensemble learning can substantially improve product classification in e-commerce. Testing of the e-commerce system implemented in this study further supports the theoretical and practical benefits of applying this research to improve efficiency and user experience in product categorization.
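
The contrast the abstract draws between a single learner (KNN) and a boosted ensemble can be illustrated with a minimal sketch in plain Python. This is not the authors' pipeline: the product titles are invented toy data, the KNN baseline uses simple word overlap as its similarity, and AdaBoost with one-word decision stumps stands in for XGBoost so that the example needs no external libraries. All function names here are hypothetical.

```python
import math
from collections import Counter

# Toy, invented product titles (NOT the paper's dataset).
TRAIN = [
    ("wireless bluetooth headphones", "electronics"),
    ("usb charger cable", "electronics"),
    ("bluetooth speaker portable", "electronics"),
    ("laptop charger adapter", "electronics"),
    ("cotton shirt men", "fashion"),
    ("denim jacket women", "fashion"),
    ("cotton dress women", "fashion"),
    ("leather jacket men", "fashion"),
]

def tokens(title):
    """Represent a title as the set of its lowercase words."""
    return set(title.lower().split())

def knn_predict(title, train, k=3):
    """Single-learner baseline: majority label among the k training
    titles that share the most words with the query."""
    query = tokens(title)
    ranked = sorted(train, key=lambda ex: len(query & tokens(ex[0])), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

def fit_adaboost(train, positive, rounds=5):
    """Boosting stand-in for XGBoost: AdaBoost over one-word stumps.
    Each round picks the stump with the lowest weighted error, then
    increases the weight of the examples that stump got wrong."""
    X = [tokens(t) for t, _ in train]
    y = [1 if label == positive else -1 for _, label in train]
    vocab = sorted(set().union(*X))
    w = [1.0 / len(X)] * len(X)
    model = []
    for _ in range(rounds):
        best = None
        for word in vocab:
            for sign in (1, -1):
                # Stump: predict `sign` if the word is present, else `-sign`.
                preds = [sign if word in x else -sign for x in X]
                err = sum(wi for wi, p, yi in zip(w, preds, y) if p != yi)
                if best is None or err < best[0]:
                    best = (err, word, sign, preds)
        err, word, sign, preds = best
        err = min(max(err, 1e-10), 1 - 1e-10)  # avoid log(0) and division by zero
        alpha = 0.5 * math.log((1 - err) / err)
        model.append((alpha, word, sign))
        # Reweight: misclassified examples gain weight, correct ones lose it.
        w = [wi * math.exp(-alpha * yi * p) for wi, yi, p in zip(w, y, preds)]
        total = sum(w)
        w = [wi / total for wi in w]
    return model

def boost_predict(title, model, positive, negative):
    """Weighted vote of all stumps; a non-negative score means `positive`."""
    x = tokens(title)
    score = sum(a * (s if word in x else -s) for a, word, s in model)
    return positive if score >= 0 else negative

if __name__ == "__main__":
    model = fit_adaboost(TRAIN, positive="electronics")
    for title in ("bluetooth charger", "cotton jacket kids"):
        print(title, "| KNN:", knn_predict(title, TRAIN),
              "| boosting:", boost_predict(title, model, "electronics", "fashion"))
```

On data this small both approaches agree, so the sketch shows only the mechanics: KNN classifies from a single neighborhood vote, while boosting combines several weak rules, each trained to correct the errors of the previous ones. The accuracy gap reported in the abstract would have to be measured on a real dataset with a held-out test split.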

Published

2024-05-25

How to Cite

Sepbriant, G. D., & Utomo, D. W. (2024). Ensemble Learning pada Kategorisasi Produk E-Commerce Menggunakan Teknik Boosting. JISKA (Jurnal Informatika Sunan Kalijaga), 9(2), 123–133. https://doi.org/10.14421/jiska.2024.9.2.123-133

Section

Articles