Ensemble Learning pada Kategorisasi Produk E-Commerce Menggunakan Teknik Boosting
DOI:
https://doi.org/10.14421/jiska.2024.9.2.123-133Keywords:
Product Categorization, E-Commerce, Ensemble Learning, XGBoost, BoostingAbstract
The development of e-commerce significantly contributes to technological advancement, especially for businesses adopting the concept. The growth of e-commerce has seen a significant increase, reaching 196.47 million users in 2023. In e-commerce, a wide range of product variations is provided to users, which can lead to errors or confusion in product selection. Product categorization is crucial in e-commerce to assist users in navigating efficiently. However, manual categorization is less effective as it can be time-consuming. This study aims to clarify the factors of concern in grouping using the K-Nearest Neighbors (KNN) algorithm in product categorization on the e-commerce platform. This research focuses on whether the novelty lies in the implemented algorithm, the variables used, or the applied grouping parameters. This work applies the XGBoost algorithm to improve the effectiveness of product categorization in e-commerce through ensemble learning approaches. The research findings indicate that boosting algorithms like XGBoost outperform individual algorithms like KNN regarding classification accuracy. This proves that ensemble learning approaches may greatly enhance product classification in e-commerce. The testing process of the implemented e-commerce system in this study also provides confidence in the theoretical and practical benefits of applying this research to enhance efficiency and user experience in product categorization on the e-commerce platform.
References
Andriani, N., & Wibowo, A. (2021). Implementasi Text Mining Klasifikasi Topik Tugas Akhir Mahasiswa Teknik Informatika Menggunakan Pembobotan TF-IDF dan Metode Cosine Similarity Berbasis Web. Prosiding Seminar Nasional Mahasiswa Bidang Ilmu Komputer Dan Aplikasinya, 2(2), 130–137. https://conference.upnvj.ac.id/index.php/senamika/article/view/1807
Ansharullah, M. O., Agustin, W., Lusiana, Junadhi, Erlinda, S., & Zoromi, F. (2023). Product Classification Based on Categories and Customer Interests on the Shopee Marketplace Using the Naïve Bayes Method. JAIA - Journal of Artificial Intelligence and Applications, 2(2), 15–22. https://doi.org/10.33372/jaia.v2i2.888
Arumnisaa, R. I., & Wijayanto, A. W. (2023). Comparison of Ensemble Learning Method: Random Forest, Support Vector Machine, AdaBoost for Classification Human Development Index (HDI). SISTEMASI, 12(1), 206. https://doi.org/10.32520/stmsi.v12i1.2501
Donati, L., Iotti, E., Mordonini, G., & Prati, A. (2019). Fashion Product Classification through Deep Learning and Computer Vision. Applied Sciences, 9(7), 1385. https://doi.org/10.3390/app9071385
Dong, X., Yu, Z., Cao, W., Shi, Y., & Ma, Q. (2020). A survey on ensemble learning. Frontiers of Computer Science, 14(2), 241–258. https://doi.org/10.1007/S11704-019-8208-Z/METRICS
Fayaz, M., Khan, A., Rahman, J. U., Alharbi, A., Uddin, M. I., & Alouffi, B. (2020). Ensemble Machine Learning Model for Classification of Spam Product Reviews. Complexity, 2020, 1–10. https://doi.org/10.1155/2020/8857570
Gomero-Fanny, V., Ruiz, A., & Andrade-Arenas, L. (2021). Prototype of Web System for Organizations Dedicated to e-Commerce under the SCRUM Methodology. International Journal of Advanced Computer Science and Applications, 12(1), 437–444. https://doi.org/10.14569/IJACSA.2021.0120152
Huang, Y., Chai, Y., Liu, Y., & Shen, J. (2019). Architecture of next-generation e-commerce platform. Tsinghua Science and Technology, 24(1), 18–29. https://doi.org/10.26599/TST.2018.9010067
Indasari, S. S., & Tjahyanto, A. (2023). Automatic Categorization of Multi Marketplace FMCGs Products using TF-IDF and PCA Features. Jurnal Sisfokom (Sistem Informasi Dan Komputer), 12(2), 198–204. https://doi.org/10.32736/sisfokom.v12i2.1621
Jafarzadeh, H., Mahdianpari, M., Gill, E., Mohammadimanesh, F., & Homayouni, S. (2021). Bagging and Boosting Ensemble Classifiers for Classification of Multispectral, Hyperspectral and PolSAR Data: A Comparative Evaluation. Remote Sensing, 13(21), 4405. https://doi.org/10.3390/rs13214405
Jahanshahi, H., Ozyegen, O., Cevik, M., Bulut, B., Yigit, D., Gonen, F. F., & Başar, A. (2021). Text Classification for Predicting Multi-level Product Categories. http://arxiv.org/abs/2109.01084
Jain, S., & Kumar, V. (2020). Garment Categorization Using Data Mining Techniques. Symmetry, 12(6), 984. https://doi.org/10.3390/sym12060984
Kalaivani, P. (2020). Machine Learning Approach to Analyse Ensemble Models and Neural Network Model for E-Commerce Application. Indian Journal of Science and Technology, 13(28), 2849–2857. https://doi.org/10.17485/IJST/v13i28.927
Kim, H., Joo, G., & Im, H. (2021). Product Category Classification using Word Embedding and GRUs. The Journal of Korean Institute of Information Technology, 19(4), 11–18. https://doi.org/10.14801/jkiit.2021.19.4.11
Lee, H., & Yoon, Y. (2018). Engineering doc2vec for automatic classification of product descriptions on O2O applications. Electronic Commerce Research, 18(3), 433–456. https://doi.org/10.1007/S10660-017-9268-5/METRICS
Mashalah, H. Al, Hassini, E., Gunasekaran, A., & Bhatt (Mishra), D. (2022). The impact of digital transformation on supply chains through e-commerce: Literature review and a conceptual framework. Transportation Research Part E: Logistics and Transportation Review, 165, 102837. https://doi.org/10.1016/j.tre.2022.102837
Mienye, I. D., & Sun, Y. (2022). A Survey of Ensemble Learning: Concepts, Algorithms, Applications, and Prospects. IEEE Access, 10, 99129–99149. https://doi.org/10.1109/ACCESS.2022.3207287
Nobre, J., & Neves, R. F. (2019). Combining Principal Component Analysis, Discrete Wavelet Transform and XGBoost to trade in the financial markets. Expert Systems with Applications, 125, 181–194. https://doi.org/10.1016/j.eswa.2019.01.083
Ozyegen, O., Jahanshahi, H., Cevik, M., Bulut, B., Yigit, D., Gonen, F. F., & Başar, A. (2022). Classifying multi-level product categories using dynamic masking and transformer models. Journal of Data, Information and Management, 4(1), 71–85. https://doi.org/10.1007/s42488-022-00066-6
Patra, A., Vivek, V., Shambhavi, B. R., Sindhu, K., & Balaji, S. (2021). Product Classification in E-Commerce Sites. In Advances in Intelligent Systems and Computing: Vol. 1299 AISC (pp. 485–495). Springer, Singapore. https://doi.org/10.1007/978-981-33-4299-6_40
Pawłowski, M. (2022). Machine Learning Based Product Classification for eCommerce. Journal of Computer Information Systems, 62(4), 730–739. https://doi.org/10.1080/08874417.2021.1910880
Perdana, S. A. P., Aji, T. B., & Ferdiana, R. (2021). Aspect Category Classification dengan Pendekatan Machine Learning Menggunakan Dataset Bahasa Indonesia. Jurnal Nasional Teknik Elektro Dan Teknologi Informasi, 10(3), 229–235. https://doi.org/10.22146/jnteti.v10i3.1819
Pothuganti, K. (2019). Open-World Classification Algorithm to Product Identification. International Journal of Innovative Research in Computer and Communication Engineering, 7(12), 4282–4287. https://doi.org/10.2139/ssrn.3719055
Ristoski, P., Petrovski, P., Mika, P., & Paulheim, H. (2018). A machine learning approach for product matching and categorization. Semantic Web, 9(5), 707–728. https://doi.org/10.3233/SW-180300
Sharma, P., & Sagvekar, V. R. (2023). Weighted Ensemble LSTM Model with Word Embedding Attention for E-Commerce Product Recommendation. Journal of Communications Software and Systems, 19(4), 299–307. https://doi.org/10.24138/jcomss-2023-0126
Tan, L., Li, M. Y., & Kok, S. (2020). E-Commerce Product Categorization via Machine Translation. ACM Transactions on Management Information Systems, 11(3), 1–14. https://doi.org/10.1145/3382189
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Genta Dwigi Sepbriant, Danang Wahyu Utomo
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Authors who publish with this journal agree to the following terms as stated in http://creativecommons.org/licenses/by-nc/4.0
a. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
b. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
c. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.