Implementation of Web Scraping to Build a Web-Based Instagram Account Data Downloader Application

Arif Himawan, Adri Priadana, Aris Murdiyanto

Abstract


Instagram is used by many groups, from business people and academics to politicians, who gain insights by processing and analyzing Instagram data for various purposes. Before data can be processed and analyzed, however, it must first be collected or downloaded from Instagram. The problem is that most data collection is still done manually, and many parties offer Instagram account data download services at various price points. This research applied a web scraping method to build a web-based application that downloads Instagram account data automatically, so that multiple parties can use it. Web scraping was chosen because it does not require Instagram's Application Programming Interface (API), which restricts access to data on Instagram. In this study, the application was tested on 15 Instagram accounts with publication counts ranging from 100 to 11,000. Based on an analysis of the downloaded data, the web scraping method successfully downloaded a maximum of 2,412 account data items. In this application, users can download Instagram account data into a Data Collection and then manage it, for example by deleting it or exporting it as CSV, Excel, or JSON.
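The management step described in the abstract (deleting items and exporting a collection) can be illustrated with a minimal sketch. This is not the authors' implementation: the field names (`shortcode`, `caption`, `likes`, `comments`), the function names, and the sample records are all hypothetical, since the abstract does not specify what the application stores. Excel export is omitted because it would need a third-party library.

```python
import csv
import io
import json

# Hypothetical record layout; the abstract does not list the stored fields.
FIELDS = ["shortcode", "caption", "likes", "comments"]


def export_csv(records):
    """Serialize a list of dict records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


def export_json(records):
    """Serialize the same records to a JSON array, keeping non-ASCII text."""
    return json.dumps(records, ensure_ascii=False, indent=2)


def delete_record(records, shortcode):
    """Return a new collection without the record that has this shortcode."""
    return [r for r in records if r["shortcode"] != shortcode]


# Made-up sample data standing in for a downloaded data collection.
sample = [
    {"shortcode": "abc", "caption": "Hello, world", "likes": 10, "comments": 2},
    {"shortcode": "xyz", "caption": "Second post", "likes": 5, "comments": 0},
]

if __name__ == "__main__":
    print(export_csv(sample))
    print(export_json(delete_record(sample, "abc")))
```

Keeping the collection as a plain list of dictionaries means the same records can feed every export format without conversion.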

Keywords


data download application; Instagram account data; web scraping; social media; Instagram

DOI: http://dx.doi.org/10.14421/ijid.2020.09201


Copyright (c) 2020 IJID (International Journal on Informatics for Development)

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

 

ISSN: 2252-7834 (print) | 2549-7448 (online)

International Journal on Informatics for Development

Office : Informatics Dept. Faculty of Science and Technology,

State Islamic University (UIN) Sunan Kalijaga,

Yogyakarta-Indonesia

Marsda Adisucipto Street, Yogyakarta

Phone +62-274 519739 Fax. +62-274 540971

Email : ijid@uin-suka.ac.id
