Analyzing the Accuracy of Answer Sheet Data in Paper-based Test Using Decision Tree


data mining
decision tree
paper-based test

How to Cite

Suharto, E., Widodo, A. P., & Suryono, S. (2019). Analyzing the Accuracy of Answer Sheet Data in Paper-based Test Using Decision Tree. IJID (International Journal on Informatics for Development), 8(1), 1–7.


In education quality assurance, the accuracy of test data is crucial. However, there is still a problem regarding to the possibility of incorrect data filled by test taker during paper-based test. On the contrary, this problem does not appear in computer-based test. In this study, a method was proposed in order to analyze the accuracy of answer sheet filling out in paper-based test using data mining technique. A single layer of data comprehension was added within the method instead of raw data. The results of the study were a web-based program for data pre-processing and decision tree models. There were 374 instances which were analyzed. The accuracy of answer sheet filling out attained 95.19% while the accuracy of classification varied from 99.47% to 100% depend on evaluation method chosen. This study could motivate the administrators for test improvement since it preferred computer-based test to paper-based.


M. S. Kandil, A. E. Hassan, A. S. Asem, and M. E. Ibrahim, Prototype of Web2-based system for Quality Assurance Evaluation Process in Higher education Institutions, International Journal of Electrical & Computer Sciences IJECS-IJENS 10 (02), 2010.

P. B. Ebrey, The Cambridge Illustrated History of China, Cambridge University Press, Cambridge, 2010.

K. J. G. Leeuwenkamp, D. J. Brinke, and L. Kester, Assessment quality in tertiary education: An integrative literature review, Studies in Educational Evaluation 55, 2017, pp. 94-116.

M. W. Rodrigues, L. E. Zárate, and S. Isotani, Educational Data Mining: A review ofevaluation process in the e-learning, Telematics and Informatics, 2018.

S. Natek and M. Zwilling, Student data mining solution–knowledge management system related to higher education institutions, Expert Systems with Applications 41 (14), 2014, pp. 6400-6407.

X. Guan, J. Liang, Y. Qian, and J. Pang, A multi-view OVA model based on decision tree for multi-classification tasks, Knowledge-Based Systems 138, 2017, pp. 208-219.

J. R. Quinlan, C4.5: Programs for Machine Learning, Morgan Kaufmann, San Francisco, 1993.

Q. A. Al-Radaideh, A. A. Assaf, and E. Alnagi, Predicting Stock Prices Using Data mining Techniques, The International Arab Conference on Information Technology, 2013.

H. R. Kim, M. Bowles, X. Yan, and S. J. Chung, Examining the comparability between paper- and computer-based versions of an integrated writing placement test, Assessing Writing 36, 2018, pp. 49-62.

N. A. Karim and Z. Shukur, Proposed features of an online examination interface design and its optimal values, Computers in Human Behavior 64, 2016, pp. 414-422.

D. Coniam, Evaluating computer-based and paper-based versions of an English-language listening test. Cambridge University Press 18 (2), 2006, pp. 193-211.

C. Y. Piaw, Replacing paper-based testing with computer-based testing in assessment: Are we doing wrong?, Procedia - Social and Behavioral Sciences 64, November 9, 2012, pp. 655-664.

A. A. Prisacari and J. Danielson, Computer-based versus paper-based testing: Investigating testing mode with cognitive load and scratch paper use, Computers in Human Behavior 77, 2017, pp. 1-10.

S. Chan, S. Bax, and C. Weir, Researching the comparability of paper-based and computer-based delivery in a high-stakes writing test, Assessing Writing 36, 2018, pp. 32-48.

M. J. Zaki and W. Meira Jr., Data mining and analysis: fundamental concepts and algorithms, Cambridge University Press, New York, 2014.

I. Witten, E. Frank, M. A. Hall, and C. J. Pal, Data mining: practical machine learning tools and techniques Fourth Edition, Morgan Kaufmann, Cambridge, 2017.

J. Han, M. Kamber, and J. Pei, Data mining: Concepts and Techniques, 3rd edition., Morgan Kaufmann, Amsterdam, 2011.

T. M. Mitchell, Machine Learning, McGraw-Hill, New York, 1997.

C. E. Shannon, Prediction and entropy of printed English. The Bell System Technical Journal 30, 1951, pp. 50–64.

E. Suharto, A. P. Widodo, and Suryono, Decision Tree for Analyzing the Accuracy of Answer Sheet Data in Paper-based Test, Proceeding The 2nd International Conference on Informatics for Development, 2018.

A. B. Downey, Think Stats: Probability and Statistics for Programmers, Green Tea Press, Needham, 2011.

Creative Commons License
IJID (International Journal on Informatics for Development) is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License