Abstract
Digital image processing is an active research field because it can be applied to a wide range of activities, in both analysis and production. One branch of digital image processing is pattern recognition. This study uses Tesseract as a tool to recognize Hiragana characters and evaluates how well it recognizes Japanese text, including handwritten text. The training data consist of a single image containing 74 Hiragana characters, each of which is trained individually. Tests were run under several criteria based on font size and image resolution to find the conditions that give the best recognition results. The resulting system can be trained on and recognize all 74 Hiragana characters using the Tesseract engine. Its best recognition rate on images was 98.24%, obtained at an image resolution of 200 dpi (dots per inch) and font size 18. For handwritten images, the best recognition rate was 90%, also at 200 dpi.
References
A. R. Kardian, “Pengantar Pengolahan Citra,” in Pengolahan Citra Digital, Jakarta, 2012.
R. Smith, “Tesseract OCR engine: What it is, where it came from, where it is going,” 2007. [Online]. Available: tesseract-ocr.googlecode.com/files/TesseractOSCON.pdf. [Accessed: 29-Jan-2013].
E. and C. L. Tsujita, Mahir Bahasa Jepang dalam Sepekan. Jakarta: Kesaint Blanc, 2005.
A. Hasnan, “Pengantar Belajar Bahasa Jepang,” 2009. [Online]. Available: https://bando07.files.wordpress.com/2009/10/bahasa-jepang-dan-indonesia1.pdf. [Accessed: 18-Aug-2013].
S. Center, “Hiragana,” 2006. [Online]. Available: http://www.shinjukucenter.com/Hiragana.php. [Accessed: 18-Aug-2013].

IJID (International Journal on Informatics for Development) is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.