Japanese Letter Pattern Recognition Application  with Tesseract Engine

Akhmad Imam Fahrizal; Ahmad Subhan Yazid; Shofwatul Uyun

doi:10.14421/ijid.2015.04202

Vol. 4 No. 2 (2015), Articles

Vol. 4 No. 2 (2015)

Japanese Letter Pattern Recognition Application with Tesseract Engine

Articles

https://doi.org/10.14421/ijid.2015.04202

Published 2015-12-26

Akhmad Imam Fahrizal⁺⁻
Ahmad Subhan Yazid⁺⁻
Shofwatul Uyun⁺⁻

Akhmad Imam Fahrizal

Informatics Department Islamic State University (UIN) of Sunan Kalijaga Yogyakarta

Ahmad Subhan Yazid

Informatics Department Islamic State University (UIN) of Sunan Kalijaga Yogyakarta

Shofwatul Uyun

Prodi Informatika, Program Magister, Fakultas Sains dan Teknologi, UIN Sunan Kalijaga Yogyakarta

http://orcid.org/0000-0002-6704-0248

PDF

Keywords

Hiragana
Pattern Recognition System
Tesseract

How to Cite

Fahrizal, A. I., Yazid, A. S., & Uyun, S. (2015). Japanese Letter Pattern Recognition Application with Tesseract Engine. IJID (International Journal on Informatics for Development), 4(2), 8–11. https://doi.org/10.14421/ijid.2015.04202

Abstract

Digital image processing is a field that is being cultivated by many researchers at this time because it is interesting to apply to various activities, both analysis and production activities. One branch of the digital image is pattern recognition. This study uses Tesseract as a tool to recognize patterns from Hiragana letters. This study was conducted to find out how much Tesseract was able to recognize a Japanese text and handwritten text. This study uses 1 image as training data containing 74 Hiragana letters which are processed through training for each letter. This study has several testing criteria based on font size and resolution to find the best results in pattern recognition. This pattern recognition system is able to do data training and recognize 74 Hiragana letters using the Tesseract Engine. The system can also recognize images with the best success percentage of 98.24% with an image resolution of 200dpi (dots per inch) at size 18. This system can also recognize handwritten images with the best percentage of success of 90% with 200dpi image resolution.

https://doi.org/10.14421/ijid.2015.04202

PDF

References

A. R. Kardian, “Pengantar Pengolahan Citra,” in Pengolahan Citra Digital, Jakarta, 2012.

R. Smith, “Tesseract OCR engine What it is, where it came from, where it is going,” 2007. [Online]. Available: tesseract-ocr.googlecode.com/files/TesseractOSCON.pdf. [Accessed: 29-Jan-2013].

E. and C. L. Tsujita, Mahir Bahasa Jepang dalam Sepekan. Jakarta: Kesaint Blanc, 2005.

A. Hasnan, “Pengantar Belajar Bahasa Jepang,” 2009. [Online]. Available: https://bando07.files.wordpress.com/2009/10/bahasa-jepang-dan-indonesia1.pdf. [Accessed: 18-Aug-2013].

S. Center, “Hiragana,” 2006. [Online]. Available: http://www.shinjukucenter.com/Hiragana.php. [Accessed: 18-Aug-2013].

IJID (International Journal on Informatics for Development) is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License

Japanese Letter Pattern Recognition Application with Tesseract Engine

Keywords

How to Cite

Download Citation

Abstract

References