Text Detection In Indonesian Identity Card Based On Maximally Stable Extremal Regions

https://doi.org/10.22146/ijccs.41259

Angga Maulana Purba(1*), Agus Harjoko(2), Mohammad Edi Wibowo(3)

(1) Master Program of Computer Science; FMIPA UGM, Yogyakarta
(2) Department of Computer Science and Electronics, FMIPA UGM, Yogyakarta
(3) Department of Computer Science and Electronics, FMIPA UGM, Yogyakarta
(*) Corresponding Author

Abstract


Most of Indonesian organizations either it is government or non government sometime required their member to provide their identity card (E-KTP) as legal document collection in their database. This collection of image usually being used as manual verification method. These document images acquired by each person with their own device, there are variations of angles they are used to acquire the image. This situation created problems in text recognition by OCR softwares especially in text detection part, orientation and noise will affect their accuracy. These cases making the text detection more complex and cannot be solved by simple vertical projection profile of black pixels.  This research proposed a method to improve text detection in identity document by fixing the orientation first, then using MSER regions to form text region. We fix the orientation using the line that made by Progressive Probabilistic Hough Transform. Then we used MSER to obtain all candidate regions and Horizontal RLSA acts as connector between those candidate. The orientation fixing strategy reach average of margin error 0.377o (in 360o system) and the text detection method reach 84.49% accuracy in best condition.


Keywords


MSER, Hough Transform; Progressive Probabilistic Hough Transform; RLSA; text detection

Full Text:

PDF


References

[1] A. Farahmand, A. Sarrafzadeh and J. Shanbehzadeh, Document Image Noises and Removal Methods, International MultiConference of Engineers and Computer Scientists, Vol I., 2013.

[2] A. El Harraj and N. Raissouni, OCR Accuracy Improvement On Document Images Through A Novel Pre-Processing Approach, Signal & Image Processing : An International Journal (SIPIJ), Vol.6, No.4, 2015.

[3] S. Widodo and Gunawan, "Template Matching pada Citra E-KTP Indonesia", SNATIKA, 2015.

[4] R. Akhter, M. Bhuiyandan Uddin., Extraction of Words from the National ID Cards for Automated Recognition, The International Society for Optical Engineering, 72-. 10.1117/12.913478, 2011.

[5] N. Jirasuwankul, "Effect of text orientation to OCR error and anti-skew of text using projective transform technique," IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM), pp. 856-861., 2011.

[6] T.A. Jundale and R.S. Hegadi, Skew Detection and Correction of Devenagari Script Using Hough Transform, International Conferenca on Advanced Computing Technologies and Applications, pp. 305-311., 2015.

[7] A. S. Hassanein , S. Mohammad, M. Sameer, and M. E. Ragab, A Survey on Hough Transform, Theory, Techniques and Applications, International Journal Of Computer Science, Vol. 12, Issue 1, 2015.

[8] X. Yang, Y. Zhao, J. Fang, Y. Lu, Y. Zhang and Y. Yuan, "A license plate segmentation algorithm based on MSER and template matching," 12th International Conference on Signal Processing (ICSP), Hangzhou, pp. 1195-1199., 2014.

[9] A. Mammeri, A. Boukerche and E. H. Khiari, "MSER-based text detection and communication algorithm for autonomous vehicles", IEEE Symposium on Computers and Communication (ISCC), pp. 1218-1223., 2016.

[10] K. Mikolajczyk, T. Tuytelaars , T. Schmid , A. Zisserman, J. Matas, F. Schaffalitzky, T.Kadir, and L. Van Gool, " A Comparison of Affine Region Detectors", International Journal of Computer Vision, DOI: 10.1007/s11263-005-3848-x., 2005.

[11] W. Zhu, Q. Chen , C. Wei, Z. Li, A Segmentation Algorithm based on Image Projection for Complex Text Layout, 2nd International Conference on Materials Science, Resource and Environmental Engineering (MSREE), 030011-1–030011-8, 2017.

[12] H. Juffry, E. Chandra, and Sofyan, Deteksi Marka Jalan Dan Estimasi Posisi Menggunakan Multiresolution Hough Transform. Jurnal Teknik Komputer Binus, 21., 2013.

[13] P. Jaswanth, S. Anusuya, Anil Kumar, and T. Dhikhi , "Enhanced MSER Algorithm for Text Extraction", International Journal of Computational Intelligence and Informatics, Vol. 5, No. 4., 2016.

[14] MICC (Media Integration and Communication Center). MSER Presentation lecture, University of Firenze. 2016 [online]. Available : http://www.micc.unifi.it/delbimbo/wp-content/uploads/2011/03/slide_corso/A34%20MSER.pdf . [Accessed: 1-Jan-2018]

[15] E. Christopher and R. Munir, Pengembangan Algoritma Pengubahan Ukuran Citra Berbasiskan Analisis Gradien dengan Pendekatan Polinomial, Konferensi Nasional Informatika., 2013.



DOI: https://doi.org/10.22146/ijccs.41259

Article Metrics

Abstract views : 5102 | views : 3158

Refbacks

  • There are currently no refbacks.




Copyright (c) 2019 IJCCS (Indonesian Journal of Computing and Cybernetics Systems)

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.



Copyright of :
IJCCS (Indonesian Journal of Computing and Cybernetics Systems)
ISSN 1978-1520 (print); ISSN 2460-7258 (online)
is a scientific journal the results of Computing
and Cybernetics Systems
A publication of IndoCEISS.
Gedung S1 Ruang 416 FMIPA UGM, Sekip Utara, Yogyakarta 55281
Fax: +62274 555133
email:ijccs.mipa@ugm.ac.id | http://jurnal.ugm.ac.id/ijccs



View My Stats1
View My Stats2