Abstract
Optical Character Recognition (OCR) has an important role in information retrieval which converts scanned documents into machine editable and searchable text formats. This work is focussing on the recognition part of OCR. LeNet-5, a Convolutional Neural Network (CNN) trained with gradient based learning and backpropagation algorithm is used for classification of Malayalam character images. Result obtained for multi-class classifier shows that CNN performance is dropping down when the number of classes exceeds range of 40. Accuracy is improved by grouping misclassified characters together. Without grouping, CNN is giving an average accuracy of 75% and after grouping the performance is improved upto 92%. Inner level classification is done using multi-class SVM which is giving an average accuracy in the range of 99-100%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Neeba, N.V., Namboodiri, A., Jawahar, C.V., Narayanan, P.J.: Recognition of Malayalam Documents. In: Advances in Pattern Recognition, Guide to OCR for Indic Scripts. Springer, London (2009)
Neeba, N.V., Jawahar, C.V.: Empirical evaluation of character classification schemes. In: Seventh International Conference on ICAPR 2009. Advances in Pattern Recognition, pp. 310–313. IEEE (2009)
Anil, R., Pradeep, A., Midhun, E.M., Manjusha, K.: Malayalam Character Recognition using Singular Value Decomposition. International Journal of Computer Applications (0975 – 8887) 92(12) (April 2014)
Divakaran, S.: Spectral Analysis of Projection Histogram for En- hancing Close matching character Recognition in Malayalam. International Journal of Computer Science and Information Technology (IJCSIT) 4(2) (April 2012)
Chaudhuri, B.B.: On OCR of a Printed Indian Script. In: Advances in Pattern Recognition (ed) Digital Document Processing. Springer, London (2007)
Lecun, Y.E.: Learning algorithms for classification: A comparison on handwritten digit recognition. Neural Networks: The Statistical Mechanics Perspective, 261–276 (1995)
Bouchain, D.: Character Recognition Using Convolutional Neural Networks. Institute for Neural Information Processing 2007 (2006)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-Based Learning Applied to Document Recognition. Proceedings of the IEEE 86(11), 2278–2324 (1998)
Wei, Y., Xia, W., Huang, J., Ni, B., Dong, J., Zhao, Y., Yan, S.: CNN: Single-label to Multi-label, arXiv preprint arXiv: 1406.5726 (2014)
Ciresan, D., Meier, U., Schmidhuber, J.: Multi-column Deep Neural Networks for Image Classification. In: Computer Vision and Pattern Recognition, pp. 3642–3649 (2012)
Soman, K.P., Loganathan, R., Ajay, V.: Machine Learning with SVM and other Kernel methods. PHI Learning Pvt. Ltd (2009)
Ramanathan, R., Arun, S., Nair, V., Vidhya Sagar, N.: A support vector machines approach for efficient facial expression recognition. In: International Conference on Advances in Recent Technologies in Communication and Computing, ARTCom 2009, pp. 850–854. IEEE (2009)
Hsu, C.-W., Lin, C.-J.: A comparison of methods for multiclass support vector machines. IEEE Transactions on Neural Networks 13(2), 415–425 (2002)
Cherian, M., Radhika, G., Shajeesh, K.U., Soman, K.P., Sabarimalai Manikandan, M.: A Levelset Based Binarization and Segmentation for Scanned Malayalam Document Image Analysis. In: IEEE International Conference on computational Intelligence and Computing Research (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Anil, R., Manjusha, K., Kumar, S.S., Soman, K.P. (2015). Convolutional Neural Networks for the Recognition of Malayalam Characters. In: Satapathy, S., Biswal, B., Udgata, S., Mandal, J. (eds) Proceedings of the 3rd International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA) 2014. Advances in Intelligent Systems and Computing, vol 328. Springer, Cham. https://doi.org/10.1007/978-3-319-12012-6_54
Download citation
DOI: https://doi.org/10.1007/978-3-319-12012-6_54
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12011-9
Online ISBN: 978-3-319-12012-6
eBook Packages: EngineeringEngineering (R0)