Convolutional Neural Networks for the Recognition of Malayalam Characters | SpringerLink
Skip to main content

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 328))

Abstract

Optical Character Recognition (OCR) has an important role in information retrieval which converts scanned documents into machine editable and searchable text formats. This work is focussing on the recognition part of OCR. LeNet-5, a Convolutional Neural Network (CNN) trained with gradient based learning and backpropagation algorithm is used for classification of Malayalam character images. Result obtained for multi-class classifier shows that CNN performance is dropping down when the number of classes exceeds range of 40. Accuracy is improved by grouping misclassified characters together. Without grouping, CNN is giving an average accuracy of 75% and after grouping the performance is improved upto 92%. Inner level classification is done using multi-class SVM which is giving an average accuracy in the range of 99-100%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 22879
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 28599
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Neeba, N.V., Namboodiri, A., Jawahar, C.V., Narayanan, P.J.: Recognition of Malayalam Documents. In: Advances in Pattern Recognition, Guide to OCR for Indic Scripts. Springer, London (2009)

    Google Scholar 

  2. Neeba, N.V., Jawahar, C.V.: Empirical evaluation of character classification schemes. In: Seventh International Conference on ICAPR 2009. Advances in Pattern Recognition, pp. 310–313. IEEE (2009)

    Google Scholar 

  3. Anil, R., Pradeep, A., Midhun, E.M., Manjusha, K.: Malayalam Character Recognition using Singular Value Decomposition. International Journal of Computer Applications (0975 – 8887) 92(12) (April 2014)

    Google Scholar 

  4. Divakaran, S.: Spectral Analysis of Projection Histogram for En- hancing Close matching character Recognition in Malayalam. International Journal of Computer Science and Information Technology (IJCSIT) 4(2) (April 2012)

    Google Scholar 

  5. Chaudhuri, B.B.: On OCR of a Printed Indian Script. In: Advances in Pattern Recognition (ed) Digital Document Processing. Springer, London (2007)

    Google Scholar 

  6. Lecun, Y.E.: Learning algorithms for classification: A comparison on handwritten digit recognition. Neural Networks: The Statistical Mechanics Perspective, 261–276 (1995)

    Google Scholar 

  7. Bouchain, D.: Character Recognition Using Convolutional Neural Networks. Institute for Neural Information Processing 2007 (2006)

    Google Scholar 

  8. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-Based Learning Applied to Document Recognition. Proceedings of the IEEE 86(11), 2278–2324 (1998)

    Article  Google Scholar 

  9. Wei, Y., Xia, W., Huang, J., Ni, B., Dong, J., Zhao, Y., Yan, S.: CNN: Single-label to Multi-label, arXiv preprint arXiv: 1406.5726 (2014)

    Google Scholar 

  10. Ciresan, D., Meier, U., Schmidhuber, J.: Multi-column Deep Neural Networks for Image Classification. In: Computer Vision and Pattern Recognition, pp. 3642–3649 (2012)

    Google Scholar 

  11. Soman, K.P., Loganathan, R., Ajay, V.: Machine Learning with SVM and other Kernel methods. PHI Learning Pvt. Ltd (2009)

    Google Scholar 

  12. Ramanathan, R., Arun, S., Nair, V., Vidhya Sagar, N.: A support vector machines approach for efficient facial expression recognition. In: International Conference on Advances in Recent Technologies in Communication and Computing, ARTCom 2009, pp. 850–854. IEEE (2009)

    Google Scholar 

  13. Hsu, C.-W., Lin, C.-J.: A comparison of methods for multiclass support vector machines. IEEE Transactions on Neural Networks 13(2), 415–425 (2002)

    Article  Google Scholar 

  14. Cherian, M., Radhika, G., Shajeesh, K.U., Soman, K.P., Sabarimalai Manikandan, M.: A Levelset Based Binarization and Segmentation for Scanned Malayalam Document Image Analysis. In: IEEE International Conference on computational Intelligence and Computing Research (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. Anil .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Anil, R., Manjusha, K., Kumar, S.S., Soman, K.P. (2015). Convolutional Neural Networks for the Recognition of Malayalam Characters. In: Satapathy, S., Biswal, B., Udgata, S., Mandal, J. (eds) Proceedings of the 3rd International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA) 2014. Advances in Intelligent Systems and Computing, vol 328. Springer, Cham. https://doi.org/10.1007/978-3-319-12012-6_54

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12012-6_54

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12011-9

  • Online ISBN: 978-3-319-12012-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics