Abstract
Handwriting is a crucial way to enhance character recognition and learn new words. However, the Bangla characters consist of very complex shapes and similar patterns. Deep learning (DL) techniques have become a prominent solution for handwritten Bangla character recognition (HBCR) due to their ability to extract high-level features from complex data. Several DL techniques have been proposed for HBCR, but they are computationally expensive and large in model size and thus not suitable for use in resource-constrained devices such as smartphones. In this study, we have evaluated the state-of-the-art DL models for HBCR. For this, we have used four existing datasets and created a merged dataset (by combining the four) for cross-dataset evaluation. We have provided a comparative performance analysis of the state-of-the-art DL models for HBCR. Additionally, we have proposed a new lightweight DL model for HBCR and evaluated its performance. The proposed DL model consists of 74 layers, including sub-layers, and its architecture is divided into five similar blocks. It includes the convolutional layers of (3, 3) and (5, 5) kernels, (1,1) stride, and the maximum pool layer of the (2, 2) pool size. The proposed model achieved accuracy, model size, loading and testing times of 96.87%, 13 MB, 9.11 s, and 7.95 s, respectively. The experimental results show that our model outperformed state-of-the-art models in terms of efficiency (loading and testing time) and model size with competitive accuracy.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability statements
The datasets generated during and/or analysed during the current study are available from the corresponding author upon reasonable request.
References
Du S, Ibrahim M, Shehata M, Badawy W (2012) Automatic license plate recognition (ALPR): a state-of-the-art review. IEEE Trans Circuits Syst Video Technol 23(2):311–325
Srikantan JFG, Srihari S (1994) Handprinted character/digit recognition using a multiple feature/resolution philosophy. In: Proceeding of the fourth int’l workshop frontiers in handwriting recognition
Weng Y, Xia C (2020) A new deep learning-based handwritten character recognition system on mobile computing devices. Mobile Netw Appl 25(2):402–411
Simons GF, Fennig CD (2017) Summary by language size. Languages of World, Ethnologue
Rabby ASA, Haque S, Islam S, Abujar S, Hossain SA (2018) Bornonet: Bangla handwritten characters recognition using convolutional neural network. Procedia Comput Sci 143:528–535
Opu MNI, Islam MR, Kabir MA, Hossain MS, Islam MM (2022) Learn2write: augmented reality and machine learning-based mobile app to learn writing. Computers. https://doi.org/10.3390/computers11010004
Rakshit S, Ghosal D, Das T, Dutta S, Basu S (2010) Development of a multi-user recognition engine for handwritten Bangla basic characters and digits. arXiv preprint arXiv:1003.5897
Chaudhuri B (2006) A complete handwritten numeral database of Bangla—a major indic script. In Proceedings of tenth international workshop on frontiers in handwriting recognition, Suvisoft, Baule, France
Liu C-L, Suen CY (2009) A new benchmark on the recognition of handwritten Bangla and Farsi numeral characters. Pattern Recogn 42(12):3287–3295
Surinta O, Schomaker L, Wiering M (2013) A comparison of feature and pixel-based methods for recognizing handwritten Bangla digits. In: 2013 12th international conference on document analysis and recognition. IEEE, pp 165–169
Bhowmik TK, Ghanty P, Roy A, Parui SK (2009) SVM-based hierarchical architectures for handwritten Bangla character recognition. Int J Doc Anal Recognit (IJDAR) 12(2):97–108
Rabby ASA, Abujar S, Haque S, Hossain SA (2019) Bangla handwritten digit recognition using convolutional neural network. In: Emerging technologies in data mining and information security. Springer, Berlin, pp 111–122
Hakim SA, et al. (2019) Handwritten Bangla numeral and basic character recognition using deep convolutional neural network. In: 2019 international conference on electrical, computer and communication engineering (ECCE). IEEE, pp 1–6
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
Hasnat MA, Habib SM, Khan M (2008) A high performance domain specific OCR for Bangla script. In: Novel algorithms and techniques in telecommunications, automation and industrial electronics. Springer, Berlin, pp 174–178
Pal U, Chaudhuri B (1994) OCR in Bangla: an indo-Bangladeshi language. In: Proceedings of the 12th IAPR international conference on pattern recognition, vol 3-conference C: signal processing (Cat. No. 94CH3440-5). IEEE, vol 2, pp 269–273
Chowdhury A, Foysal A, Islam S (2016) Bangla character recognition for android devices. Int J Comput Appl 136(11):13–19
Pal U, Chaudhuri B (2000) Automatic recognition of unconstrained off-line Bangla handwritten numerals. In: International conference on multimodal interfaces. Springer, Berlin, pp 371–378
Bhattacharya U, Chaudhuri B (2003) A majority voting scheme for multiresolution recognition of handprinted numerals. In: Seventh international conference on document analysis and recognition, 2003. Proceedings. IEEE, pp 16–20
Basu S, Das N, Sarkar, R., Kundu M, Nasipuri M, Basu DK (2012) An MLP based approach for recognition of handwritten bangla’numerals. arXiv preprint arXiv:1203.0876
Xu J-w, Xu J, Lu Y (2008) Handwritten Bangla digit recognition using hierarchical Bayesian network. In: 2008 3rd international conference on intelligent system and knowledge engineering. IEEE, vol 1, pp 1096–1099
Cireşan D, Meier U (2015) Multi-column deep neural networks for offline handwritten Chinese character classification. In: 2015 international joint conference on neural networks (IJCNN). IEEE, pp 1–6
Roy S, Das N, Kundu M, Nasipuri M (2017) Handwritten isolated Bangla compound character recognition: a new benchmark using a novel deep learning approach. Pattern Recogn Lett 90:15–21
Maitra DS, Bhattacharya U, Parui SK (2015) CNN based common approach to handwritten character recognition of multiple scripts. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR). IEEE, pp 1021–1025
Chowdhury RR, Hossain MS, Ul Islam R, Andersson K, Hossain S (2019) Bangla handwritten character recognition using convolutional neural network with data augmentation. In: 2019 Joint 8th international conference on informatics, electronics and vision (ICIEV) and 2019 3rd international conference on imaging, vision and pattern recognition (icIVPR). IEEE, pp 318–323
Sethy A, Patra PK, Nayak SR (2020) Offline handwritten numeral recognition using convolution neural network. Mach Vis Insp Syst 1:197–212. https://doi.org/10.1002/9781119682042.ch9
Alom MZ, Sidike P, Hasan M, Taha TM, Asari VK (2018) Handwritten Bangla character recognition using the state-of-the-art deep convolutional neural networks. Comput Intell Neurosci. https://doi.org/10.1155/2018/6747098
Shawon A, Rahman MJ-U, Mahmud F, Zaman MA (2018) Bangla handwritten digit recognition using deep CNN for large and unbiased dataset. In: 2018 international conference on Bangla speech and language processing (ICBSLP). IEEE, pp 1–6
Saha C, Faisal RH, Rahman MM (2019) Bangla handwritten basic character recognition using deep convolutional neural network. In: 2019 joint 8th international conference on informatics, electronics and vision (ICIEV) and 2019 3rd international conference on imaging, vision and pattern recognition (icIVPR). IEEE, pp 190–195
Rabby ASA, Haque S, Abujar S, Hossain SA (2018) Ekushnet: using convolutional neural network for Bangla handwritten recognition. Procedia Comput Sci 143:603–610
Sandler M, Howard A, Zhu M, Zhmoginov A, Chen L-C (2018) Mobilenetv2: inverted residuals and linear bottlenecks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4510–4520
Wang Z, Li C, Wang X (2021) Convolutional neural network pruning with structural redundancy reduction. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 14913–14922
Wang Z, Li C, Wang X, Wang D (2019) Towards efficient convolutional neural networks through low-error filter saliency estimation. In: PRICAI 2019: trends in Artificial Intelligence: 16th Pacific rim international conference on artificial intelligence, Cuvu, Yanuca Island, Fiji, August 26–30, 2019, Proceedings, Part II 16, pp 255–267. Springer, Berlin
Wang Z, Li C (2022) Channel pruning via lookahead search guided reinforcement learning. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision, pp 2029–2040
Hinton G, Vinyals O, Dean J (2015) Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531
Wang Z (2021) Zero-shot knowledge distillation from a decision-based black-box model. In: International conference on machine learning, pp 10675–10685. PMLR
Li C, Wang Z, Qi H (2022) Online knowledge distillation by temporal–spatial boosting. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision, pp 197–206
Wu G, Gong S (2021) Peer collaborative learning for online knowledge distillation. In: Proceedings of the AAAI conference on artificial intelligence, vol 35, pp 10302–10310
Wang Z (2021) Data-free knowledge distillation with soft targeted transfer set synthesis. In: Proceedings of the AAAI conference on artificial intelligence, vol 35, pp 10245–10253
Sarkar R, Das N, Basu S, Kundu M, Nasipuri M, Basu DK (2012) Cmaterdb1: a database of unconstrained handwritten Bangla and Bangla–English mixed script document image. Int J Doc Anal Recognit (IJDAR) 15(1):71–83
Biswas M, Islam R, Shom GK, Shopon M, Mohammed N, Momen S, Abedin A (2017) Banglalekha-isolated: a multi-purpose comprehensive dataset of handwritten Bangla isolated characters. Data Brief 12:103–107
Bhattacharya U, Chaudhuri BB (2008) Handwritten numeral databases of Indian scripts and multistage recognition of mixed numerals. IEEE Trans Pattern Anal Mach Intell 31(3):444–457
Rabby ASA, Haque S, Islam MS, Abujar S, Hossain SA (2018) EKUSH: a multipurpose and multitype comprehensive database for online off-line Bangla handwritten characters. In: International conference on recent trends in image processing and pattern recognition. Springer, Berlin, pp 149–158
Kingma DP, Ba J (2014) ADAM: a method for stochastic optimization. arXiv preprint arXiv:1412.6980
Schaul T, Zhang S, LeCun Y (2013) No more pesky learning rates. In: International conference on machine learning, pp 343–351
Perez L, Wang J (2017) The effectiveness of data augmentation in image classification using deep learning. arXiv preprint arXiv:1712.04621
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Opu, M.N.I., Hossain, M.E. & Kabir, M.A. Handwritten Bangla character recognition using convolutional neural networks: a comparative study and new lightweight model. Neural Comput & Applic 36, 337–348 (2024). https://doi.org/10.1007/s00521-023-09008-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-023-09008-8