A novel skew correction methodology for handwritten words in multilingual multi-oriented documents | Multimedia Tools and Applications Skip to main content
Log in

A novel skew correction methodology for handwritten words in multilingual multi-oriented documents

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Multi-oriented handwritten documents require additional preprocessing for segmentation and subsequent phases to work accurately in handwritten recognition systems. Skew correction is one such additional phase. Appearance of skew in multi-oriented Indian language based handwritten document is higher due to the presence of cursive nature. In the current work, we utilise a salient feature present in Indian scripts called \(m\bar {a}\)tr\(\bar {a}\) (also known as headline), extract a group of eligible pixels, and employ linear curve fitting for detecting and correcting skew in handwritten words. The proposed method is capable of correcting skew in four distinct Indian languages, viz. Bangla, Hindi, Marathi, and Punjabi. It is capable of efficiently handling skewed word images to an extent of ± 55 and delivers precise result even when the \(m\bar {a}\)tr\(\bar {a}\) is mostly absent or discontinuous.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Bag S, Harit G (2013) A survey on optical character recognition for Bangla and Devanagari scripts. Sadhana 38(1):133–168

    Article  Google Scholar 

  2. Bagdanov A, Kanai J (1997) Projection profile based skew estimation algorithm for JBIG compressed images. In: Proceedings of the international conference on document analysis and recognition, vol 1, pp 401–405. IEEE

  3. Basu S, Chaudhuri C, Kundu M, Nasipuri M, Basu DK (2007) Text line extraction from multi-skewed handwritten documents. Pattern Recogn 40 (6):1825–1839

    Article  Google Scholar 

  4. Bhowmik TK, Roy A, Roy U (2005) Character segmentation for handwritten Bangla words using artificial neural network. In: Proceedings of the IAPR TC3 NNLDAR

  5. Boukharouba A (2017) A new algorithm for skew correction and baseline detection based on the randomized Hough Transform. Journal of King Saud University-Computer and Information Sciences 29(1):29–38

    Article  Google Scholar 

  6. Brodić D, Milivojević ZN (2012) Estimation of the handwritten text skew based on binary moments. Radioengineering 21(1):162–169

    Google Scholar 

  7. Ghosh R, Mandal G (2012) Skew detection and correction of online Bangla handwritten word. Int J Comp Sci Issues 9(4):202

    Google Scholar 

  8. Gupta D, Bag S (2019) Handwritten multilingual word segmentation using polygonal approximation of digital curves for Indian languages. Multi Tools App 78(14):1–26

    Google Scholar 

  9. Guru DS, Ravikumar M, Manjunath S (2013) Multiple skew estimation in multilingual handwritten documents. Int J Comp Sci Issues 10(5):65

    Google Scholar 

  10. Guru DS, Suhil M, Ravikumar M, Manjunath S (2015) Small eigenvalue based skew estimation of handwritten Devanagari words. In: International conference on mining intelligence and knowledge exploration, pp 216–225. Springer

  11. Jayadevan R, Kolhe SR, Patil PM, Pal U (2011) Database development and recognition of handwritten Devanagari legal amount words. In: Proceedings of the international conference on document analysis and recognition, pp 304–308. IEEE

  12. Jundale TA, Hegadi RS (2015) Skew detection and correction of Devanagari script using Hough Transform. Proc Comp Sci 45:305–311

    Article  Google Scholar 

  13. Jundale TA, Hegadi RS (2015) Skew detection of Devanagari script using pixels of axes-parallel rectangle and linear regression. In: Proceedings of the international conference on energy systems and applications, pp 480–484. IEEE

  14. Kar R, Saha S, Bera SK, Kavallieratou E, Bhateja V, Sarkar R (2019) Novel approaches towards slope and slant correction for tri-script handwritten word images. The Imaging Sci J 67(3):159–170

    Article  Google Scholar 

  15. Kavallieratou E, Fakotakis N, Kokkinakis G (2002) Skew angle estimation for printed and handwritten documents using the Wigner–Ville distribution. Image Vis Comput 20(11):813–824

    Article  Google Scholar 

  16. Kumar R, Singh A (2010) Detection and segmentation of lines and words in Gurmukhi handwritten text. In: Proceedings of the international conference on advance computing conference, pp 353–356. IEEE

  17. Liang Y, He F, Zeng X (2020) 3D mesh simplification with feature preservation based on whale optimization algorithm and differential evolution. Integrated Computer-Aided Engineering Preprint, pp 1–19

  18. Liu S, Li M, Li M, Xu Q (2020) Research of animals image semantic segmentation based on deep learning. Concurrency and Computation: Practice and Experience 32(1):e4892

    Google Scholar 

  19. Liu S, Yu M, Li M, Xu Q (2019) The research of virtual face based on deep convolutional generative adversarial networks using tensorflow. Physica A: Statistical Mechanics and its Applications 521:667–680

    Article  Google Scholar 

  20. Malakar S, Seraogi B, Sarkar R, Das N, Basu S, Nasipuri M (2012) Two-stage skew correction of handwritten Bangla document images. In: Proceedings of the international conference on emerging applications of information technology, pp 303–306. IEEE

  21. Mei M, Zhong Y, He F, Xu C (2020) An innovative multi-label learning based algorithm for city data computing. GeoInformatica 24(1):221–245

    Article  Google Scholar 

  22. Mello Carlos AB, Sánchez A, Cavalcanti George DC (2011) Multiple line skew estimation of handwritten images of documents based on a visual perception approach. In: Proceedings of the international conference on computer analysis of images and patterns, pp 138–145. Springer

  23. Obaidullah SM, Halder C, Santosh KC, Das N, Roy K (2018) PHDIndic_11: page-level handwritten document image dataset of 11 official Indic scripts for script identification. Multi Tools App 77(2):1643–1678

    Article  Google Scholar 

  24. Pramanik R, Bag S (2017) Linear curve fitting-based headline estimation in handwritten words for Indian scripts. In: Proceedings of the international conference on pattern recognition and machine intelligence, pp 116–123. Springer

  25. Pramanik R, Bag S (2018) Linear regression-based skew correction of handwritten words in Indian languages. In: Proceedings of the international conference on computer vision & image processing, pp 129–139. Springer

  26. Pramanik R, Bag S (2018) Shape decomposition-based handwritten compound character recognition for Bangla OCR. J Vis Commun Image Represent 50:123–134

    Article  Google Scholar 

  27. Pramanik R, Bag S, Kumar R (2018) A fuzzy and contour-based segmentation methodology for handwritten Hindi words in legal documents. In: Proceedings of the international conference on recent advances in information technology, pp 1–6. IEEE

  28. Pramanik R, Raj V, Bag S (2018) Finding the optimum classifier: Classification of segmentable components in offline handwritten Devanagari words. In: Proceedings of the international conference on recent advances in information technology, pp 1–5. IEEE

  29. Roy A, Bhowmik TK, Parui SK, Roy U (2005) A novel approach to skew detection and character segmentation for handwritten Bangla words. In: Proceedings of the international conference on digital image computing: techniques and applications, pp 30–30. IEEE

  30. Roy K, Roy K, Pal U (2006) Segmentation of unconstrained handwritten text based on RLSA algorithm. In: Proceedings of the national conference on recent trends in information systems, pp 196–199

  31. Sharma MK, Dhaka VP (2016) Segmentation of English offline handwritten cursive scripts using a feedforward neural network. Neural Comput & Applic 27(5):1369–1379

    Article  Google Scholar 

  32. Shaw B, Parui SK (2010) A two stage recognition scheme for offline handwritten Devanagari words. In: Machine interpretation of patterns: image analysis and data mining, World Scientific, pp 145–165

  33. Shi Z, Govindaraju V (2003) Skew detection for complex document images using fuzzy runlength. In: Proceedings of the international conference on document analysis and recognition, p 715. IEEE

  34. Stamatopoulos N, Gatos B, Louloudis G, Pal U, Alaei A (2013) ICDAR 2013 handwriting segmentation contest. In: Proceedings of the international conference on document analysis and recognition, pp 1402–1406. IEEE

  35. Wu Y, He F, Zhang D, Li X (2015) Service-oriented feature-based data exchange for cloud-based design and manufacturing. IEEE Transactions on Services Computing 11(2):341–353

    Article  Google Scholar 

  36. Xu Q, Huang G, Yu M, Guo Y (2020) Fall prediction based on key points of human bones. Physica A: Statistical Mechanics and its Applications 540:123205

    Article  MathSciNet  Google Scholar 

  37. Xu Q, Li M, Li M, Liu S (2018) Energy spectrum CT image detection based dimensionality reduction with phase congruency. J Medical Systems 42 (3):49

    Article  Google Scholar 

  38. Xu Q, Wang F, Gong Y, Wang Z, Zeng K, Li Q, Luo X (2019) A novel edge-oriented framework for saliency detection enhancement. Image Vis Comput 87:1–12

    Article  Google Scholar 

  39. Xu Q, Wang Z, Wang F, Gong Y (2019) Multi-feature fusion CNNs for Drosophila embryo of interest detection. Physica A: Statistical Mechanics and its Applications 531:121808

    Article  Google Scholar 

  40. Xu Q, Wang Z, Wang F, Li J (2018) Thermal comfort research on human CT data modeling. Multi Tools App 77(5):6311–6326

    Article  MathSciNet  Google Scholar 

  41. Yu H, He F, Pan Y (2020) A scalable region-based level set method using adaptive bilateral filter for noisy image segmentation. Multi Tools App 79 (9):5743–5765

    Article  Google Scholar 

  42. Zhang DJ, He FZ, Han SH, Li XX (2016) Quantitative optimization of interoperability during feature-based data exchange. Integrated Computer-Aided Engineering 23(1):31–50

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rahul Pramanik.

Ethics declarations

Conflict of Interests

Rahul Pramanik declares that he has no conflict of interest. Soumen Bag declares that he has no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pramanik, R., Bag, S. A novel skew correction methodology for handwritten words in multilingual multi-oriented documents. Multimed Tools Appl 80, 27323–27342 (2021). https://doi.org/10.1007/s11042-021-10822-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-021-10822-2

Keywords

Navigation