Abstract
On-air writing can be considered as a time-dependent event where hand gesture is produced in a natural environment through index finger movement. A sequence of such movements containing several time steps in 3D space can be utilized to construct an English Capital Alphabet (ECA). While Previous researches investigated 2D features, we believe that depth information may play a significant role along with other features in recognition of these dynamic gestures. We have captured hand finger motion information using a depth camera and represented them as depth images for each ECA. The hand finger trajectory data were extracted from the depth image, and a combination of depth-based features and non-depth features were generated; depth variation was performed in the depth-based features, and then, all the feature values were converted into time-series data. Dynamic Time Warping distances were determined between a template ECA and a test ECA for each ECA collected from 15 participants. These distance-based features were then fed into a multi-class SVM for training and testing and got the recognition accuracy of \(80.77\%\) without depth and \(88.21\%\) with depth-based features. To cope with the over-fitting problem, we applied the resampling technique and got the highest recognition accuracy of 96.85%, and at last, we applied some feature selection techniques to analyze the recognition results.








Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Agrawal, S., Constandache, I., Gaonkar, S., Roy Choudhury, R., Caves, K., DeRuyter, F.: Using mobile phones to write in air. In: Proceedings of the 9th International Conference on Mobile Systems, Applications, and Services, MobiSys ’11, New York, NY, USA, pp. 15–28. ACM (2011)
Amma, C., Georgi, M., Schultz, T.: Airwriting: a wearable handwriting recognition system. Pers. Ubiquitous Comput. 18(1), 191–203 (2014)
Freeman, D., Barrentine, D.B.: Method and system for operating a multi-function portable electronic device using voice-activation, March 10 . US Patent 8,977,255 (2015)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. ACM SIGKDD Explor. Newslett. 11(1), 10–18 (2009)
Hunter, J.S.: The exponentially weighted moving average. J. Qual. Technol. 18(4), 203–210 (1986)
Islam, R., Mahmud, H., Hasan, M.K., Rubaiyeat, H.A.: Alphabet recognition in air writing using depth information. In: The Ninth International Conference on Advances in Computer–Human Interactions, pp. 299–301 (2016)
Jalalian, Arash, Chalup, Stephan K.: GDTW-P-SVMS: variable-length time series analysis using support vector machines. Neurocomputing 99, 270–282 (2013)
Jeong, Y.S., Jeong, M.K., Omitaomu, O.A.: Weighted dynamic time warping for time series classification. Pattern Recognit. 44(9), 2231–2240 (2011)
Kim, D.H., Choi, H.I., Kim, J.H.: 3d space handwriting recognition with ligature model. In: Proceedings of the Third International Conference on Ubiquitous Computing Systems, UCS’06, pp. 41–56. Springer, Berlin (2006)
Knowledge Euronews. Future of texting: writing in the air! Apr 8, 2013
Mahmud, H., Hasan, M., Kabir, M., Mottalib, M.A.: Recognition of symbolic gestures using depth information. Adv. Hum. Comput. Interact. (2018). https://doi.org/10.1155/2018/1069823
Mullin, R.: Time warps, string edits, and macromolecules. In: Kruskall, J., Liberman, M. (eds.) The theory and practice of sequence comparison. Advanced book program, Addison-Wesley, Reading, MA, Don mills, ON, 300 pp. us \$31.95. ISBN: 0-201-07809-0 (1983). Can. J. Stat. 13(2):167–168 (1985)
Priya, A., Mishra, S., Raj, S., Mandal, S., Datta, S.: Online and offline character recognition: a survey. In: International Conference on Communication and Signal Processing (ICCSP). IEEE (2016)
Ren, Z., Yuan, J., Meng, J., Zhang, Z.: Robust part-based hand gesture recognition using kinect sensor. IEEE Trans. Multimed. 15(5), 1110–1120 (2013)
Mohammadi, S., Maleki, R.: Air-writing recognition system for Persian numbers with a novel classifier. Vis. Comput. 36, 1001–1015 (2019)
Siena, F.L., Byrom, B., Watts, P., Breedon, P.: Utilising the intel realsense camera for measuring health outcomes in clinical research. J. Med. Syst. 42(3), 53 (2018)
Sreekanth, N.S., Narayanan, N.K.: Dynamic gesture recognition: a machine vision based approach. In: Proceedings of the International Conference on Signal, Networks, Computing, and Systems, pp. 105–115. Springer (2017)
Thomas, M.: A role for “air writing” in second-language learners’ acquisition of Japanese in the age of the word processor. J. Jpn. Linguist. 30(1), 86–106 (2014)
Wexelblat, A.: An approach to natural gesture in virtual environments. ACM Trans. Comput. Hum. Interact. 2(3), 179–200 (1995)
Yin, Y., Xie, L., Gu, T., Lu, Y., Lu, S.: AirContour: building contour-based model for in-air writing gesture recognition. ACM Trans. Sens. Netw. 15(4), 44:1–44:25 (2019)
Acknowledgements
The authors would like to thank to all users who have participated voluntarily in the data collection process.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Mahmud, H., Islam, R. & Hasan, M.K. On-air English Capital Alphabet (ECA) recognition using depth information. Vis Comput 38, 1015–1025 (2022). https://doi.org/10.1007/s00371-021-02065-x
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00371-021-02065-x