Enhanced Activity Recognition Through Joint Utilization of Decimal Descriptors and Temporal Binary Motions

Gnouma, Mariem; Yahia, Samah; Ejbali, Ridha; Zaied, Mourad

doi:10.1007/978-3-031-70819-0_28

Mariem Gnouma^14,16,16,
Samah Yahia^15,16,
Ridha Ejbali^14,16 &
…
Mourad Zaied^14,16

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 14811))

Included in the following conference series:

International Conference on Computational Collective Intelligence

256 Accesses

Abstract

Human Action Recognition (HAR) has been a prominent area of research within machine learning over the last few decades. Its applications span domains such as visual surveillance, robotics, and pedestrian detection. Despite the numerous techniques introduced by computer vision researchers to address HAR, persistent challenges include dealing with redundant features and computational cost. This paper specifically addresses the challenge of silhouette-based human activity recognition. While previous research on silhouette-based HAR has predominantly focused on recognition from a singular perspective, the aspect of view invariance has often been overlooked. This paper presents a novel framework that aims to achieve view-invariant Human Action Recognition. The proposed approach integrates a pre-processing stage based on the extraction of multiple 2D Differential History Binary Motions (DHBMs) from spatio-temporal frames capturing human motion. These multi-batch DHBMs are then used to capture and analyse human behaviour using the Decimal Descriptor Pattern (DDP) approach. This strategy enhances the extraction of intricate details from image data, contributing to a more robust HAR methodology. The selected features are processed by the Sparse Stacked Auto-encoder (SSAE), a representative of deep learning methods, to provide effective detection of human activity. The subsequent classification is performed using Softmax. The experiments are conducted on publicly available datasets, namely IXMAS and KTH. The results of the study demonstrate the superior performance of our methodology compared to previous approaches, achieving higher levels of accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 13727; Price includes VAT (Japan)

Softcover Book: JPY 9437; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Combining Handcrafted Spatio-Temporal and Deep Spatial Features for Effective Human Action Recognition

Article Open access 01 March 2025

Two-stream spatiotemporal feature fusion for human action recognition

Article 09 August 2020

On integration of multiple features for human activity recognition in video sequences

Article 31 July 2021

References

Zhang, S., Wei, Z., Nie, J., Huang, L., Wang, S., Li, Z.: A review on human activity recognition using vision-based method. J. Healthc. Eng. 2017, 3090343 (2017)
Article Google Scholar
Bhola, G., Vishwakarma, D.K.: A review of vision-based indoor HAR: state-of-the-art, challenges, and future prospects. Multimedia Tools and Applications 83(1), 1965–2005 (2024)
Article Google Scholar
Zebari, R., Abdulazeez, A., Zeebaree, D., Zebari, D., Saeed, J.: A comprehensive review of dimensionality reduction techniques for feature selection and feature extraction. J. Appli. Sci. Technolo. Trends 1(2), 56–70 (2020)
Article Google Scholar
Wu, D.; Sharma, N.; Blumenstein, M.: Recent advances in video-based human action recognition using deep learning: A review. In: Proceedings of the 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, USA, 14–19 May 2017, pp. 2865–2872 (2017)
Google Scholar
Sargano, A.B., et al.: Human action recognition using transfer learning with deep representations. In: International Joint Conference on Neural Network (IJCNN), pp. 463–469 (2017)
Google Scholar
Mathe,E., et al.: A deep learning approach for human action recognition using skeletal information. In: GeNeDis, P. V. (ed.), pp. 105–114 (2018)
Google Scholar
Zhang, Z., Lv, Z., Gan, C., Zhu, Q.: Human action recognition using convolutional LSTM and fully-connected LSTM with different attentions. Neurocomputing 410, 304–316 (2020)
Article Google Scholar
Mukherjee, D., et al.: EnsemConvNet: a deep learning approach for human activity recognition using smartphone sensors for healthcare applications. Multimedia Tools Appl. 79 (2020)
Google Scholar
Dai, C., Liu, X., Lai, J.: Human action recognition using two-stream attention based LSTM networks. Appl. Soft Comput. 86, 105820 (2020)
Article Google Scholar
Khan, M.A., et al.: Emergence of a novel coronavirus, severe acute respiratory syndrome coronavirus 2: biology and therapeutic options. J. Clin. Microbiol. 58(5) (2020)
Google Scholar
Nabati, M., Navidan, H., Shahbazian, R., Ghorashi, S.A., Windridge, D.: Using synthetic data to enhance the accuracy of fingerprint-based localization: A Deep Learning Approach. IEEE Sensors Lett. 2020(4), 6000204 (2020)
Google Scholar
Chahoushi, M., Nabati, M., Asvadi, R., Ghorashi, S.A.: CSI-Based human activity recognition using multi-input multi-output autoencoder and fine-tuning. Sensors 23(7), 3591 (2023)
Article Google Scholar
Cheng, X., Huang, B., Zong, J.: Device-free human activity recognition based on GMM-HMM using channel state information. IEEE Access 9, 76592–76601 (2021)
Article Google Scholar
Gnouma, M., Ejbali, R., Zaied, M.: A temporal human activity recognition based on stacked auto encoder and extreme learning machine. In: 2023 9th International Conference on Control, Decision and Information Technologies (CoDIT), pp. 1571–1576. IEEE (2023)
Google Scholar
Yahia, S., Salem, Y.B., Abdelkrim, M.N.: Texture analysis of magnetic resonance brain images to assess multiple sclerosis lesions. Multimed. Tools Appl. 77(23), 30769–30789 (2018)
Article Google Scholar
Yahia, S, Yassine, B.S., Abdelktim Naceur, A.M.: Multiple sclerosis lesions detection from noisy magnetic resonance brain images tissue. In: International Multi-Conference on Systems, Signals & Devices (SSD), pp. 240–245. IEEE (2018)
Google Scholar
Youbi, Z., Boubchir, L., Boukrouche, A.: Human ear recognition based on local multi-scale LBP features with city-block distance. Multimedia Tools Appli. 78, 14425–14441 (2019)
Article Google Scholar
Gnouma, M., Ladjailia, A., Ejbali, R., Zaied, M.: Stacked sparse autoencoder and history of binary motion image for human activity recognition. Multimedia Tools Appli. 78(2) (2019)
Google Scholar
Shaikh, I.A.K., Krishna, P.V., Biswal, S.G., Kumar, A.S., Baranidharan, S., Singh, K.: Bayesian optimization with stacked sparse autoencoder based cryptocurrency price prediction model. In: 2023 5th International Conference on Smart Systems and Inventive Technology (ICSSIT), pp. 653–658. IEEE (January 2023)
Google Scholar
Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: A local SVM approach. In: Proc. 17th International Conference on Pattern Recognition ICPR, vol. 3, pp. 32–36. IEEE (2004)
Google Scholar
Weinland, D., Ronfard, R., Boyer, E.: Free viewpoint action recognition using motion history volumes. Comput. Vision Image Underst. 104(2–3), 249–257 (2006)
Article Google Scholar
Liu, H., et al.: Study of human action recognition based on improved spatio-temporal features. Human Motion Sensing Recogn. Fuzzy Qualit. Approach, 233–250 (2017)
Google Scholar
Chun, S., Lee, C.S.: Human action recognition using histogram of motion intensity and direction from multiple views. IET Comput. Vision 10(4), 250–257 (2016)
Article Google Scholar
Nida, N., Yousaf, M.H., Irtaza, A., Velastin, S.A.: Video augmentation technique for human action recognition using genetic algorithm. ETRI J. 44(2), 327–338 (2022)
Article Google Scholar
Kiran, S., et al.: Multi-layered deep learning features fusion for human action recognition. Comput. Mater. Continua 69(3) (2021)
Google Scholar
Malik, N.U.R., Sheikh, U.U., Abu-Bakar, S.A.R., Channa, A.: Multi-view human action recognition using skeleton based-fineknn with extraneous frame scrapping technique. Sensors 23(5), 2745 (2023)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Research Team in Intelligent Machines, Gabes, Tunisia
Mariem Gnouma, Ridha Ejbali & Mourad Zaied
Modelisation Analyse and Control of Systems, Gabes, Tunisia
Samah Yahia
National Engineering School of Gabes, University of Gabes, Street Omar Ibn El Khattab, 6029, Gabes, Zrig, Tunisia
Mariem Gnouma, Mariem Gnouma, Samah Yahia, Ridha Ejbali & Mourad Zaied

Authors

Mariem Gnouma
View author publications
You can also search for this author in PubMed Google Scholar
Samah Yahia
View author publications
You can also search for this author in PubMed Google Scholar
Ridha Ejbali
View author publications
You can also search for this author in PubMed Google Scholar
Mourad Zaied
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mariem Gnouma .

Editor information

Editors and Affiliations

Wrocław University of Science and Technology, Wrocław, Poland
Ngoc Thanh Nguyen
University of Leipzig, Leipzig, Germany
Bogdan Franczyk
University of Leipzig, Leipzig, Sachsen, Germany
André Ludwig
Universidad Complutense de Madrid, Madrid, Spain
Manuel Núñez
Vrije Universiteit Amsterdam, Amsterdam, Noord-Holland, The Netherlands
Jan Treur
University of Münster, Münster, Germany
Gottfried Vossen
Wrocław University of Science and Technology, Wrocław, Poland
Adrianna Kozierkiewicz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gnouma, M., Yahia, S., Ejbali, R., Zaied, M. (2024). Enhanced Activity Recognition Through Joint Utilization of Decimal Descriptors and Temporal Binary Motions. In: Nguyen, N.T., et al. Computational Collective Intelligence. ICCCI 2024. Lecture Notes in Computer Science(), vol 14811. Springer, Cham. https://doi.org/10.1007/978-3-031-70819-0_28

Download citation

DOI: https://doi.org/10.1007/978-3-031-70819-0_28
Published: 31 August 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-70818-3
Online ISBN: 978-3-031-70819-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics