Abstract
Human behavior analysis from big multimedia data has become a trending research area, with applications in domains such as surveillance, medicine, sports, and entertainment. Facial expression is one of the most prominent cues for determining an individual's behavior; however, analyzing it is very challenging due to variations in face pose, illumination, and facial tone. In this paper, we analyze human behavior from facial expressions in videos from popular TV series. First, we detect faces using the Viola-Jones algorithm and track them with the Kanade-Lucas-Tomasi (KLT) algorithm. Second, we use histogram of oriented gradients (HOG) features with a support vector machine (SVM) classifier for face recognition. Next, we recognize facial expressions using the proposed lightweight convolutional neural network (CNN). We apply data augmentation techniques to handle faces that appear from different views and under different lighting conditions in video data. Finally, we predict human behaviors using an occurrence matrix built from the face recognition and facial expression results. Subjective and objective experimental evaluations show better performance in both facial expression recognition and human behavior understanding.
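The final step of the pipeline, an occurrence matrix built from recognized identities and expressions, can be illustrated with a minimal sketch. The abstract does not specify the matrix layout, the expression classes, or how behavior is derived from the counts, so everything below is an assumption made for illustration: the `EXPRESSIONS` label set, the function names, and the use of per-person relative frequencies and a dominant expression as a simple behavior summary are hypothetical and are not the authors' method.

```python
from collections import Counter, defaultdict

# Hypothetical label set: the abstract does not list the expression classes.
EXPRESSIONS = ["angry", "happy", "neutral", "sad", "surprise"]


def build_occurrence_matrix(observations):
    """Count how often each identity shows each expression.

    `observations` is an iterable of (identity, expression) pairs, assumed
    to be one pair per recognized face per frame.
    Returns a dict mapping identity -> list of counts ordered as EXPRESSIONS.
    """
    counts = defaultdict(Counter)
    for identity, expression in observations:
        if expression not in EXPRESSIONS:
            raise ValueError(f"unknown expression: {expression!r}")
        counts[identity][expression] += 1
    return {
        identity: [per_person[e] for e in EXPRESSIONS]
        for identity, per_person in counts.items()
    }


def expression_profile(row):
    """Convert one row of counts into relative frequencies."""
    total = sum(row)
    if total == 0:
        return [0.0] * len(row)
    return [count / total for count in row]


def dominant_expression(row):
    """Return the most frequent expression in a row (first one wins ties)."""
    best_index = max(range(len(row)), key=row.__getitem__)
    return EXPRESSIONS[best_index]


if __name__ == "__main__":
    # Toy per-frame outputs standing in for the recognition stages.
    frames = [
        ("person_a", "happy"),
        ("person_a", "happy"),
        ("person_a", "sad"),
        ("person_b", "neutral"),
    ]
    matrix = build_occurrence_matrix(frames)
    for identity, row in matrix.items():
        print(identity, row, expression_profile(row), dominant_expression(row))
```

In this sketch each row summarizes one character across the video, so a behavior label could be assigned from the row's profile; the actual mapping from occurrence counts to behaviors is described in the full paper, not in the abstract.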

Cite this article
Sajjad, M., Zahir, S., Ullah, A. et al. Human Behavior Understanding in Big Multimedia Data Using CNN based Facial Expression Recognition. Mobile Netw Appl 25, 1611–1621 (2020). https://doi.org/10.1007/s11036-019-01366-9
DOI: https://doi.org/10.1007/s11036-019-01366-9