An improved multi-scale face detection using convolutional neural network

Mliki, Hazar; Dammak, Sahar; Fendri, Emna

doi:10.1007/s11760-020-01680-w

Original Paper
Published: 30 March 2020

Volume 14, pages 1345–1353, (2020)
Signal, Image and Video Processing

Hazar Mliki²,
Sahar Dammak¹ &
Emna Fendri³

Abstract

In this paper, we introduce a deep learning (CNN) based method for face detection in uncontrolled environments. The proposed method develops a CNN architecture dedicated to face detection that combines global and local features at multiple scales. The architecture is composed of two main networks: a region proposal network that generates a list of regions of interest (ROIs), and a second network that classifies these ROIs as face or non-face. Both networks share the full-image convolutional features of a pre-trained ResNet-50 model. Experiments were conducted on the WIDER Face and FDDB databases. The results demonstrate the efficiency and feasibility of the proposed method for multi-scale face detection.
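
The two-stage data flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `detect_faces`, `toy_backbone`, `toy_propose` and `toy_classify` are hypothetical, and the toy stand-ins only mimic the roles of the ResNet-50 backbone, the region proposal network and the face/non-face classifier. The point shown is that the feature map is computed once and reused by both stages.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]   # (x1, y1, x2, y2), x2 and y2 exclusive
FeatureMap = List[List[float]]


@dataclass
class Detection:
    box: Box
    score: float


def detect_faces(
    image: FeatureMap,
    backbone: Callable[[FeatureMap], FeatureMap],
    propose: Callable[[FeatureMap], List[Box]],
    classify: Callable[[FeatureMap, Box], float],
    threshold: float = 0.5,
) -> List[Detection]:
    """Two-stage detection over one shared feature map."""
    features = backbone(image)        # computed once, shared by both stages
    rois = propose(features)          # stage 1: regions of interest
    detections = []
    for box in rois:
        score = classify(features, box)   # stage 2: face / non-face score
        if score >= threshold:
            detections.append(Detection(box, score))
    return detections


# --- Toy stand-ins (assumptions for illustration only) ---

def toy_backbone(image: FeatureMap) -> FeatureMap:
    # Stand-in for a pre-trained ResNet-50: returns the input unchanged.
    return image


def toy_propose(features: FeatureMap, sizes=(2, 4)) -> List[Box]:
    # Stand-in for a region proposal network: non-overlapping square
    # windows at several scales.
    height, width = len(features), len(features[0])
    boxes = []
    for s in sizes:
        for y in range(0, height - s + 1, s):
            for x in range(0, width - s + 1, s):
                boxes.append((x, y, x + s, y + s))
    return boxes


def toy_classify(features: FeatureMap, box: Box) -> float:
    # Stand-in for the face/non-face network: mean activation in the ROI.
    x1, y1, x2, y2 = box
    values = [features[y][x] for y in range(y1, y2) for x in range(x1, x2)]
    return sum(values) / len(values)


image = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 1.0],
    [0.0, 0.0, 1.0, 1.0],
]
detections = detect_faces(image, toy_backbone, toy_propose, toy_classify)
print(detections)
```

In this toy input only the bottom-right 2×2 window passes the threshold; the full-image window at the larger scale scores 0.25 and is rejected.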

Author information

Authors and Affiliations

University of Sfax, MIRACL-FSEG, Sfax, Tunisia
Sahar Dammak
University of Sfax, MIRACL-ENET’COM, Sfax, Tunisia
Hazar Mliki
University of Sfax, MIRACL-FS, Sfax, Tunisia
Emna Fendri

Corresponding author

Correspondence to Sahar Dammak.

About this article

Cite this article

Mliki, H., Dammak, S. & Fendri, E. An improved multi-scale face detection using convolutional neural network. SIViP 14, 1345–1353 (2020). https://doi.org/10.1007/s11760-020-01680-w

Received: 07 August 2019
Revised: 11 December 2019
Accepted: 19 March 2020
Published: 30 March 2020
Issue Date: October 2020
DOI: https://doi.org/10.1007/s11760-020-01680-w
