Deep Learning Based Pedestrian Detection at Distance in Smart Cities | SpringerLink
Skip to main content

Deep Learning Based Pedestrian Detection at Distance in Smart Cities

  • Conference paper
  • First Online:
Intelligent Systems and Applications (IntelliSys 2019)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1038))

Included in the following conference series:

Abstract

Generative adversarial networks (GANs) have been promising for many computer vision problems due to their powerful capabilities to enhance the data for training and test. In this paper, we leveraged GANs and proposed a new architecture with a cascaded Single Shot Detector (SSD) for pedestrian detection at distance, which is yet a challenge due to the varied sizes of pedestrians in videos at distance. To overcome the low-resolution issues in pedestrian detection at distance, DCGAN is employed to improve the resolution first to reconstruct more discriminative features for a SSD to detect objects in images or videos. A crucial advantage of our method is that it learns a multi-scale metric to distinguish multiple objects at different distances under one image, while DCGAN serves as an encoder-decoder platform to generate parts of an image that contain better discriminative information. To measure the effectiveness of our proposed method, experiments were carried out on the Canadian Institute for Advanced Research (CIFAR) dataset, and it was demonstrated that the proposed new architecture achieved a much better detection rate, particularly on vehicles and pedestrians at distance, making it highly suitable for smart cities applications that need to discover key objects or pedestrians at distance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 22879
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 28599
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Dinakaran, R., et al.: Image resolution impact analysis on pedestrian detection in smart cities surveillance. In: Proceedings of the 1st International Conference on Internet of Things & Ma-chine Learning (IML 2017) (2017)

    Google Scholar 

  2. Ren, S., et al.: Faster R-CNN: towards real-time object detection with region proposal net-works. In: NIPS (2015)

    Google Scholar 

  3. Dosovitskiy, A., et al.; Discriminative unsupervised feature learning with exemplar convolutional neural networks. IEEE Trans. Pattern Anal. Mach. Intell. 99 (2015)

    Google Scholar 

  4. Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., Berg, A.C.: SSD: single shot multibox detector. In: ECCV (2016)

    Google Scholar 

  5. Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., WardeFarley, D., Ozair, S., Courville, A., Bengio, Y.: Generative adversarial nets. In: NIPS (2014)

    Google Scholar 

  6. Nguyen, A., et al.: Plug & play generative networks: conditional iterative generation of images in latent space. In: CVPR (2017)

    Google Scholar 

  7. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (2015)

    Google Scholar 

  8. Denton, E.L., et al.: Deep generative image models using a laplacian pyramid of adversarial networks. In: NIPS, vol. 2, pp. 1486–1494 (2015)

    Google Scholar 

  9. Ledig, C., et al.: Photo-realistic single image super resolution using a generative adversarial network. In: CVPR open access version by computer vision foundation, CVPR (2017)

    Google Scholar 

  10. Chen, X., et al.: InfoGAN: interpretable representation learning by information maximizing generative adversarial nets. In: Advances in Neural Information Processing Systems (NIPS 2016), vol. 29 (2019)

    Google Scholar 

  11. Finn, C., et al.: Unsupervised learning for physical interaction through video prediction. In: Advances in Neural information Processing Systems, NIPS, vol. 29 (2016)

    Google Scholar 

  12. Mathieu, M., et al.: Deep multi-scale video prediction beyond mean square error. In: ICLR (2016)

    Google Scholar 

  13. Storey, G., Jiang, R., Bouridane, A.: Role for 2D image generated 3D face models in the rehabilitation of facial palsy. IET Healthcare Technology Letters (2017)

    Google Scholar 

  14. Storey, G., Bouridane, A., Jiang, R.: Integrated deep model for face detection and landmark localisation from in the wild image, IEEE Access (in press)

    Google Scholar 

  15. Jiang, R., Ho, A.T., Cheheb, I., Al-Maadeed, N., Al-Maadeed, S., Bouridane, A.: Emotion recognition from scrambled facial images via many graph embedding. Pattern Recogn. 67, 245–251 (2017)

    Article  Google Scholar 

  16. Jiang, R., Al-Maadeed, S., Bouridane, A., Crookes, D., Celebi, M.E.: Face recognition in the scrambled domain via salience-aware ensembles of many kernels. IEEE Trans. Inf. Forensics Secur. 11(8), 1807–1817 (2016)

    Article  Google Scholar 

  17. Jiang, R., Bouridane, A., Crookes, D., Celebi, M.E., Wei, H.L.: Privacy-protected facial biometric verification via fuzzy forest learning. IEEE Trans. Fuzzy Syst. 24(4), 779–790 (2016)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ranjith K Dinakaran .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dinakaran, R.K. et al. (2020). Deep Learning Based Pedestrian Detection at Distance in Smart Cities. In: Bi, Y., Bhatia, R., Kapoor, S. (eds) Intelligent Systems and Applications. IntelliSys 2019. Advances in Intelligent Systems and Computing, vol 1038. Springer, Cham. https://doi.org/10.1007/978-3-030-29513-4_43

Download citation

Publish with us

Policies and ethics