Small Object Detection Based on SSD-ResNeXt101 | SpringerLink
Skip to main content

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 829))

Abstract

Object detection is closely related to video analysis and image retention, which has attracted many researchers to research in this field. Traditional object detection methods are built with hand-crafted features and shallow trainable architectures. The resulting accuracy of traditional object detection is very much influenced by the selected features. The development of the field of Artificial Intelligence (AI), especially Deep Learning (DL), has made DL a powerful model for object detection. This is because DL has semantic analysis capabilities, high-level, deeper features, which are problems that often arise in traditional object detection. However, there are a few things that need to be fixed regarding the application of object detection in the real world because there are many small objects and varied backgrounds. Manual labeling of small objects is quite a time consuming and costly. The lack of a dataset to train small objects greatly affects the accuracy of the Convolutional Neural Network (CNN) model that was built. Single Shot Multi box Detector (SDD) as an object detection framework can detect objects of different sizes. To improve SSD accuracy in detecting small objects, in this paper, we replaced the SSD backbone using ResNeXt101. The experimental results yield better accuracy than the previous SSD framework with ResNet101. SSD (ResNeXt101) reach accurate 67.17% while SSD (ResNet101) with accurate 66.09%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 34319
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
JPY 42899
Price includes VAT (Japan)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Zhao, Z.Q., Zheng, P., Xu, S.T., Wu, X.: Object detection with deep learning: a review. IEEE Trans. Neural Netw. Learn. Syst. 30(11), 3212–3232 (2018)

    Article  Google Scholar 

  2. Lu, F.R.X., Kang, X., Nishide, S.: Object detection based-on SSD ResNet. In: Proceedings of CCIS, pp. 89–92 (2019)

    Google Scholar 

  3. Wu, X., Sahoo, D., Hoi, S.C.H.: Recent advances in deep learning for object detection. Neurocomputing 396, 39–64 (2020). https://doi.org/10.1016/j.neucom.2020.01.085

    Article  Google Scholar 

  4. LeCun, G.H.Y., Bengio, Y.: Deep learning. Nature 521(7553), 436–444 (2015)

    Article  Google Scholar 

  5. Pitts, W., McCulloch, W.S.: How we know universals the perception of auditory and visual forms. Bull. Math. Biophys. 9(3), 127–147 (1947). https://doi.org/10.1007/BF02478291

    Article  Google Scholar 

  6. Zhu, H., Wei, H., Li, B., Yuan, X., Kehtarnavaz, N.: A review of video object detection: datasets, metrics and methods. Appl. Sci. 10(21), 1–24 (2020). https://doi.org/10.3390/app10217834

    Article  Google Scholar 

  7. Liu, N., Han, J., Zhang, D., Wen, S., Liu, T.: Predicting eye fixations using convolutional neural networks. In: Proceedings of IEEE Computer Society Conference on Computer Vision Pattern Recognition, 07–12 June 2015, pp. 362–370 (2015). https://doi.org/10.1109/CVPR.2015.7298633

  8. Yin, Z., Lin, J.: Research on SSD algorithm based on occlusion detection. In: 2019 IEEE 3rd International Conference on Electronic Information Technology and Computer Engineering, EITCE 2019, pp. 1987–1992 (2019). https://doi.org/10.1109/EITCE47263.2019.9094772

  9. Cheng, Y., Chen, C., Gan, Z.: Enhanced single shot multibox detector for pedestrian detection. In: ACM International Conference Proceeding Series (2019). https://doi.org/10.1145/3331453.3361665

  10. Yang, J., He, W.Y., Zhang, T.L., Zhang, C.L., Zeng, L., Nan, B.F.: Research on subway pedestrian detection algorithms based on SSD model. IET Intell. Transp. Syst. 14(11), 1491–1496 (2020). https://doi.org/10.1049/iet-its.2019.0806

    Article  Google Scholar 

  11. Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2

    Chapter  Google Scholar 

  12. Xie, S., Girshick, R., Dollár, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks. In: Proceedings - 30th IEEE Conference Computer Vision Pattern Recognition, CVPR 2017, vol. 2017-Janua, pp. 5987–5995 (2017). https://doi.org/10.1109/CVPR.2017.634

  13. Szegedy, C., Ioffe, S., Vanhoucke, V., Alemi, A.A.: Inceptionv4, inception-resnet and the impact of residualconnections on learning. In: ICLR Working (2016)

    Google Scholar 

  14. Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of IEEE Computer Society Conference on Computer Vision Pattern Recognition, 07–12 June 2015, pp. 1–9 (2015). https://doi.org/10.1109/CVPR.2015.7298594

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Uus Khusni .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Khusni, U., Arymurthy, A.M., Susanto, H. (2022). Small Object Detection Based on SSD-ResNeXt101. In: Mahyuddin, N.M., Mat Noor, N.R., Mat Sakim, H.A. (eds) Proceedings of the 11th International Conference on Robotics, Vision, Signal Processing and Power Applications. Lecture Notes in Electrical Engineering, vol 829. Springer, Singapore. https://doi.org/10.1007/978-981-16-8129-5_162

Download citation

Publish with us

Policies and ethics