Abstract
Object detection is closely related to video analysis and image retention, which has attracted many researchers to research in this field. Traditional object detection methods are built with hand-crafted features and shallow trainable architectures. The resulting accuracy of traditional object detection is very much influenced by the selected features. The development of the field of Artificial Intelligence (AI), especially Deep Learning (DL), has made DL a powerful model for object detection. This is because DL has semantic analysis capabilities, high-level, deeper features, which are problems that often arise in traditional object detection. However, there are a few things that need to be fixed regarding the application of object detection in the real world because there are many small objects and varied backgrounds. Manual labeling of small objects is quite a time consuming and costly. The lack of a dataset to train small objects greatly affects the accuracy of the Convolutional Neural Network (CNN) model that was built. Single Shot Multi box Detector (SDD) as an object detection framework can detect objects of different sizes. To improve SSD accuracy in detecting small objects, in this paper, we replaced the SSD backbone using ResNeXt101. The experimental results yield better accuracy than the previous SSD framework with ResNet101. SSD (ResNeXt101) reach accurate 67.17% while SSD (ResNet101) with accurate 66.09%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Zhao, Z.Q., Zheng, P., Xu, S.T., Wu, X.: Object detection with deep learning: a review. IEEE Trans. Neural Netw. Learn. Syst. 30(11), 3212–3232 (2018)
Lu, F.R.X., Kang, X., Nishide, S.: Object detection based-on SSD ResNet. In: Proceedings of CCIS, pp. 89–92 (2019)
Wu, X., Sahoo, D., Hoi, S.C.H.: Recent advances in deep learning for object detection. Neurocomputing 396, 39–64 (2020). https://doi.org/10.1016/j.neucom.2020.01.085
LeCun, G.H.Y., Bengio, Y.: Deep learning. Nature 521(7553), 436–444 (2015)
Pitts, W., McCulloch, W.S.: How we know universals the perception of auditory and visual forms. Bull. Math. Biophys. 9(3), 127–147 (1947). https://doi.org/10.1007/BF02478291
Zhu, H., Wei, H., Li, B., Yuan, X., Kehtarnavaz, N.: A review of video object detection: datasets, metrics and methods. Appl. Sci. 10(21), 1–24 (2020). https://doi.org/10.3390/app10217834
Liu, N., Han, J., Zhang, D., Wen, S., Liu, T.: Predicting eye fixations using convolutional neural networks. In: Proceedings of IEEE Computer Society Conference on Computer Vision Pattern Recognition, 07–12 June 2015, pp. 362–370 (2015). https://doi.org/10.1109/CVPR.2015.7298633
Yin, Z., Lin, J.: Research on SSD algorithm based on occlusion detection. In: 2019 IEEE 3rd International Conference on Electronic Information Technology and Computer Engineering, EITCE 2019, pp. 1987–1992 (2019). https://doi.org/10.1109/EITCE47263.2019.9094772
Cheng, Y., Chen, C., Gan, Z.: Enhanced single shot multibox detector for pedestrian detection. In: ACM International Conference Proceeding Series (2019). https://doi.org/10.1145/3331453.3361665
Yang, J., He, W.Y., Zhang, T.L., Zhang, C.L., Zeng, L., Nan, B.F.: Research on subway pedestrian detection algorithms based on SSD model. IET Intell. Transp. Syst. 14(11), 1491–1496 (2020). https://doi.org/10.1049/iet-its.2019.0806
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Xie, S., Girshick, R., Dollár, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks. In: Proceedings - 30th IEEE Conference Computer Vision Pattern Recognition, CVPR 2017, vol. 2017-Janua, pp. 5987–5995 (2017). https://doi.org/10.1109/CVPR.2017.634
Szegedy, C., Ioffe, S., Vanhoucke, V., Alemi, A.A.: Inceptionv4, inception-resnet and the impact of residualconnections on learning. In: ICLR Working (2016)
Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of IEEE Computer Society Conference on Computer Vision Pattern Recognition, 07–12 June 2015, pp. 1–9 (2015). https://doi.org/10.1109/CVPR.2015.7298594
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Khusni, U., Arymurthy, A.M., Susanto, H. (2022). Small Object Detection Based on SSD-ResNeXt101. In: Mahyuddin, N.M., Mat Noor, N.R., Mat Sakim, H.A. (eds) Proceedings of the 11th International Conference on Robotics, Vision, Signal Processing and Power Applications. Lecture Notes in Electrical Engineering, vol 829. Springer, Singapore. https://doi.org/10.1007/978-981-16-8129-5_162
Download citation
DOI: https://doi.org/10.1007/978-981-16-8129-5_162
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-8128-8
Online ISBN: 978-981-16-8129-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)