Efficient Object Detection and Classification of Ground Objects from Thermal Infrared Remote Sensing Image Based on Deep Learning | SpringerLink
Skip to main content

Efficient Object Detection and Classification of Ground Objects from Thermal Infrared Remote Sensing Image Based on Deep Learning

  • Conference paper
  • First Online:
Pattern Recognition and Computer Vision (PRCV 2021)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 13022))

Included in the following conference series:

Abstract

Wild searching and nature reserve monitoring are formidable tasks. In order to relieve the current pressure of general manpower observation, drone aerial surveillance using visible and thermal infrared (TIR) cameras is increasingly being adopted. Automatic data acquisition has become easier with advances in unmanned aerial vehicles (UAVs) and sensors like TIR cameras, which enables executives to search and detect ground objects at night. However, it’s still a challenge to accurately and quickly process the large amount of TIR data generated from this. In response to the above problems, this paper designs an enhanced ground object detection network (UAV-TIR Retinanet) for the UAV thermal imaging system. The network uses the Retinanet as infrastructure, extracts shallow features according to the characteristics of thermal infrared remote sensing images, introduces an attention mechanism and adaptive receptive field mechanism. The method achieves the best speed-accuracy trade-off on the dataset, reporting 74.47% AP at 23.48 FPS.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 10295
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 12869
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. He, Y., et al.: Infrared machine vision and infrared thermography with deep learning: a review. Infrared Phys. Technol. 2021, 103754 (2021)

    Google Scholar 

  2. Yao, H., Qin, R., Chen, X.: Unmanned aerial vehicle for remote sensing applications—a review. Remote Sens. 11(12), 1443 (2019)

    Article  Google Scholar 

  3. Feng, L., et al.: A comprehensive review on recent applications of unmanned aerial vehicle remote sensing with various sensors for high-throughput plant phenotyping. Comput. Electron. Agricult. 182, 106033 (2021)

    Google Scholar 

  4. Rawat, S.S., Verma, S.K., Kumar, Y.: Review on recent development in infrared small target detection algorithms. Procedia Comput. Sci. 167, 2496–2505 (2020)

    Article  Google Scholar 

  5. He, K., et al.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904–1916 (2015)

    Article  Google Scholar 

  6. Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision (2015)

    Google Scholar 

  7. Ren, S., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. Adv. Neural Inf. Process. Syst. 28, 91–99 (2015)

    Google Scholar 

  8. Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) Computer Vision – ECCV 2016. ECCV 2016. LNCS, vol 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2

  9. Lin, T.-Y., et al.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision (2017)

    Google Scholar 

  10. Redmon, J., et al.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016)

    Google Scholar 

  11. Kundid Vasić, M., Papić, V.: Multimodel deep learning for person detection in aerial images. Electronics 9(9), 1459 (2020)

    Article  Google Scholar 

  12. Bondi, E., et al.: BIRDSAI: a dataset for detection and tracking in aerial thermal infrared videos. In: The IEEE Winter Conference on Applications of Computer Vision (2020)

    Google Scholar 

  13. Wu, Z., Shen, C., Van Den Hengel, A.: Wider or deeper: revisiting the resnet model for visual recognition. Pattern Recognit. 90, 119–133 (2019)

    Article  Google Scholar 

  14. Wang, Q., et al.: ECA-Net: efficient channel attention for deep convolutional neural networks. In: 2020 IEEE in CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (2020)

    Google Scholar 

  15. Lin, T.-Y., et al.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017)

    Google Scholar 

  16. Tan, M., Pang, R., Le, Q.V.: Efficientdet: Scalable and efficient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2020)

    Google Scholar 

  17. Luo, W., et al.: Understanding the effective receptive field in deep convolutional neural networks. In: Proceedings of the 30th International Conference on Neural Information Processing Systems (2016)

    Google Scholar 

  18. Liu, S., Huang, D., Wang, Y.: Receptive Field Block Net for Accurate and Fast Object Detection. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11215, pp. 404–419. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01252-6_24

    Chapter  Google Scholar 

  19. Szegedy, C., et al.: Inception-v4, inception-resnet and the impact of residual connections on learning. In: Thirty-first AAAI Conference on Artificial Intelligence (2017)

    Google Scholar 

  20. Liu, J., et al.: High precision detection algorithm based on improved RetinaNet for defect recognition of transmission lines. Energy Rep. 6, 2430–2440 (2020)

    Article  Google Scholar 

  21. Cartucho, J., Ventura, R., Veloso, M.: Robust object recognition through symbiotic deep learning in mobile robots. In: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE (2018)

    Google Scholar 

Download references

Acknowledgements

We would like to express gratitude to the efforts of Bondi, Elizabeth and her team members for creating and making publicly available scientific data. We would also like to thank all the reviewers on their time and insightful comments which improved our manuscript.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Falin Wu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wu, F., Zhou, G., He, J., Li, H., Liu, Y., Yang, G. (2021). Efficient Object Detection and Classification of Ground Objects from Thermal Infrared Remote Sensing Image Based on Deep Learning. In: Ma, H., et al. Pattern Recognition and Computer Vision. PRCV 2021. Lecture Notes in Computer Science(), vol 13022. Springer, Cham. https://doi.org/10.1007/978-3-030-88013-2_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-88013-2_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-88012-5

  • Online ISBN: 978-3-030-88013-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics