QPDet: Queuing People Detector for Aerial Images Based on Adaptive Soft Label Assignment Strategy | SpringerLink
Skip to main content

QPDet: Queuing People Detector for Aerial Images Based on Adaptive Soft Label Assignment Strategy

  • Conference paper
  • First Online:
Pattern Recognition (ICPR 2024)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 15305))

Included in the following conference series:

  • 168 Accesses

Abstract

As a branch of object detection task, the detection of queuing people in the aerial images emerges as a new research direction in recent years. However, it remains challenging due to the complex background, variations in the viewpoints and densely located tiny objects. In this paper, we propose a tiny object detector called QPDet to solve the above issues. Firstly, a large-scale dataset of queuing people with dense and diverse annotations has been constructed to enrich the knowledge of this field, in which a large number of queuing people have been sorted and collected using drones. Secondly, a modified YOLOX has been utilized as our detector, which effectively detects the tiny objects in the aerial images. Finally, an adaptive soft label assignment strategy has been proposed to improve the robustness of our detector. Extensive experiments have been conducted to demonstrate the effectiveness of our approach, the results exhibit the superiority of our detector.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 8465
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 10581
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Cheng, G., et al.: Towards large-scale small object detection: survey and benchmarks. IEEE Trans. Pattern Anal. Mach. Intell. (2023)

    Google Scholar 

  2. Kisantal, M., Wojna, Z., Murawski, J., Naruniec, J., Cho, K.: Augmentation for small object detection. arXiv preprint arXiv:1902.07296 (2019)

  3. Zhu, P., Wen, L., Bian, X., Ling, H., Hu, Q.: Vision meets drones: a challenge. arXiv preprint arXiv:1804.07437 (2018)

  4. Xu, C., Wang, J., Yang, W., Yu, L.: Dot distance for tiny object detection in aerial images. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1192–1201 (2021)

    Google Scholar 

  5. Yu, X., Gong, Y., Jiang, N., Ye, Q., Han, Z.: Scale match for tiny person detection. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 1257–1265 (2020)

    Google Scholar 

  6. Zhang, S., Benenson, R., Schiele, B.: Citypersons: a diverse dataset for pedestrian detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3213–3221 (2017)

    Google Scholar 

  7. Guo, C., Fan, B., Zhang, Q., Xiang, S., Pan, C.: Augfpn: improving multi-scale feature learning for object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12595–12604 (2020)

    Google Scholar 

  8. Li, Y., Chen, Y., Wang, N., Zhang, Z.: Scale-aware trident networks for object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6054–6063 (2019)

    Google Scholar 

  9. Bai, Y., Zhang, Y., Ding, M., Ghanem, B.: Sod-mtgan: small object detection via multi-task generative adversarial network. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 206–221 (2018)

    Google Scholar 

  10. Singh, B., Najibi, M., Davis, L.S.: Sniper: efficient multi-scale training. In: Advances in Neural Information Processing Systems, vol. 31 (2018)

    Google Scholar 

  11. Yang, C., Huang, Z., Wang, N.: Querydet: cascaded sparse query for accelerating high-resolution small object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13668–13677 (2022)

    Google Scholar 

  12. Bell, S., Zitnick, C.L., Bala, K., Girshick, R.: Inside-outside net: detecting objects in context with skip pooling and recurrent neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2874–2883 (2016)

    Google Scholar 

  13. Xu, C., Wang, J., Yang, W., Yu, H., Yu, L., Xia, G.-S.: RFLA: Gaussian receptive field based label assignment for tiny object detection. In: European Conference on Computer Vision, pp. 526–543. Springer, Cham (2022)

    Google Scholar 

  14. Zhang, S., et al.: Dense distinct query for end-to-end object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7329–7338 (2023)

    Google Scholar 

  15. Huang, X., Ge, Z., Jie, Z., Yoshie, O.: NMS by representative region: towards crowded pedestrian detection by proposal pairing. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10750–10759 (2020)

    Google Scholar 

  16. Chi, C., Zhang, S., Xing, J., Lei, Z., Li, S.Z., Zou, X.: Pedhunter: occlusion robust pedestrian detector in crowded scenes. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 10639–10646 (2020)

    Google Scholar 

  17. Oyelade, O.N., Ezugwu, A.E.-S.: A state-of-the-art survey on deep learning methods for detection of architectural distortion from digital mammography. IEEE Access 8, 148644–148676 (2020)

    Google Scholar 

  18. Wang, X., Girshick, R., Gupta, A., He, K.: Non-local neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7794–7803 (2018)

    Google Scholar 

  19. Bochkovskiy, A.: Yolov4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)

  20. Jiang, Y., Tan, Z., Wang, J., Sun, X., Lin, M., Li, H.: Giraffedet: a heavy-neck paradigm for object detection. arXiv preprint arXiv:2202.04256 (2022)

  21. Tan, M., Pang, R., Le, Q.V.: Efficientdet: scalable and efficient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10781–10790 (2020)

    Google Scholar 

  22. Li, Z., Hou, B., Zitong, W., Ren, B., Yang, C.: FCOSR: a simple anchor-free rotated detector for aerial object detection. Remote Sens. 15(23), 5499 (2023)

    Article  Google Scholar 

  23. Lin, T.-Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)

    Google Scholar 

  24. Ge, Z., Liu, S., Li, Z., Yoshie, O., Sun, J.: OTA: optimal transport assignment for object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 303–312 (2021)

    Google Scholar 

  25. Ge, Z., Liu, S., Wang, F., Li, Z., Sun, J.: Yolox: exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430 (2021)

  26. Loshchilov, I., Hutter, F.: Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017)

  27. Kingma, D.P.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)

  28. Xu, S., et al. Pp-yoloe: an evolved version of yolo. arXiv preprint arXiv:2203.16250 (2022)

  29. Sun, Y., Cao, B., Zhu, P., Qinghua, H.: Drone-based RGB-infrared cross-modality vehicle detection via uncertainty-aware learning. IEEE Trans. Circuits Syst. Video Technol. 32(10), 6700–6713 (2022)

    Article  Google Scholar 

Download references

Acknowledgment

This work is supported by the Intelligent Policing Key Laboratory of Sichuan Province, No. ZNJW2024FKMS004.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kai Yi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhang, Y., Su, Y., Li, S., Yi, K. (2025). QPDet: Queuing People Detector for Aerial Images Based on Adaptive Soft Label Assignment Strategy. In: Antonacopoulos, A., Chaudhuri, S., Chellappa, R., Liu, CL., Bhattacharya, S., Pal, U. (eds) Pattern Recognition. ICPR 2024. Lecture Notes in Computer Science, vol 15305. Springer, Cham. https://doi.org/10.1007/978-3-031-78169-8_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-78169-8_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-78168-1

  • Online ISBN: 978-3-031-78169-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics