{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,24]],"date-time":"2025-03-24T09:04:18Z","timestamp":1742807058412,"version":"3.37.3"},"reference-count":23,"publisher":"MDPI AG","issue":"13","license":[{"start":{"date-parts":[[2023,6,21]],"date-time":"2023-06-21T00:00:00Z","timestamp":1687305600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Innovation Special Zone Project","award":["2016300TS00600113"]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"Addressing the challenges of low detection precision and excessive parameter volume presented by the high resolution, significant scale variations, and complex backgrounds in UAV aerial imagery, this paper introduces MFP-YOLO, a lightweight detection algorithm based on YOLOv5s. Initially, a multipath inverse residual module is designed, and an attention mechanism is incorporated to manage the issues associated with significant scale variations and abundant interference from complex backgrounds. Then, parallel deconvolutional spatial pyramid pooling is employed to extract scale-specific information, enhancing multi-scale target detection. Furthermore, the Focal-EIoU loss function is utilized to augment the model\u2019s focus on high-quality samples, consequently improving training stability and detection accuracy. Finally, a lightweight decoupled head replaces the original model\u2019s detection head, accelerating network convergence speed and enhancing detection precision. Experimental results demonstrate that MFP-YOLO improved the mAP50 on the VisDrone 2019 validation and test sets by 12.9% and 8.0%, respectively, compared to the original YOLOv5s. At the same time, the model\u2019s parameter volume and weight size were reduced by 79.2% and 73.7%, respectively, indicating that MFP-YOLO outperforms other mainstream algorithms in UAV aerial imagery detection tasks.<\/jats:p>","DOI":"10.3390\/s23135786","type":"journal-article","created":{"date-parts":[[2023,6,22]],"date-time":"2023-06-22T06:09:17Z","timestamp":1687414157000},"page":"5786","source":"Crossref","is-referenced-by-count":19,"title":["Lightweight Object Detection Algorithm for UAV Aerial Imagery"],"prefix":"10.3390","volume":"23","author":[{"given":"Jian","family":"Wang","sequence":"first","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650504, China"},{"name":"Yunnan Key Lab of Artificial Intelligence, Kunming University of Science and Technology, Kunming 650504, China"}]},{"given":"Fei","family":"Zhang","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650504, China"}]},{"given":"Yuesong","family":"Zhang","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650504, China"}]},{"given":"Yahui","family":"Liu","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650504, China"}]},{"given":"Ting","family":"Cheng","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650504, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,6,21]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Ma\u2019sum, M.A., Arrofi, M.K., Jati, G., Arifin, F., Kurniawan, M.N., Mursanto, P., and Jatmiko, W. (2013, January 28\u201329). Simulation of intelligent unmanned aerial vehicle (UAV) for military surveillance. Proceedings of the 2013 International Conference on Advanced Computer Science and Information Systems (ICACSIS), Sanur Bali, Indonesia.","DOI":"10.1109\/ICACSIS.2013.6761569"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1134\/S1067413615010038","article-title":"The system of the ecological monitoring of environment which is based on the usage of UAV","volume":"46","author":"Danilov","year":"2015","journal-title":"Russ. J. Ecol."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"2055","DOI":"10.1016\/j.neucom.2017.10.037","article-title":"Path planning for solar-powered UAV in urban environment","volume":"275","author":"Wu","year":"2018","journal-title":"Neurocomputing"},{"key":"ref_4","unstructured":"Viola, P., and Jones, M. (2001, January 8\u201314). Rapid object detection using a boosted cascade of simple features. Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, Kauai, HI, USA."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1042","DOI":"10.1109\/34.254061","article-title":"Face recognition: Features versus templates","volume":"15","author":"Brunelli","year":"1993","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1016\/j.isprsjprs.2020.07.002","article-title":"Cross-regional oil palm tree counting and detection via a multi-level attention domain adaptation network","volume":"167","author":"Zheng","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast r-cnn. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1016\/j.isprsjprs.2021.01.008","article-title":"Growing status observation for oil palm trees using Unmanned Aerial Vehicle (UAV) images","volume":"173","author":"Zheng","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_10","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016). Proceedings of the Computer Vision\u2013ECCV 2016: 14th European Conference (Proceedings, Part I 14), Amsterdam, The Netherlands, 11\u201314 October 2016, Springer International Publishing."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"7817","DOI":"10.1038\/s41598-023-34892-4","article-title":"Lightweight aerial image object detection algorithm based on improved YOLOv5s","volume":"13","author":"Deng","year":"2023","journal-title":"Sci. Rep."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Liu, S., Qi, L., Qin, H., Shi, J., and Jia, J. (2018, January 18\u201323). Path aggregation network for instance segmentation. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"ref_14","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Wang, C.Y., Bochkovskiy, A., and Liao, H.-Y.M. (2023, January 18\u201322). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.00721"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L.-C. (2018, January 18\u201323). Mobilenetv2: Inverted residuals and linear bottlenecks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00474"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.-Y., and Kweon, I.S. (2018, January 8\u201314). Cbam: Convolutional block attention module. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Li, D., Hu, J., Wang, C., Li, X., She, Q., Zhu, L., Zhang, T., and Chen, Q. (2021, January 20\u201325). Involution: Inverting the inherence of convolution for visual recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01214"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"146","DOI":"10.1016\/j.neucom.2022.07.042","article-title":"Focal and efficient IOU loss for accurate bounding box regression","volume":"506","author":"Zhang","year":"2022","journal-title":"Neurocomputing"},{"key":"ref_20","unstructured":"Ge, Z., Liu, S., Wang, F., Li, Z., and Sun, J. (2021). Yolox: Exceeding yolo series in 2021. arXiv."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Chen, J., Kao, S.-H., He, H., Zhuo, W., Wen, S., Lee, C.-H., and Chan, S.-H.G. (2023, January 18\u201322). Run, Don\u2019t walk: Chasing higher FLOPS for faster neural networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.01157"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Ma, N., Zhang, X., Zheng, H.-T., and Sun, J. (2018, January 8\u201314). Shufflenet v2: Practical guidelines for efficient cnn architecture design. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01264-9_8"},{"key":"ref_23","unstructured":"Du, D., Zhu, P., Wen, L., Bian, X., Lin, H., Hu, Q., Peng, T., Zheng, J., Wang, X., and Zhang, Y. (2019, January 27\u201328). VisDrone-DET2019: The vision meets drone object detection in image challenge results. Proceedings of the IEEE\/CVF International Conference on Computer Vision Workshops, Seoul, Republic of Korea."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/13\/5786\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,12,15]],"date-time":"2023-12-15T13:36:52Z","timestamp":1702647412000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/13\/5786"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,21]]},"references-count":23,"journal-issue":{"issue":"13","published-online":{"date-parts":[[2023,7]]}},"alternative-id":["s23135786"],"URL":"https:\/\/doi.org\/10.3390\/s23135786","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2023,6,21]]}}}