{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,8]],"date-time":"2025-04-08T05:45:24Z","timestamp":1744091124841,"version":"3.37.3"},"reference-count":35,"publisher":"MDPI AG","issue":"17","license":[{"start":{"date-parts":[[2022,8,30]],"date-time":"2022-08-30T00:00:00Z","timestamp":1661817600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["U1909206","61725305","T2121002"],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Postdoctoral Innovative Talent Support Program","award":["BX2021010"]},{"DOI":"10.13039\/501100002858","name":"China Postdoctoral Science Foundation","doi-asserted-by":"crossref","award":["2022M710214"],"id":[{"id":"10.13039\/501100002858","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Joint Fund of Ministry of Education for Equipment Pre-Research","award":["8091B022134"]},{"name":"S&T Program of Hebei","award":["F2020203037"]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"Autonomous underwater garbage grasping and collection pose a great challenge to underwater robots. To assist underwater robots in locating and recognizing underwater garbage objects efficiently, a modified U-Net-based architecture consisting of a deeper contracting path and an expansive path is proposed to accomplish end-to-end image semantic segmentation. In addition, a dataset for underwater garbage semantic segmentation is established. The proposed architecture is further verified in the underwater garbage dataset and the effects of different hyperparameters, loss functions, and optimizers on the performance of refining the predicted segmented mask are examined. It is confirmed that the focal loss function will lead to a boost in solving the target\u2013background unbalance problem. Eventually, the obtained results offer a solid foundation for fast and precise underwater target recognition and operations.<\/jats:p>","DOI":"10.3390\/s22176546","type":"journal-article","created":{"date-parts":[[2022,8,31]],"date-time":"2022-08-31T04:13:56Z","timestamp":1661919236000},"page":"6546","source":"Crossref","is-referenced-by-count":9,"title":["Image Semantic Segmentation of Underwater Garbage with Modified U-Net Architecture Model"],"prefix":"10.3390","volume":"22","author":[{"given":"Lifu","family":"Wei","sequence":"first","affiliation":[{"name":"Department of Advanced Manufacturing and Robotics, College of Engineering, Peking University, Beijing 100871, China"}]},{"given":"Shihan","family":"Kong","sequence":"additional","affiliation":[{"name":"Department of Advanced Manufacturing and Robotics, College of Engineering, Peking University, Beijing 100871, China"}]},{"given":"Yuquan","family":"Wu","sequence":"additional","affiliation":[{"name":"Institute of Software, Chinese Academy of Sciences, Beijing 100190, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6347-572X","authenticated-orcid":false,"given":"Junzhi","family":"Yu","sequence":"additional","affiliation":[{"name":"Department of Advanced Manufacturing and Robotics, College of Engineering, Peking University, Beijing 100871, China"}]}],"member":"1968","published-online":{"date-parts":[[2022,8,30]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"100010","DOI":"10.1016\/j.cscee.2020.100010","article-title":"Microplastics in the marine environment: A review of their sources, distribution processes, uptake and exchange in ecosystems","volume":"2","author":"Coyle","year":"2020","journal-title":"Case Stud. Chem. Environ. Eng."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"842","DOI":"10.1016\/S0025-326X(02)00220-5","article-title":"The pollution of the marine environment by plastic debris: A review","volume":"44","author":"Derraik","year":"2002","journal-title":"Mar. Pollut. Bull."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"28","DOI":"10.3389\/feart.2020.00028","article-title":"Urban river water level increase through plastic the accumulation at a rack structure","volume":"8","author":"Honingh","year":"2020","journal-title":"Front. Earth Sci."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1016\/S2542-5196(17)30140-7","article-title":"Threats to human health by great ocean garbage patches","volume":"1","author":"Efferth","year":"2017","journal-title":"Lancet Planet. Health"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"23729","DOI":"10.1007\/s11042-020-08976-6","article-title":"A review of object detection based on deep learning","volume":"79","author":"Xiao","year":"2020","journal-title":"Multimed. Tools Appl."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Perez, J.A., Deligianni, F., Ravi, D., and Yang, G.Z. (2018). Artificial intelligence and robotics. arXiv.","DOI":"10.31256\/WP2017.1"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"3212","DOI":"10.1109\/TNNLS.2018.2876865","article-title":"Object detection with deep learning: A review","volume":"30","author":"Zhao","year":"2019","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_8","first-page":"3523","article-title":"Image segmentation using deep learning: A survey","volume":"44","author":"Minaee","year":"2022","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1016\/j.patrec.2022.04.025","article-title":"A hybrid active contour model based on pre-fitting energy and adaptive functions for fast image segmentation","volume":"158","author":"Ge","year":"2022","journal-title":"Pattern Recognit. Lett."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"115633","DOI":"10.1016\/j.eswa.2021.115633","article-title":"A level set method based on additive bias correction for image segmentation","volume":"185","author":"Weng","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-Net: Convolutional networks for biomedical image segmentation. Proceedings of the Medical Image Computing and Computer-Assisted Intervention\u2014MICCAI 2015, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Garcia-Garcia, A., Orts-Escolano, S., Oprea, S., Villena-Martinez, V., and Garcia-Rodriguez, J. (2017). A review on deep learning techniques applied to semantic segmentation. arXiv.","DOI":"10.1016\/j.asoc.2018.05.018"},{"key":"ref_13","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"318","DOI":"10.1109\/TPAMI.2018.2858826","article-title":"Focal Loss for dense object detection","volume":"42","author":"Lin","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1109\/TSMC.1979.4310076","article-title":"A threshold selection method from gray-level histogram","volume":"9","year":"1979","journal-title":"IEEE Trans. Syst. Man Cybern."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"679","DOI":"10.1109\/TPAMI.1986.4767851","article-title":"A computational approach to edge detection","volume":"PAMI-8","author":"Canny","year":"1986","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Kornilov, A.S., and Safonov, I.V. (2018). An overview of watershed algorithm implementations in open source libraries. J. Imaging, 4.","DOI":"10.3390\/jimaging4100123"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/5.726791","article-title":"Gradient-based learning applied to document recognition","volume":"86","author":"Lecun","year":"1998","journal-title":"Proc. IEEE"},{"key":"ref_19","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012). Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst., 25, Available online: https:\/\/papers.nips.cc\/paper\/2012\/hash\/c399862d3b9d6b76c8436e924a68c45b-Abstract.html."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You only look once: Unified, real-Time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 11\u201314). SSD: Single shot multibox detector. Proceedings of the Computer Vision\u2014ECCV 2016, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE conference on computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_24","unstructured":"Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., and Yuille, A.L. (2014). Semantic image segmentation with deep convolutional nets and fully connected CRFs. arXiv."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"382","DOI":"10.1109\/TCE.2018.2859629","article-title":"Deep learning based robot for automatically picking up garbage on the grass","volume":"64","author":"Bai","year":"2018","journal-title":"IEEE Trans. Consum. Electron."},{"key":"ref_26","unstructured":"Lakshmi, M.D., and Santhanam, S.M. (2020, January 21\u201323). Underwater image recognition detector using deep ConvNet. Proceedings of the 2020 National Conference on Communications (NCC), Kharagpur, India."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"1729881420976307","DOI":"10.1177\/1729881420976307","article-title":"Underwater target recognition methods based on the framework of deep learning: A survey","volume":"17","author":"Teng","year":"2020","journal-title":"Int. J. Adv. Robot. Syst."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1345","DOI":"10.1109\/TKDE.2009.191","article-title":"A survey on transfer learning","volume":"22","author":"Pan","year":"2009","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"94614","DOI":"10.1109\/ACCESS.2019.2928976","article-title":"Underwater image enhancement with a deep residual framework","volume":"7","author":"Liu","year":"2019","journal-title":"IEEE Access"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"64381","DOI":"10.1109\/ACCESS.2021.3075951","article-title":"Improving road semantic segmentation using generative adversarial Network","volume":"9","author":"Abdollahi","year":"2021","journal-title":"IEEE Access"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Enshaei, N., Ahmad, S., and Naderkhani, F. (2020, January 8\u201310). Automated detection of textured-surface defects using UNet-based semantic segmentation network. Proceedings of the 2020 IEEE International Conference on Prognostics and Health Management (ICPHM), Detroit, MI, USA.","DOI":"10.1109\/ICPHM49022.2020.9187023"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1217","DOI":"10.1631\/FITEE.2100473","article-title":"A modified YOLOv4 detection method for a vision-based underwater garbage cleaning robot","volume":"23","author":"Tian","year":"2022","journal-title":"Front. Inf. Technol. Electron. Eng."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the Proceedings of the European conference on computer vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017, January 21\u201326). Pyramid scene parsing network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_35","first-page":"12077","article-title":"SegFormer: Simple and efficient design for semantic segmentation with transformers","volume":"34","author":"Xie","year":"2021","journal-title":"Adv. Neural Inf. Process. Syst."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/17\/6546\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,20]],"date-time":"2025-01-20T22:04:58Z","timestamp":1737410698000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/17\/6546"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,30]]},"references-count":35,"journal-issue":{"issue":"17","published-online":{"date-parts":[[2022,9]]}},"alternative-id":["s22176546"],"URL":"https:\/\/doi.org\/10.3390\/s22176546","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2022,8,30]]}}}