{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T14:52:36Z","timestamp":1740149556569,"version":"3.37.3"},"reference-count":32,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2022,12,23]],"date-time":"2022-12-23T00:00:00Z","timestamp":1671753600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62076251"],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"Multi-view 3D reconstruction technology based on deep learning is developing rapidly. Unsupervised learning has become a research hotspot because it does not need ground truth labels. The current unsupervised method mainly uses 3DCNN to regularize the cost volume to regression image depth. This approach results in high memory requirements and long computing time. In this paper, we propose an end-to-end unsupervised multi-view 3D reconstruction network framework based on PatchMatch, Unsup_patchmatchnet. It dramatically reduces memory requirements and computing time. We propose a feature point consistency loss function. We incorporate various self-supervised signals such as photometric consistency loss and semantic consistency loss into the loss function. At the same time, we propose a high-resolution loss method. This improves the reconstruction of high-resolution images. The experiment proves that the memory usage of the network is reduced by 80% and the running time is reduced by more than 50% compared with the network using 3DCNN method. The overall error of reconstructed 3D point cloud is only 0.501 mm. It is superior to most current unsupervised multi-view 3D reconstruction networks. Then, we test on different data sets and verify that the network has good generalization.<\/jats:p>","DOI":"10.3390\/s23010136","type":"journal-article","created":{"date-parts":[[2022,12,27]],"date-time":"2022-12-27T07:53:11Z","timestamp":1672127591000},"page":"136","source":"Crossref","is-referenced-by-count":1,"title":["Unsupervised 3D Reconstruction with Multi-Measure and High-Resolution Loss"],"prefix":"10.3390","volume":"23","author":[{"given":"Yijie","family":"Zheng","sequence":"first","affiliation":[{"name":"College of Command and Control Engineering, Army Engineering University of PLA, Nanjing 210007, China"}]},{"given":"Jianxin","family":"Luo","sequence":"additional","affiliation":[{"name":"College of Command and Control Engineering, Army Engineering University of PLA, Nanjing 210007, China"}]},{"given":"Weiwei","family":"Chen","sequence":"additional","affiliation":[{"name":"College of Command and Control Engineering, Army Engineering University of PLA, Nanjing 210007, China"}]},{"given":"Yanyan","family":"Zhang","sequence":"additional","affiliation":[{"name":"College of Command and Control Engineering, Army Engineering University of PLA, Nanjing 210007, China"}]},{"given":"Haixun","family":"Sun","sequence":"additional","affiliation":[{"name":"College of Command and Control Engineering, Army Engineering University of PLA, Nanjing 210007, China"}]},{"given":"Zhisong","family":"Pan","sequence":"additional","affiliation":[{"name":"College of Command and Control Engineering, Army Engineering University of PLA, Nanjing 210007, China"}]}],"member":"1968","published-online":{"date-parts":[[2022,12,23]]},"reference":[{"key":"ref_1","unstructured":"Mi, Q., and Gao, T. (July, January 29). 3D reconstruction based on the depth image: A review. Proceedings of the International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing, Kitakyushu, Japan."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"5496","DOI":"10.1109\/JSTARS.2022.3189528","article-title":"Modeling and predicting land use land cover spatiotemporal changes: A case study in chalus watershed, Iran","volume":"15","author":"Jalayer","year":"2022","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Darmon, F., Bascle, B., Devaux, J.C., Monasse, P., and Aubry, M. (2021, January 1\u20133). Deep multi-view stereo gone wild. Proceedings of the International Conference on 3D Vision (3DV), Prague, Czech Republic.","DOI":"10.1109\/3DV53792.2021.00058"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Sinha, S.N., Mordohai, P., and Pollefeys, M. (2007, January 14\u201321). Multi-view stereo via graph cuts on the dual of an adaptive tetrahedral mesh. Proceedings of the 2007 IEEE 11th International Conference on Computer Vision, Rio de Janeiro, Brazil.","DOI":"10.1109\/ICCV.2007.4408997"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Ulusoy, A.O., Black, M.J., and Geiger, A. (2017, January 21\u201326). Semantic multi-view stereo: Jointly estimating objects and voxels. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.482"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1362","DOI":"10.1109\/TPAMI.2009.161","article-title":"Accurate, dense, and robust multiview stereopsis","volume":"32","author":"Furukawa","year":"2009","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Locher, A., Perdoch, M., and Van Gool, L. (2016, January 27\u201330). Progressive prioritized multi-view stereo. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.353"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Galliani, S., Lasinger, K., and Schindler, K. (2015, January 7\u201313). Massively parallel multiview stereopsis by surface normal diffusion. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.106"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Xu, Q., and Tao, W. (2019, January 15\u201320). Multi-scale geometric consistency guided multi-view stereo. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00563"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1007\/s11263-016-0902-9","article-title":"Large-scale data for multiple-view stereopsis","volume":"120","author":"Jensen","year":"2016","journal-title":"Int. J. Comput. Vis."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3072959.3073599","article-title":"Tanks and temples: Benchmarking large-scale scene reconstruction","volume":"36","author":"Knapitsch","year":"2017","journal-title":"ACM Trans. Graph."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Ji, M., Gall, J., Zheng, H., Liu, Y., and Fang, L. (2017, January 22\u201329). Surfacenet: An end-to-end 3d neural network for multiview stereopsis. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.253"},{"key":"ref_13","unstructured":"Kar, A., H\u00e4ne, C., and Malik, J. (2017). Learning a multi-view stereo machine. arXiv."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Yao, Y., Luo, Z., Li, S., Fang, T., and Quan, L. (2018, January 8\u201314). Mvsnet: Depth inference for unstructured multi-view stereo. Proceedings of the European conference on computer vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01237-3_47"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Yu, Z., and Gao, S. (2020, January 13\u201319). Fast-mvsnet: Sparse-to-dense multi-view stereo with learned propagation and gauss-newton refinement. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00202"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Yang, J., Mao, W., Alvarez, J.M., and Liu, M. (2020, January 13\u201319). Cost volume pyramid based depth inference for multi-view stereo. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00493"},{"key":"ref_17","unstructured":"Khot, T., Agrawal, S., Tulsiani, S., Mertz, C., Lucey, S., and Hebert, M. (2019). Learning unsupervised multi-view stereopsis via robust photometric consistency. arXiv."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Dai, Y., Zhu, Z., Rao, Z., and Li, B. (2019, January 16\u201319). Mvs2: Deep unsupervised multi-view stereo with multi-view symmetry. Proceedings of the International Conference on 3D Vision (3DV), Qu\u00e9bec City, QC, Canada.","DOI":"10.1109\/3DV.2019.00010"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Huang, B., Yi, H., Huang, C., He, Y., Liu, J., and Liu, X. (2021, January 19\u201322). M3VSNet: Unsupervised multi-metric multi-view stereo network. Proceedings of the International Conference on Image Processing (ICIP), Anchorage, AK, USA.","DOI":"10.1109\/ICIP42928.2021.9506469"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Xu, H., Zhou, Z., Qiao, Y., Kang, W., and Wu, Q. (2021, January 2\u20139). Self-supervised multi-view stereo via effective co-segmentation and data-augmentation. Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA.","DOI":"10.1609\/aaai.v35i4.16411"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Yao, Y., Luo, Z., Li, S., Shen, T., Fang, T., and Quan, L. (2019, January 15\u201320). Recurrent mvsnet for high-resolution multi-view stereo depth inference. Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00567"},{"key":"ref_22","unstructured":"Luo, K., Guan, T., Ju, L., Huang, H., and Luo, Y. (November, January 27). P-mvsnet: Learning patch-wise matching confidence aggregation for multi-view stereo. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Gu, X., Fan, Z., Zhu, S., Dai, Z., Tan, F., and Tan, P. (2020, January 13\u201319). Cascade cost volume for high-resolution multi-view stereo and stereo matching. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00257"},{"key":"ref_24","unstructured":"Chen, R., Han, S., Xu, J., and Su, H. (November, January 27). Point-based multi-view stereo network. Proceedings of the IEEE\/CVF international conference on computer vision, Seoul, Republic of Korea."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Wang, F., Galliani, S., Vogel, C., Speciale, P., and Pollefeys, M. (2021, January 20\u201325). Patchmatchnet: Learned multi-view patchmatch stereo. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01397"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1145\/1531326.1531330","article-title":"PatchMatch: A randomized correspondence algorithm for structural image editing","volume":"28","author":"Barnes","year":"2009","journal-title":"ACM Trans. Graph. (TOG)"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Yang, J., Alvarez, J.M., and Liu, M. (2021, January 20\u201325). Self-supervised learning of depth inference for multi-view stereo. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00744"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Hui, T.W., Loy, C.C., and Tang, X. (2016, January 8\u201316). Depth map super-resolution by deep multi-scale guidance. Proceedings of the European conference on computer vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46487-9_22"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Xu, H., Zhou, Z., Wang, Y., Kang, W., Sun, B., Li, H., and Qiao, Y. (2021, January 10\u201317). Digging into uncertainty in self-supervised multi-view stereo. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00602"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"18839","DOI":"10.1007\/s11042-021-10646-0","article-title":"2D object recognition: A comparative analysis of SIFT, SURF and ORB feature descriptors","volume":"80","author":"Bansal","year":"2021","journal-title":"Multimed. Tools Appl."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"903","DOI":"10.1007\/s00138-011-0346-8","article-title":"Efficient large-scale multi-view stereo for ultra high-resolution image sets","volume":"23","author":"Tola","year":"2012","journal-title":"Mach. Vis. Appl."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Campbell, N.D., Vogiatzis, G., Hern\u00e1ndez, C., and Cipolla, R. (2008, January 12\u201318). Using multiple hypotheses to improve depth-maps for multi-view stereo. Proceedings of the European Conference on Computer Vision, Marseille, France.","DOI":"10.1007\/978-3-540-88682-2_58"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/1\/136\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,11]],"date-time":"2024-08-11T14:42:31Z","timestamp":1723387351000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/1\/136"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,23]]},"references-count":32,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,1]]}},"alternative-id":["s23010136"],"URL":"https:\/\/doi.org\/10.3390\/s23010136","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2022,12,23]]}}}