Abstract
The huge computational complexity, occlusion and low texture region problems make stereo matching a big challenge. In this work, we use multi-baseline trinocular camera model to study how to accelerate the stereo matching algorithms and improve the accuracy of disparity estimation. A special scheme named the trinocular dynamic disparity range (T-DDR) was designed to accelerate the stereo matching algorithms. In this scheme, we optimize matching cost calculation, cost aggregation and disparity computation steps by narrowing disparity searching range. Meanwhile, we designed another novel scheme called the trinocular disparity confidence measure (T-DCM) to improve the accuracy of the disparity map. Based on those, we proposed the semi-global matching with T-DDR (T-DDR-SGM) and T-DCM (T-DCM-SGM) algorithms for trinocular stereo matching. According to the evaluation results, the T-DDR-SGM could not only significantly reduce the computational complexity but also slightly improving the accuracy, while the T-DCM-SGM could excellently handle the occlusion and low texture region problems. Both of them achieved a better result. Moreover, the optimization schemes we designed can be extended to the other stereo matching algorithms which possesses pixel-wise matching cost calculation and aggregation steps not only the SGM. We proved that the proposed optimization methods for the trinocular stereo matching are effective and the trinocular stereo matching is useful for either improving accuracy or reducing computational complexity.
Similar content being viewed by others
References
Akin A, Capoccia R, Narinx J, Baz I, Schmid A, Leblebici Y (2015) Trinocular adaptive window size disparity estimation algorithm and its real-time hardware. In VLSI Design, Automation and Test (VLSI-DAT), pp 1–4: IEEE
Dosovitskiy A, Ros G, Codevilla F, Lopez A, Koltun V (2017) CARLA: An open urban driving simulator. arXiv preprint arXiv: 03938
El Jaafari I, El Ansari M, Koutti L, Mazoul A, Ellahyani A (2016) Fast spatio-temporal stereo matching for advanced driver assistance systems. Neurocomputing 194:24–33
Gehrig SK, Eberli F, Meyer T (2009) A real-time low-power stereo vision engine using semi-global matching. In International Conference on Computer Vision Systems, pp 134–143: Springer
Geiger A, Lenz P, Stiller C, Urtasun R (2013) "Vision meets robotics: the KITTI dataset," (in English). Int J Robot Res 32(11):1231–1237
Hirschmuller H (2005) Accurate and efficient stereo processing by semi-global matching and mutual information. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), vol 2 pp 807–814: IEEE
Hirschmuller H (2007) Stereo processing by semiglobal matching and mutual information. IEEE Trans Pattern Anal 30(2):328–341
Honegger D, Sattler T, Pollefeys M (2017) Embedded real-time multi-baseline stereo. In 2017 IEEE International Conference on Robotics and Automation (ICRA), pp 5245–5250: IEEE
Hu X, P. J. I. T. o. P. A. Mordohai, and M. Intelligence (2012) A quantitative evaluation of confidence measures for stereo vision. vol 34, no 11, pp 2121–2133
Kendall A et al (2017) End-to-end learning of geometry and context for deep stereo regression. In Proceedings of the IEEE International Conference on Computer Vision, pp 66–75
Lee KJ, Bong K, Kim C, Yoo H-J (2016) An intelligent ADAS processor with real-time semi-global matching and intention prediction for 720p stereo vision. In 2016 IEEE Hot Chips 28 Symposium (HCS), pp 1–1: IEEE
Li M, Shi L, Chen X, Du S, Li Y (2019) Using temporal correlation to optimize stereo matching in video sequences. IEICE Trans Inf 102(6):1183–1196
Luo W, Schwing AG, Urtasun R (2016) Efficient deep learning for stereo matching. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 5695–5703
Menze M, Geiger A (2015) Object scene flow for autonomous vehicles. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 3061–3070
Michael M, Salmen J, Stallkamp J, Schlipsing M (2013) Real-time stereo vision: Optimizing semi-global matching. In 2013 IEEE Intelligent Vehicles Symposium (IV), pp 1197–1202: IEEE
Narinx J, Demirci T, Akin A, Leblebici Y (2017) A single-chip 2048× 1080 resolution 32fps 380mW trinocular disparity estimation processor in 28nm CMOS technology. In 2017 Symposium on VLSI Circuits, pp C228-C229: IEEE
Park H, Lee KM (2016) Look wider to match image patches with convolutional neural networks. IEEE Signal Process Lett 24(12):1788–1792
Pritchett P, Zisserman A (1998) Matching and reconstruction from widely separated views. In European workshop on 3D structure from multiple images of large-scale environments
Scharstein D, Szeliski R (2002) A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int J Comput Vis 47(1–3):7–42
Strecha C, Pylvänäinen T, Fua P (2010) Dynamic and scalable large scale image reconstruction. In 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 406–413: IEEE
Yang W, Zhang G, Bao H, Kim J, Lee HY (2012) Consistent depth maps recovery from a trinocular video sequence. In 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp 1466–1473: IEEE
Zagoruyko S, Komodakis N (2015) Learning to compare image patches via convolutional neural networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4353–4361
Zbontar J, LeCun Y (2015) Computing the stereo matching cost with a convolutional neural network. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1592–1599
Zbontar J, LeCun Y (2016) Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches. J Mach Learn Res 17(1–32):2
Zhang K, Lu J, Lafruit G (2009) Cross-based local stereo matching using orthogonal integral images. IEEE Trans Circ Syst Video Technol 19(7):1073–1079
Zhou J, Wang L, Gu X, Xu K, Zhang Y (2015) A depth map estimation approach for trinocular stereo. In 2015 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, pp 1–6: IEEE
Author information
Authors and Affiliations
Corresponding authors
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wang, J., Peng, C., Li, M. et al. The study of stereo matching optimization based on multi-baseline trinocular model. Multimed Tools Appl 81, 12961–12972 (2022). https://doi.org/10.1007/s11042-022-12579-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-12579-8