Abstract
3D video is composed out of two or more, temporally synchronized, 2D video streams acquired at different camera poses and accompanied by geometrical information. In a mixed resolution 3D video stream, a subset of views is coded at reduced resolution. It has been shown in the literature that subjective quality of mixed resolution 3D video is close to that of full resolution 3D video. In order to improve the coding gain in mixed resolution coding scenario we present a new depth encoding method called view upsampling optimization. A novel depth distortion metric based on the performance of the depth-based super resolution is also presented. Finally, to improve the quality of the decoded video an improved depth-based super resolution method that uses view synthesis quality mapping is used for upsampling of low resolution views. The simulations, performed with the recently standardized MVC+D encoder, show that the proposed solution combined with the state of the art view synthesis distortion outperforms the anchor MVC+D coding scheme by 14.5 % of dBR on average for the total coded bitrate and by 17 % of dBR on average for the synthesized views.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Aflaki, P., Hannuksela, M. M., Hakkinen, J., Lindroos, P., & Gabbouj, M. (2010). Subjective study on compressed asymmetric stereoscopic video. In 17th IEEE international conference on image processing (pp. 4021–4024).
Aflaki, P., Su, W., Joachimiak, M., Rusanovskyy, D., Hannuksela, M. M., Li, H., & Gabbouj, M. (2013). Coding of mixed-resolution multiview video in 3D video application. In International conference on image processing (pp. 1704–1708).
Aksay, A., Bilen, C., Kurutepe, E., Ozcelebi, T., Akar, G. B., Civanlar, M. R., & Tekalp, A. M. (2006). Temporal and spatial scaling for stereoscopic video compression. In Proceedings of EUSIPCO (Vol. 6, No. 8).
Bjontegaard, G. (2001). Calculation of average PSNR differences between RD-curves. ITU-T SG16 Q.6 document VCEG-M33.
Brust, H., Smolic, A., Mueller, K., Tech, G., & Wiegand, T. (2009). Mixed resolution coding of stereoscopic video for mobile devices. In 3DTV conference: The true vision-capture, transmission and display of 3D Video.
Chen, Y., Hannuksela, M. M., Suzuki, T., & Hattori, S. (2013). Overview of the MVC+D 3D video coding standard. Journal of Visual Communication and Image Representation, 25(4), 679–688.
Chen, Y., Liu, S., Wang, Y.-K., Hannuksela, M. M., Li, H., & Gabbouj, M. (2008). Low-complexity asymmetric multiview video coding. In IEEE international conference on multimedia and expo (pp. 773–776).
Chen, Y., Wang, Y.-K., Gabbouj, M., & Hannuksela, M. M. (2009). Regionally adaptive filtering for asymmetric stereoscopic video coding. In IEEE International Symposium on Circuits and Systems, 2009 (ISCAS 2009) (pp. 2585–2588). IEEE.
Chen, Y., Wang, Y.-K., Hannuksela, M. M., & Gabbouj, M. (2008). Picture-level adaptive filter for asymmetric stereoscopic video. In 15th IEEE international conference on image processing (pp. 1944–1947).
Chen, Y., Wang, Y.-K., Ugur, K., Hannuksela, M. M., Lainema, J., & Gabbouj, M. (2009). The emerging MVC standard for 3D video services. EURASIP Journal on Advances in Signal Processing.
Dinstein, I., Kim, M. G., Tselgov, J., & Henik, A. (1989). Compression of stereo images and the evaluation of its effects on 3-D perception. In Proceedings of SPIE, applications of digital image processing XII (Vol. 1153, p. 522).
Dong, J., He, Y., & Ye, Y. (2012). Downsampling filters for anchor generation for scalable extensions of HEVC. ISO/IEC JTC1/SC29/WG11 MPEG2012/M23485.
Fehn, C. (2004). Depth-image-based rendering (DIBR), compression and transmission for a new approach on 3D-TV. In SPIE conference on stereoscopic displays and virtual reality systems XI (Vol. 5291, pp. 93–104).
Fehn, C., Kauff, P., Cho, S., Kwon, H., Hur, N., & Kim, J. (2007). Asymmetric coding of stereoscopic video for transmission over T-DMB. In 3DTV conference.
Garcia, D. C., Dorea, C., & de Queiroz, R. L. (2012). Super resolution for multiview images using depth information. IEEE Transactions on Circuits and Systems for Video Technology, 22(9), 1249–1256.
Hannuksela, M. M., Rusanovskyy, D., Su, W., Chen, L., Li, R., Aflaki, P., et al. (2013). Multiview-video-plus-depth coding based on the advanced video coding standard. IEEE Transactions on Image Processing, 22(9), 3449–3458.
ISO/IEC JTC1/SC29/WG11. (2013). Common test conditions of 3DV core experiments. JCT3V-E1100.
ITU-T and ISO/IEC JTC 1. (2014). Advanced video coding for generic audiovisual services. ITU-T Recommendation H.264 and ISO/IEC 14496–10 (MPEG-4 AVC).
Joachimiak, M., Aflaki, P., Hannuksela M. M., & Gabbouj, M. (2014). Evaluation of depth-based super resolution on compressed mixed resolution 3D video. In Asian conference on computer vision.
Joachimiak, M., Hannuksela, M. M., & Gabbouj, M. (2014). View synthesis quality mapping for depth-based super resolution on mixed resolution 3D video. In 3DTV-conference: The true vision-capture, transmission and display of 3D video (3DTV-CON).
Julesz, B. (1971). Foundations of cyclopean perception (xiv 406 pp.). University of Chicago Press.
Kauff, P., Atzpadin, N., Fehn, C., Muller, M., Schreer, O., Smolic, A., et al. (2007). Depth map creation and image-based rendering for advanced 3DTV services providing interoperability and scalability. EURASIP International Journal on Signal Processing, 22(2), 217–234.
Kurutepe, E., Civanlar, M. R., & Tekalp, A. M. (2007). Client-driven selective streaming of multiview video for interactive 3DTV. IEEE Transactions on Circuits and Systems for Video Technology, 17(11), 1558–1565.
Meegan, D. V., Stelmach, L. B., & Tam, W. J. (2001). Unequal weighting of monocular inputs in binocular combination: implications for the compression of stereoscopic imagery. Journal of Experimental Psychology: Applied, 7(2), 143.
Milanfar, P. (Ed.). (2010). Super-resolution imaging. Boca Raton: CRC Press.
Oh, B. T., Lee, J., & Park, D. S. (2011). Depth map coding based on synthesized view distortion function. IEEE Journal of Selected Topics in Signal Processing, 5(7), 1344–1352.
Ortega, A., & Ramchandran, K. (1998). Rate-distortion methods for image and video compression. IEEE Signal Processing Magazine, 15(6), 23–50.
Perkins, M. G. (1992). Data compression of stereopairs. IEEE Transactions on Communications, 40(4), 684–696.
Quan, J., Hannuksela, M. M., & Li, H. (2011). Asymmetric spatial scalability in stereoscopic video coding. In 3DTV conference: The true vision-capture, transmission and display of 3D video (3DTV-CON).
Rusanovskyy, D., Hannuksela, M. M., & Su, W. (2013). Depth-based coding of MVD data for 3D video extension of H. 264/AVC. 3D Research, 4(2), 1–10.
Saygili, G., Gurler, C. G., & Tekalp, A. M. (2010). Quality assessment of asymmetric stereo video coding. In 17th IEEE international conference on image processing (ICIP) (pp. 4009–4012).
Schwarz, H., et al. (2011). Description of 3D Video Technology Proposal by Fraunhofer HHI (MVC compatible). ISO/IEC JTC1/SC29/WG11 MPEG2011/M22569.
Seuntiens, P., Meesters, L., & Ijsselsteijn, W. (2006). Perceived quality of compressed stereoscopic images: Effects of symmetric and asymmetric JPEG coding and camera separation. ACM Transactions on Applied Perception (TAP), 3(2), 95–109.
Stelmach, L., Tam, W. J., Meegan, D., & Vincent, A. (2000). Stereo image quality: Effects of mixed spatio-temporal resolution. IEEE Transactions on Circuits and Systems for Video Technology, 10(2), 188–193.
Tam, W. J. (2007). Image and depth quality of asymmetrically coded stereoscopic video for 3D-TV. Joint Video Team document JVT-W094.
Tian, J., Chen, L., & Liu, Z. (2012). Dual regularization-based image resolution enhancement for asymmetric stereoscopic images. Signal Processing, 92(2), 490–497.
Wenge, H., & Feng, L. (2009). Asymmetric stereoscopic video encoding algorithm based on joint compensation prediction. In WRI international conference on communications and mobile computing (Vol. 2, pp. 191–194).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Joachimiak, M., Hannuksela, M.M. & Gabbouj, M. View upsampling optimization for mixed resolution 3D video coding. Multidim Syst Sign Process 27, 763–783 (2016). https://doi.org/10.1007/s11045-015-0332-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11045-015-0332-9