Abstract
It can be seen that the success rate of the video compression standard H.264 is quite high owing to its wide range of adoption. However, the legacy version of video compression techniques built on the top of H.264 might ensure reduced bitrate while preserving the quality of the reconstructed video signal to some extent, but it is achieved at the price of compression complexity. Our investigation reveals that H.264 achieves higher compression efficiency with MPEG files but does not ensure optimized performance for intra-prediction mode decision with rate distortion. The prime reason lies into less emphasis on redesigning the encoder with optimized cost function which could improvise the performance of existing Lagrangian cost function (LCF). This area of research was explored and accentuated in many studies but still very few talked in a favor to comprehend the cost optimization performance for encoding I-frame and P-frame in MPEG videos. Unlike existing literature, the study found scope of redesigning the H.264-based encoder module considering the baseline of LCF and in addition introduces a novel cost function to optimize the flow of execution of encoding. In this regard, a computational framework is introduced for realizing the numerical modeling integrating both LCF and proposed CF for H.264 intra-prediction mode decision. The novelty of this system is that it does not have dependency on the operations of entropy coding during rate-distortion (RD) optimization and this approach positively impacts on the computing performance of the system. The evaluated performance outcome shows a detailed comparison between both the methods on the basis of peak-signal-to-noise ratio (PSNR), bit-error rate (bit/s) and encoding time (s). Unlike LCF, the outcome of the study claims its effectiveness in maintaining a well-balanced performance between reconstructed signal quality and encoding time.






Similar content being viewed by others
Data availability
The dataset generated and analyzed during the current study are available from the corresponding author on reasonable request.
References
ISO/IEC 14496-10:2003: Coding of Audiovisual Objects –Part 10: Advanced Video coding. 2003 and ITU-T Recommendation H.264: Advanced video coding for generic audiovisual services (2003).
Stockhammer T, Hannuksela MM, T. Wiegand, “H.264/AVC in wireless environments,” IEEE Trans. circuits and system. Video Technology. 2003;13(7):688–703.
Shen L, Zhang Z, Liu Z. Adaptive inter-mode decision for HEVC jointly utilizing inter-level and spatiotemporal correlations. IEEE Trans Circuits Syst Video Technol. 2014;24(10):1709–22.
Sharabayko MP, Markov NG. Entropy-based intra-coding RDO estimation for HEVC. In: 2014 9th international forum on strategic technology (IFOST). IEEE; 2014. p. 56–9.
Liu X, Yoo K-Y, Kim SW. Low complexity intra prediction algorithm for MPEG-2 to H.264/AVC transcoder. IEEE Trans Consum Electron. 2010;56(2):987–94.
Wu C-Y, Su P-C. Fast intra-coding for H.264/AVC by using projection-based predicted block residuals. IEEE Trans Multimedia. 2013;15(5):1083–93.
Li HL, Ngan KN, Wei ZY. Fast and efficient method for block edge classification and its application in H.264/AVC video coding. IEEE Trans Circuits Syst Video Technol. 2008;18(6):756–68.
Tseng C-H, Wang H-M, Yang J-F. Enhanced intra-4 × 4 mode decision for H.264/AVC coders. IEEE Trans Circuits Syst Video Technol. 2006;16(8):1027–32.
Wang J-C, Wang J-F, Yang J-F, Chen J-T. A fast mode decision algorithm and its VLSI design for H.264/AVC intra-prediction. IEEE Trans Circuits Syst Video Technol. 2007;17(10):1414–22.
Tsai A-C, Paul A, Wang J-C, Wang J-F. Intensity gradient technique for efficent intra-prediction in H.264/AVC. IEEE Trans Circuits Syst Video Technol. 2008;18(5):694–8.
Cassa MB, Naccari M, Pereira F. Fast rate distortion optimization for the emerging HEVC standard. In: Picture Coding Symposium, Poland, May 2012. https://doi.org/10.1109/PCS.2012.6213262.
G. J. Sullivan, T. Wiegand, D. Marpe, and A. Luthra 2004 Text of ISO/IEC14496–10 Advanced Video Coding 3rd Edition, document N6540.doc, ISO/IEC JTC1/SC29/WG11.
Sullivan GJ, Wiegand T. Rate-distortion optimization for video video compression. IEEE Signal Process Mag. 1998;15(6):74–90. https://doi.org/10.1109/79.733497.
Wiegand T, Schwarz H, Joch A, Kossentini F, Sullivan GJ. Rate-constrained coder control and comparison of video coding standards. IEEE Trans Circuits Syst Video Technol. 2003;13(7):688–703.
Do Q, Yo-Sung H. Categorization for fast intra prediction mode decision in H.264/AVC. Consumer Electron IEEE Transact. 2010;56(2):1049–56.
SongLongYangYang YJKG. Complexity scalable intra-prediction mode decision algorithm for mobile video applications. Communications, IET. 2014;8(9):1654–62.
Chen MJ, Wu YD, Yeh CH, Lin KM, Lin SD. Efficient CU and PU decision based on motion information for inter-prediction of HEVC. IEEE Trans Industr Inf. 2018;14(11):4736–5.
LimKimLeePakLee KSJDS. Fast block size and mode decision algorithm for intra prediction in H.264/AVC. Consum Electron IEEE Transact. 2012;58(2):654–60.
Young Lee J, HyunWook Park. “A fast mode decision method based on motion cost and intra prediction cost for H264/AVC,” circuits and systems for video technology. IEEE Transactions on. 2012;22(2):393–402.
Pejman H, Zargari F. An efficient fast intra mode decision method based on orthogonal modes elimination. Consum Electron IEEE Transact. 2012;58(4):1445–52.
Huanqiang Z, Kai-Kuang M, Canhui C. Hierarchical intra mode decision for H264/AVC. Circuits Syst Video Technol IEEE Transact. 2010;20(6):907–12.
You J, Choi C, Jeong J. Modified rate distortion optimization using inter-block dependence for H.264/AVC intra coding. IEEE Trans Consum Electron. 2008;54(3):1383–8. https://doi.org/10.1109/TCE.2008.4637631.
Heng-YaoKuan-HsienBin-DaJar-Ferr LWLY. An efficient VLSI architecture for transform-based intra prediction in H.264/AVC. Circuits Syst Video Technol IEEE Transact. 2010;20(6):894–906.
An-ChaoJhing-FaJar-FerrWei-Guang TWYL. Effective subblock-based and pixel-based fast direction detections for H.264 intra prediction. Circuits Syst Video Technol IEEE Transact. 2008;18(7):9 75-979.
Bharanitharan K, Bin-Da L, Jar-Ferr Y, Wen-Chih T. A low complexity detection of discrete cross differences for fast H.264/AVC intra prediction Multimedia. IEEE Transact. 2008;10:1250–60.
Ghasempour M, Ghanbari M. A low complexity system for multiple data embedding into H.264 coded video bit-stream. IEEE Trans Circuits Syst Video Technol. 2020;30(11):4009–19. https://doi.org/10.1109/TCSVT.2019.2947545.
Zhang H, You W, Zhao X. A Video steganalytic approach against quantized transform coefficient-based H.264 steganography by exploiting in-loop deblocking filtering. IEEE Access. 2020;8:186862–78. https://doi.org/10.1109/ACCESS.2020.3030685.
Xu J, Xu M, Wei Y, Wang Z, Guan Z. Fast H.264 to HEVC transcoding: a deep learning method. IEEE Trans Multimedia. 2019;21(7):1633–45. https://doi.org/10.1109/TMM.2018.2885921.
IEG Richardson (2003) H.264/MPEG-4 Part 10 White Paper: Inter Prediction [Online]. Available: http://www.vcodex.com/h264.html.
T Stockhammer, D Kontopodis, T Wiegand “Ratedistortion optimization for JVT/H.26L video coding in packet loss environment,” in Int. Packet Video Workshop, 2002. [Online]. Available: http://research.microsoft.com/enus/um/beijing/events/pv2002/papers/90-hoermdetoc.pdf.
Sullivan GJ, Wiegand T. Rate-distortion optimization for video compression. Signal Proc Magazine IEEE. 1998;15(6):74–90.
T Wiegand, B Girod, “Lagrange multiplier selection in hybrid video coder control,” In Image Processing, 2001. Proceedings. 2001 International Conference on, vol. 3. IEEE, 2001, pp. 542–545. [Online]. Available: http://iphome.hhi.de/wiegand/assets/gzs/icip01c.ps.gz.
Joint Video Team. Reference Software JM10.2 [Online]. Available: http://iphome.hhi.de/suehring/tml/download/old−jm/.
https://see.xidian.edu.cn/vipsl/database_Video.html#video. Accessed Jan 2023.
Funding
No funding received for this research.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
No conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the topical collection “Advances in Computational Approaches for Image Processing, Wireless Networks, Cloud Applications and Network Security” guest edited by P. Raviraj, Maode Ma and Roopashree H R.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kumari, R.D.A., Udupa, A.N. Optimal Cost Modeling Scheme for Efficient Intra-Prediction Mode in Video Compression. SN COMPUT. SCI. 4, 352 (2023). https://doi.org/10.1007/s42979-023-01737-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42979-023-01737-w