Compression
Summary
Compression had always played an important role in traditional video communication systems, by enabling the video information to be represented by tens of times lesser amount of bytes. The traditional video coding approaches applied to 2D video signals are also applicable to 3D video representation formats, despite their inability to remove the vast amount of redundancies in them. This chapter focuses on the widely deployed and modern 3D and multi-view video compression techniques. The chapter discusses video coding principles and standards on 3D video coding. Finally, it provides a detailed explanation on the most common forms of 3D video, i.e. stereoscopic, multi-view and multi-view with depth map.
Controlled Vocabulary Terms
stereo image processing; three-dimensional displays; video coding; video communication; video compression
References
- Wiegand, T., Sullivan, G. J., Bjontegaard, G. and Luthra, A. (2003) ‘Overview of the H.264/AVC video coding standard’, IEEE Transactions on Circuits and Systems for Video Technology, 13, 560–576.
- Karczewics, M. and Kurceren, R. (2003) ‘The SP- and SI-frames design for H.264/AVC’, IEEE Transactions on Circuits and Systems for Video Technology, 13, 637–644.
-
Ostermann, J., Bormans, J., List, P., Marpe, D., Narroschke, M., Pereira, F., Stockhammer, T. and Wedi, T. (2004) ‘Video coding with H.264/AVC: Tools, performance, and complexity’, IEEE Circuits and Systems Magazine, 1, 7–28.
10.1109/MCAS.2004.1286980 Google Scholar
- Sullivan, G. and Wiegand, T. (1998) ‘Rate-distortion optimization for video compression’, IEEE Signal Processing Magazine, November.
-
Ghanbari, M. (2003) Standard Codecs: Image Compression to Advanced Video Coding, London: Institution of Engineering and Technology.
10.1049/PBTE049E Google Scholar
- ISO/IEC JTC1/SC29/WG11 Coding of Moving Pictures and Audio, ‘Vision, applications and requirements for high efficiency video coding (HEVC)’, Tech. Rep. N11872, Video and Requirements Subgroups and JCT-VC, 2011.
- Sullivan, G.J., Ohm, J-R., Han, W-J. and Wiegand, T. (2012) ‘Overview of the High Efficiency Video Coding (HEVC) standard’, IEEE Transactions on Circuits and Systems for Video Technology, 22, 1649–1668.
- Lainema, J., Bossen, F., Han, W-J., Min, J. and Ugur, K. (2012) ‘Intra coding of the HEVC standard’, IEEE Transactions on Circuits and Systems for Video Technology, 22, 1792–1801.
- Fu, C.-M., Alshina, E., Alshin, A., Huang, Y-W., Chen, C-Y., Tsai, C-Y., Hsu, C.-W., Lei, S-M., Park, J.-H. and Han, W.-J. (2012) ‘Sample adaptive offset in the HEVC standard’, IEEE Transactions on Circuits and Systems for Video Technology, 22, 1755–1764.
- Sze, V. and Budagavi, M. (2012) ‘High throughput CABAC entropy coding in HEVC’, IEEE Transactions on Circuits and Systems for Video Technology, 22, 1778–1791.
- Balamuralii, B., Eran, E. and Helmut, B. (2005) ‘An extended H.264 CODEC for stereoscopic video coding’, Proceedings of SPIE—The International Society for Optical Engineering, pp. 116–126.
- Moellenho, M.S. and Maier, M.W. (1998) ‘DCT transform coding of stereo images for multimedia applications’, IEEE Transactions on Industrial Electronics, 45, 38–43.
- Aksay, A., Bilen, C., Kurutepe, E., Ozcelebi, T., Akar, G.B., Civanlar, M.R. and Tekalp, A.M. (2006) ‘Temporal and spatial scaling for stereoscopic video compression’, in Proceedings of IEEE European Signal Processing Conf. EUSIPCO 2006, Florence, Italy, September.
- Stelmach, L.B. Tam, W.J. Meegan, D. and Vincent, A. (2000) ‘Stereo image quality: Effects of mixed spatiotemporal resolution’, IEEE Transactions on Circuits Systems for Video Technology, 10, 188–193.
- ISO/IEC 14 496-2 (2001) ‘Generic coding of audio-visual objects part 2: Visual’, Tech. Rep., Doc. N4350.
- Vetro, A., Wiegand, T. and Sullivan, G. (2011) ‘Overview of the stereo and multi-view video coding extensions of the H.264/MPEG-4 AVC standard’, Proceedings of the IEEE, 99, 626–642.
- ISO/IEC JTC1/SC29/WG11 (2001) ‘List of ad-hoc groups established at the 58th meeting in Pattaya’, Tech. Rep., N371.
- Smolic, A. and McCutchen, D. (2004) ‘3DAV exploration of video-based rendering technology in MPEG’, IEEE Transactions on Circuits and Systems for Video Technology, 14, 348–356.
- Schwarz, H., Hinz, T., Smolic, A., Oelbaum, T., Wiegand, T., Mueller, K. and Merkle, P. (2006) ‘Multi-view video coding based on H.264/MPEG4-AVC using hierarchical B pictures’, in Proceedings of Picture Coding Symposium, China.
- Merkle, P., Smolic, A., Muller, K., and Wiegand, T. (2007) ‘Efficient prediction structures for multiview video coding’, IEEE Transactions on Circuits and Systems for Video Technology, 17, 1461–1473.
- Shum, H., Kang, S., and Chan, S. (2003) ‘Survey of image based representations and compression techniques’, IEEE Transactions on Circuits and Systems for Video Technology, 13, 1020–1037.
- Martinian, E., Behrens, A., Xin, J., and Vetro, A. (2006) ‘View synthesis for multiview video compression’, in Proceedings of Picture Coding Symposium, China.
- Kimata, H., Kitahara, M., Kamikura, K. and Yashima, Y. (2004) ‘Multi-view video coding using reference picture selection for freeviewpoint video communication’, in Proceedings of Picture Coding Symposium, Lisbon, Portugal, December.
- Yamamoto, K. (2007) ‘SIMVC: Multi-view video coding using view interpolation and color correction’, IEEE Transactions on Circuits and Systems for Video Technology, 17 (11), 1436–1449.
- Zitnick, C.L. (2004) ‘High-quality video view interpolation using a layered representation’, ACM Siggraph and ACM Transactions on Graphics, 23, (3), 600–608.
- Fehn, C. (2004) ‘Depth-image-based rendering (DIBR), compression and transmission for a new approach on 3DTV’, in Proceedings of SPIE Conference on Stereoscopic Displays and Virtual Reality Systems XI, 5291, CA, USA, pp. 93–104, January.
- Kamolrat, B., Fernando, W. and Mrak, M. (2008) ‘3D motion estimation for depth information compression in 3D-TV applications’, IET Electronic Letters, 44, 1244–1245.
- Grewatsch, S. and Miller, E. (2004) ‘Sharing of motion vectors in 3D video coding’, in IEEE International Conference on Image Processing, Singapore, pp. 3271–3274.
- Ekmekcioğlu, E., Worrall, S.T. and Kondoz, A.M. (2008) ‘Low-delay random view access in multi-view coding using a bit-rate adaptive downsampling approach’, in Proceedings of IEEE International Conference on Multimedia and Expo, pp. 745–748, June.
- Karim, H.A., Worrall, S. and Kondoz, A.M. (2008) ‘Reduced resolution depth compression for scalable 3D video coding’, in Proceedings of Visual Information Engineering, Workshop on Scalable Coded Media Beyond Compression, Xian, China, July.
- Ekmekcioğlu, E., Worrall, S. and Kondoz, A.M. (2008) ‘Bit-rate adaptive downsampling for the coding of multi-view video with depth information’, in Proceedings of 3DTV Conference: The True Vision: Capture, Transmission and Display of 3D Video, Istanbul, Turkey.
- Morvan, Y., Farin, D. and de With, P.H.N. (2007) ‘Depth-image compression based on an R-D optimized quadtree decomposition for the transmission of multiview images’, in IEEE International Conference on Image Processing, San Antonio, TX, September.
- Merkle, P., Smolic, A., Muller, K. and Wiegand, T. (2007) ‘Multi-view video plus depth representation and coding’, in Proceedings of IEEE International Conference on Image Processing 2007, October.
- Klimaszewski, K., Wegner, K. and Dománski, M. (2009) ‘Distortions of synthesized views caused by compression of views and depth maps’, in Proceedings of 3DTV-Conference 2009, The True Vision: Capture, Transmission and Display of 3D Video, Potsdam, Germany, May.
- Tikanmäki, A., Gotchev, A., Smolic, A. and Müller, K. (2008) ‘Quality assessment of 3D video in rate allocation experiments’, in Proceedings of IEEE International Symposium on Consumer Electronics (ISCE'08), Algarve, Portugal, April.
- Morvan, Y., Farin, D. and de With, P.H.N. (2007) ‘Joint depth/texture bit-allocation for multi-view video compression’, in Proceedings of Picture Coding Symposium, Lisbon, Portugal, November.
- Liu, Y. (2009) ‘Compression-induced rendering distortion analysis for texture/depth rate allocation in 3D video compression’, in Proceedings of IEEE Data Compression Conference, pp. 352–361.
- Liu, Y. (2009) ‘Joint video/depth rate allocation for 3D video coding based on view synthesis distortion model’, Proceedings of Signal Processing: Image Communications, 24, 666–681.
- Merkle, P., Morvan, Y., Smolic, A., Farin, D., Müeller, K., de With, P. and Wiegand, T. (2009) ‘The effects of multiview depth video compression on multiview rendering’, Signal Processing: Image Communication, 24, 73–88.
-
Silva, D.D. and Fernando, W. (2009) ‘Intra mode selection for depth map coding to minimize rendering distortions in 3D video’, IEEE Transactions on Consumer Electronics, 55, 2385–2393.
10.1109/TCE.2009.5373814 Google Scholar
- Ekmekcioğlu, E., Velisavljevic, V. and Worrall, S. (2010) ‘Content adaptive enhancement of multi-view depth maps for free viewpoint video’, IEEE Journal of Selected Topics in Signal Processing, 5, 352–361.
- Silva, D.D., Fernando, W., Kodikaraarachchi, H., Worrall, S. and Kondoz, A. (2011) ‘Improved depth map filtering for 3D-TV systems’, in 2011 IEEE International Conference on Consumer Electronics (ICCE), pp. 645–646, January.
- Silva D. D., Fernando, W., Kodikaraarachchi, H., Worrall, S. and Kondoz, A. (2011) ‘Adaptive sharpening of depth maps for 3D-TV’, IET Electronics Letters, 46, 1546–1548.
- Ekmekcioğlu, E., Worrall, S., Velisavljevic, V., Silva, D.D. and Kondoz, A. (2011) ‘Multi-view depth pre-processing using joint filtering for improved coding performance’, Tech. Rep., ISO MPEG Doc m20070, Geneva, March.
- V.R. Group (2008) ‘Vision on 3D video’, Tech. Rep., ISO/IEC JTC1/SC29/WG11 N10357, February.
- Ohm, J-R., Rusanovskyy, D., Vetro, A. and Müller, K. (2012) ‘Work plan in 3D standards development’, Tech. Rep. JCT3V-B1006, Joint Collaborative Team on 3D Video Coding Extension Development, October.