Learning-based high-efficiency compression framework for light field videos

Wang, Bing, Xiang, Wei, Wang, Eric, Peng, Qiang, Gao, Pan, and Wu, Xiao (2022) Learning-based high-efficiency compression framework for light field videos. Multimedia Tools and Applications, 81 (6). pp. 7527-7560.

View at Publisher Website: https://doi.org/10.1007/s11042-022-11955...


The massive volume of data required to represent light field (LF) information poses major challenges for efficient compression. Several LF video compression methods reported in the literature focus on finding efficient prediction structures. However, the number of possible prediction structures is infinite, and these methods fail to fully exploit the intrinsic geometry between the views of an LF video. In this paper, we propose a deep learning-based high-efficiency LF video compression framework that exploits the inherent geometrical structure of LF videos. The proposed framework is composed of several crucial components: sparse coding based on a universal view sampling method (UVSM) and a CNN-based LF view synthesis algorithm (LF-CNN); a high-efficiency adaptive prediction structure (APS); and a synthesized candidate reference (SCR)-based inter-frame prediction strategy. Specifically, instead of encoding all the views in an LF video, only a subset of the views is compressed, while the remaining views are reconstructed from the encoded views with LF-CNN. The prediction structure of the selected views adapts to the similarity between views. Inspired by the effectiveness of view synthesis algorithms, synthesized results serve as additional candidate references to further reduce inter-frame redundancy. Experimental results show that the proposed LF video compression framework achieves average bitrate savings of over 34% against state-of-the-art LF video compression methods across multiple LF video datasets.
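
The two ideas of coding only a subset of views and letting the prediction structure follow inter-view similarity can be illustrated with a minimal Python sketch. It is not the paper's UVSM, APS, or LF-CNN: the regular sampling pattern, the mean-absolute-difference similarity measure, and the names `sample_views`, `mean_abs_diff`, and `choose_references` are all assumptions made purely for illustration.

```python
from typing import Dict, List, Optional, Tuple

View = Tuple[int, int]  # (row, col) position of a view in the LF view grid


def sample_views(rows: int, cols: int, step: int = 2) -> List[View]:
    """Select every `step`-th view in both directions.

    Hypothetical regular pattern; the paper's UVSM is not specified here.
    """
    return [(r, c) for r in range(0, rows, step) for c in range(0, cols, step)]


def mean_abs_diff(a: List[float], b: List[float]) -> float:
    """Dissimilarity of two views given as equal-length flat pixel lists."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def choose_references(
    order: List[View], pixels: Dict[View, List[float]]
) -> Dict[View, Optional[View]]:
    """For each view in coding order, pick the most similar already-coded view.

    The first view has no reference (None), i.e. it would be intra-coded.
    """
    refs: Dict[View, Optional[View]] = {}
    coded: List[View] = []
    for v in order:
        if coded:
            refs[v] = min(coded, key=lambda u: mean_abs_diff(pixels[v], pixels[u]))
        else:
            refs[v] = None
        coded.append(v)
    return refs


# Toy 3x3 view grid: only the four corner views are selected for coding.
selected = sample_views(3, 3, step=2)
toy_pixels = {
    (0, 0): [0.0, 0.0],
    (0, 2): [10.0, 10.0],
    (2, 0): [1.0, 1.0],
    (2, 2): [9.0, 9.0],
}
references = choose_references(selected, toy_pixels)
```

In this toy setup the views that are not selected would be reconstructed by a synthesis step after decoding, which the sketch leaves out.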

Item ID: 74603
Item Type: Article (Research - C1)
ISSN: 1573-7721
Keywords: Light field video compression, Prediction structure, Sparse coding, View synthesis
Copyright Information: © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022
Date Deposited: 24 Nov 2022 01:51
FoR Codes: 40 ENGINEERING > 4009 Electronics, sensors and digital hardware > 400999 Electronics, sensors and digital hardware not elsewhere classified @ 100%