Light field multi-view video coding with two-directional parallel inter-view prediction

Wang, Gengkun, Xiang, Wei, Pickering, Mark, and Chen, Chang Wen (2016) Light field multi-view video coding with two-directional parallel inter-view prediction. IEEE Transactions on Image Processing, 25 (11). pp. 5104-5117.

Light field (LF) technology has been widely adopted by a wide range of conventional industries. However, one problem when dealing with LFs is the sheer volume of data. Many multi-view video coding (MVC)-based LF video coding methods have been reported in the literature, each aiming to find the best prediction structure for LF video coding. The number of possible prediction structures is unlimited, and it has been observed that the coding bit-rate can be reduced by increasing the number of bi-directionally encoded views in the prediction structure. However, no work has been conducted to analyze the relationship between the prediction structure and its coding performance. In light of this observation, we first design a new LF-MVC prediction structure by extending the inter-view prediction into a two-directional parallel structure. Analytical models for source coding rate and encoding time are developed to analyze their relationships with the prediction structure, and are proven to be well matched to our experimental results. Experimental evaluation on two LF video sequences demonstrates that the proposed LF-MVC prediction structure can achieve a 26% bit-rate reduction against the conventional MVC prediction structure for an LF video with 5 × 5 views, and a further 34% bit-rate reduction for an LF video with a larger 10 × 10 views. Compared with the state-of-the-art MVC-based LF video coding prediction structures in the literature, LF-MVC achieves the best coding performance and, with its high encoding efficiency, is well suited for deployment in practical LF-based 3D systems.

Item ID: 46127
Item Type: Article (Research - C1)
ISSN: 1941-0042
Keywords: light field video, multi-view video coding, prediction structure, source coding rate, encoding time
Date Deposited: 19 Oct 2016 02:11
FoR Codes: 40 ENGINEERING > 4006 Communications engineering > 400699 Communications engineering not elsewhere classified @ 50%
46 INFORMATION AND COMPUTING SCIENCES > 4603 Computer vision and multimedia computation > 460306 Image processing @ 50%
SEO Codes: 89 INFORMATION AND COMMUNICATION SERVICES > 8999 Other Information and Communication Services > 899999 Information and Communication Services not elsewhere classified @ 100%
