End-to-end multi-domain joint coding framework for 3D light field video based on viewpoint-disparity representation

Key Points

An 81% BD-rate reduction and a 1.998 dB BD-PSNR improvement are achieved compared to the MV-HEVC standard.
The approach implements a closed-loop pathway that links motion vectors, disparity, and view synthesis.
Decoding time is decoupled from viewpoint quantity, with a stable latency of 28 ms during 96-viewpoint rendering.
The method establishes a robust theoretical foundation for efficient real-time transmission of 3D light field video.

Abstract

This paper proposes an end-to-end, multi-domain joint compression method for 3D light field video based on a viewpoint-disparity representation. By compressing dense viewpoints into sparse viewpoints with associated disparity and establishing a closed-loop “motion vector → disparity → view synthesis” pathway, our method achieves an 81% BD-rate reduction and a 1.998 dB BD-PSNR improvement compared to the MV-HEVC standard. Furthermore, the approach successfully decouples decoding time from the number of viewpoints, maintaining a stable latency of 28 ms during 96-viewpoint rendering. This work provides an effective solution for efficient compression of dense 3D light field video while establishing a theoretical foundation for its real-time transmission.
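
For readers unfamiliar with the BD-rate figure quoted above, the following is a minimal sketch of the standard Bjøntegaard delta-rate calculation, assuming four rate/PSNR points per codec and a cubic fit of log-bitrate against PSNR. The function name `bd_rate` and the sample numbers are illustrative assumptions and are not taken from the paper; the authors' exact evaluation script may differ.

```python
import numpy as np

def bd_rate(anchor_rates, anchor_psnr, test_rates, test_psnr):
    """Average bitrate difference (%) of the test codec vs. the anchor at equal PSNR."""
    log_a = np.log(np.asarray(anchor_rates, dtype=float))
    log_t = np.log(np.asarray(test_rates, dtype=float))
    psnr_a = np.asarray(anchor_psnr, dtype=float)
    psnr_t = np.asarray(test_psnr, dtype=float)

    # Fit log-rate as a cubic polynomial of PSNR for each codec.
    poly_a = np.polyfit(psnr_a, log_a, 3)
    poly_t = np.polyfit(psnr_t, log_t, 3)

    # Integrate both fits over the PSNR interval the two curves share.
    lo = max(psnr_a.min(), psnr_t.min())
    hi = min(psnr_a.max(), psnr_t.max())
    int_a = np.polyval(np.polyint(poly_a), hi) - np.polyval(np.polyint(poly_a), lo)
    int_t = np.polyval(np.polyint(poly_t), hi) - np.polyval(np.polyint(poly_t), lo)

    # Mean log-rate gap, converted back to a percentage.
    avg_diff = (int_t - int_a) / (hi - lo)
    return (np.exp(avg_diff) - 1.0) * 100.0

# Hypothetical data: the test codec needs 19% of the anchor bitrate at every PSNR.
anchor_rates = [1000.0, 2000.0, 4000.0, 8000.0]   # kbps, made-up values
psnr_points = [32.0, 35.0, 38.0, 41.0]            # dB, made-up values
test_rates = [0.19 * r for r in anchor_rates]

print(round(bd_rate(anchor_rates, psnr_points, test_rates, psnr_points), 2))
```

With these made-up inputs the result is approximately -81, which is how an "81% BD-rate reduction" reads: on average, the test codec needs about 81% less bitrate than the anchor to reach the same PSNR.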
