This paper proposes an end-to-end, multi-domain joint compression method for 3D light field video based on a viewpoint-disparity representation. By compressing dense viewpoints into sparse viewpoints with associated disparity and establishing a closed-loop “motion vector → disparity → view synthesis” pathway, our method achieves an 81% BD-rate reduction and a 1.998 dB BD-PSNR improvement compared to the MV-HEVC standard. Furthermore, the approach successfully decouples decoding time from the number of viewpoints, maintaining a stable latency of 28 ms during 96-viewpoint rendering. This work provides an effective solution for efficient compression of dense 3D light field video while establishing a theoretical foundation for its real-time transmission.
Li et al. (Wed,) studied this question.