Leveraging limited data to synthesize an additional training set is essential for robotic vision, particularly in dynamic environments where collecting large datasets is impractical. Traditional robotic vision systems rely on extensive training data for object recognition and scene understanding but struggle to generalize to real-world variations, such as lighting conditions, occlusions, and sensor noise. This article proposes causal diffuse variational autoencoder (causal DiffuseVAE), a novel method integrating causal inference with high-fidelity image synthesis to generate counterfactual images. By combining the disentanglement properties of variational autoencoders (VAEs) with the generative capabilities of diffusion models, causal DiffuseVAE produces realistic, interpretable simulations of variations, such as shadows and occlusions. This combination enables data-efficient generative modeling by learning from small subsets and synthesizing missing or unseen samples. In addition, causal inference ensures that generated data follow real-world dependencies, making it robust and interpretable for deployment in unpredictable environments. Four baseline approaches are evaluated across six different datasets, demonstrating that causal DiffuseVAE consistently outperforms the four baseline approaches.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhaoan Ye
Dezong Zhao
Li Zhang
IEEE Transactions on Industrial Informatics
ENLIGHTEN (Jurnal Bimbingan dan Konseling Islam)
University of Glasgow
Royal Holloway University of London
Building similarity graph...
Analyzing shared references across papers
Loading...
Ye et al. (Thu,) studied this question.
www.synapsesocial.com/papers/69a75abec6e9836116a20f3d — DOI: https://doi.org/10.1109/tii.2026.3652484