March 3, 2026

Unsupervised multimodal emotion-unified representation learning with dual-level language-driven cross-modal emotion alignment

Emotion alignment enhances understanding of diverse emotional expressions across different media.
Key outcomes demonstrate improved alignment metrics in multimodal datasets across various emotional contexts.
Unsupervised representation learning facilitates cross-modal interactions without requiring labeled data or supervision.
Findings suggest broader implications for AI systems that analyze emotions in text, audio, and visuals.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Shaoze Feng

Qiyin Zhou

Yuanyuan Liu

Pattern Recognition

Huazhong University of Science and Technology

China University of Geosciences

Building similarity graph...

Analyzing shared references across papers