What question did this study set out to answer?

This study aims to develop a privacy-preserving framework for multimodal data fusion using federated learning.

May 8, 2026Open Access

Federated high order tensor fusion for privacy preserving multimodal social media analysis

Key Points

This study aims to develop a privacy-preserving framework for multimodal data fusion using federated learning.
Introduced a federated learning framework integrating high-order tensor-based multimodal data fusion.
Utilized tensor Tucker decomposition to analyze complex relationships between different data types.
Maintained raw data locally to ensure user privacy during the training process.
Achieved higher Mean Average Precision (MAP) in text-dominant conditions on the TREC2017 dataset.
Confirmed effectiveness in modeling intermodal correlations using the CMU-MOSI multimodal sentiment benchmark.
Demonstrated adaptive learning capabilities without increasing redundant model parameters.

Abstract

The rapid evolution of social networks has positioned multimodal content, including text, images, and audio, as a pivotal medium for self-expression and public sentiment analysis. However, existing multimodal fusion methods are often limited by privacy risks, parameter redundancy, and insufficient exploitation of intermodal correlations. To overcome these challenges, this study introduces a novel federated learning framework that integrates high-order tensor-based multimodal data fusion with privacy-aware decentralized training by keeping raw data local. It leverages tensor Tucker decomposition to capture complex spatial and semantic relationships between modalities, enhancing fusion accuracy while supporting user privacy through local data retention. Experimental results on the separate TREC2017 Precision Medicine Track Scientific Abstracts dataset and on the CMU-MOSI multimodal sentiment benchmark demonstrate that the proposed algorithm outperforms existing methods. The TREC2017 experiments validate the framework’s performance in text-dominant conditions (higher Mean Average Precision, MAP)), while the CMU-MOSI experiments confirm the effectiveness of the high-order tensor fusion in modeling intermodal correlations for multimodal tasks. Furthermore, our framework demonstrates adaptive learning capabilities, efficiently processing diverse multimodal data types without expanding redundant model parameters. This research opens new avenues for privacy-aware multimodal data fusion in social media, offering a robust solution for monitoring and managing online public opinion while supporting user privacy through local data retention.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Wan Li

Bin Zhang

Journals

PLoS ONE

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Federated high order tensor fusion for privacy preserving multimodal social media analysis

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study