Adversarial Training and Cross-modal Feature Fusion in Multimodal Sentiment Analysis

Li, Junhuai, Lin, Chuang, Wang, Huaijun, Zhi, Yuxing, Chen, Jing, and Huang, Tao (2025) Adversarial Training and Cross-modal Feature Fusion in Multimodal Sentiment Analysis. In: Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing. From: ICASSP 2025: IEEE International Conference on Acoustics, Speech and Signal Processing, 6-11 April 2025, Hyderabad, India.

[img] PDF (Published Version) - Published Version
Restricted to Repository staff only

View at Publisher Website: https://doi.org/10.1109/ICASSP49660.2025...
 
1


Abstract

Multimodal sentiment analysis recognizes emotions through text, audio, and visual modalities, but data incompleteness is a major challenge. Existing methods often focus on specific types of deficiencies and perform poorly when multiple types of noise are present simultaneously. To address this issue, we propose a noise-prompted adversarial training framework with a multimodal interaction model to enhance the model's robustness to missing modalities. The model first extracts common and unique features from each modality using a BERT text encoder and a shared-private encoder. Correlation measurements are then used to calculate the similarity between modalities, and a weighting mechanism is applied to the shared features. These features are deeply fused using a Transformer, and adversarial training combined with semantic reconstruction supervision helps the model learn a unified representation of noisy and clean data. Experimental results show that this method significantly improves the performance of multimodal sentiment analysis.

Item ID: 86896
Item Type: Conference Item (Research - E1)
ISBN: 979-8-3503-6874-1
Keywords: Adversarial Training, Deep Fusion, Multimodal Interaction Model, Multimodal Sentiment Analysis, Shared-Private Encoder
Date Deposited: 18 Feb 2026 23:47
FoR Codes: 46 INFORMATION AND COMPUTING SCIENCES > 4602 Artificial intelligence > 460212 Speech recognition @ 70%
46 INFORMATION AND COMPUTING SCIENCES > 4602 Artificial intelligence > 460202 Autonomous agents and multiagent systems @ 30%
SEO Codes: 22 INFORMATION AND COMMUNICATION SERVICES > 2204 Information systems, technologies and services > 220403 Artificial intelligence @ 100%
Downloads: Total: 1
Last 12 Months: 1
More Statistics

Actions (Repository Staff Only)

Item Control Page Item Control Page