Hybrid Contrastive Learning of Tri-Modal Representation for Multimodal Sentiment Analysis

Mai, Sijie; Zeng, Ying; Zheng, Shuangjia; Hu, Haifeng

doi:10.1109/taffc.2022.3172360

articleIEEE Transactions on Affective ComputingMay 3, 2022Closed access

Hybrid Contrastive Learning of Tri-Modal Representation for Multimodal Sentiment Analysis

SMSijie Mai YZYing Zeng SZShuangjia Zheng HHHaifeng Hu

Sun Yat-sen University

Indexed incrossref

Abstract

The wide application of smart devices enables the availability of multimodal data, which can be utilized in many tasks. In the field of multimodal sentiment analysis, most previous works focus on exploring intra- and inter-modal interactions. However, training a network with cross-modal information (language, audio and visual) is still challenging due to the modality gap. Besides, while learning dynamics within each sample draws great attention, the learning of inter-sample and inter-class relationships is neglected. Moreover, the size of datasets limits the generalization ability of the models. To address the afore-mentioned issues, we propose a novel framework HyCon for hybrid contrastive learning of…

Citation impact

238

total citations

FWCI: 29.57
Percentile: 100%
References: 96

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Computer science
Modality (human–computer interaction)
Artificial intelligence
Sentiment analysis
Generalization
Margin (machine learning)
Representation (politics)
Modal

UN Sustainable Development Goals

Quality Education

No related works found for this paper.

Funding

NN
National Natural Science Foundation of China
Award: 62076262