Article
IEEE Transactions on Affective Computing
May 3, 2022
Closed access

Hybrid Contrastive Learning of Tri-Modal Representation for Multimodal Sentiment Analysis

Sun Yat-sen University

Indexed in Crossref

Abstract

The wide application of smart devices makes multimodal data available, which can be utilized in many tasks. In the field of multimodal sentiment analysis, most previous works focus on exploring intra- and inter-modal interactions. However, training a network with cross-modal information (language, audio, and visual) is still challenging due to the modality gap. Besides, while learning the dynamics within each sample draws great attention, the learning of inter-sample and inter-class relationships is neglected. Moreover, the size of datasets limits the generalization ability of the models. To address the aforementioned issues, we propose a novel framework HyCon for hybrid contrastive learning of…

Citation impact

Total citations: 238
FWCI: 29.57
Percentile: 100%
References: 96

Authors

4

Topics & keywords

Keywords
  • Computer science
  • Modality (human–computer interaction)
  • Artificial intelligence
  • Sentiment analysis
  • Generalization
  • Margin (machine learning)
  • Representation (politics)
  • Modal
UN Sustainable Development Goals
  • Quality Education

Funding