Hybrid Contrastive Learning of Tri-Modal Representation for Multimodal Sentiment Analysis
Indexed incrossref
Abstract
The wide application of smart devices enables the availability of multimodal data, which can be utilized in many tasks. In the field of multimodal sentiment analysis, most previous works focus on exploring intra- and inter-modal interactions. However, training a network with cross-modal information (language, audio and visual) is still challenging due to the modality gap. Besides, while learning dynamics within each sample draws great attention, the learning of inter-sample and inter-class relationships is neglected. Moreover, the size of datasets limits the generalization ability of the models. To address the afore-mentioned issues, we propose a novel framework HyCon for hybrid contrastive learning of…
Citation impact
238
total citations
- FWCI
- 29.57
- Percentile
- 100%
- References
- 96
Citations per year
Authors
4Topics & keywords
Topics
Keywords
- Computer science
- Modality (human–computer interaction)
- Artificial intelligence
- Sentiment analysis
- Generalization
- Margin (machine learning)
- Representation (politics)
- Modal
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.