Deep Multimodal Representation Learning: A Survey
Fuzhou University · Fujian Normal University
Abstract
Multimodal representation learning, which aims to narrow the heterogeneity gap among different modalities, plays an indispensable role in the utilization of ubiquitous multimodal data. Due to the powerful representation ability with multiple levels of abstraction, deep learning-based multimodal representation learning has attracted much attention in recent years. In this paper, we provided a comprehensive survey on deep multimodal representation learning which has never been concentrated entirely. To facilitate the discussion on how the heterogeneity gap is narrowed, according to the underlying structures in which different modalities are integrated, we category deep multimodal representation learning methods…
Citation impact
- FWCI
- 17.66
- Percentile
- 100%
- References
- 206
Authors
3Topics & keywords
- Computer science
- Modalities
- Representation (politics)
- Multimodal learning
- Feature learning
- Artificial intelligence
- Deep learning
- Key (lock)