Comparing Recognition Performance and Robustness of Multimodal Deep Learning Models for Multimodal Emotion Recognition
Shanghai Jiao Tong University · Ruijin Hospital · +3 more institutions
Abstract
Multimodal signals are powerful for emotion recognition since they can represent emotions comprehensively. In this article, we compare the recognition performance and robustness of two multimodal emotion recognition models: 1) deep canonical correlation analysis (DCCA) and 2) bimodal deep autoencoder (BDAE). The contributions of this article are threefold: 1) we propose two methods for extending the original DCCA model for multimodal fusion: a) weighted sum fusion and b) attention-based fusion; 2) we systemically compare the performance of DCCA, BDAE, and traditional approaches on five multimodal data sets; and 3) we investigate the robustness of DCCA, BDAE, and traditional approaches on SEED-V and DREAMER…
Citation impact
- FWCI
- 68.96
- Percentile
- 100%
- References
- 70
Authors
4- WLWei LiuCorresponding
Shanghai Jiao Tong University, Ruijin Hospital, Shanghai Municipal Education Commission
- JQJielin Qiu
Carnegie Mellon University
- WZWei‐Long Zheng
Massachusetts Institute of Technology
- BLBao‐Liang Lu
Shanghai Jiao Tong University, Ruijin Hospital, Shanghai Municipal Education Commission
Topics & keywords
- Robustness (evolution)
- Computer science
- Artificial intelligence
- Discriminative model
- Pattern recognition (psychology)
- Autoencoder
- Data set
- Machine learning
- Reduced inequalities