articleOct 31, 2016Closed access
Video-based emotion recognition using CNN-RNN and C3D hybrid networks
Indexed incrossref
Abstract
In this paper, we present a video-based emotion recognition system submitted to the EmotiW 2016 Challenge. The core module of this system is a hybrid network that combines recurrent neural network (RNN) and 3D convolutional networks (C3D) in a late-fusion fashion. RNN and C3D encode appearance and motion information in different ways. Specifically, RNN takes appearance features extracted by convolutional neural network (CNN) over individual video frames as input and encodes motion later, while C3D models appearance and motion of video simultaneously. Combined with an audio module, our system achieved a recognition accuracy of 59.02% without using any additional emotion-labeled video clips in training set,…
Citation impact
558
total citations
- FWCI
- 27.03
- Percentile
- 100%
- References
- 30
Citations per year
Authors
4Topics & keywords
Topics
Keywords
- Recurrent neural network
- Computer science
- Artificial intelligence
- Convolutional neural network
- Emotion recognition
- ENCODE
- Set (abstract data type)
- Computer vision
No related works found for this paper.