Video-based emotion recognition using CNN-RNN and C3D hybrid networks

Yin, Fan; Lu, Xiangju; Li, Dian; Liu, Yuanliu

doi:10.1145/2993148.2997632

Article | Oct 31, 2016 | Closed access

Fan Yin, Xiangju Lu, Dian Li, Yuanliu Liu

iQIYI (China)

Indexed in Crossref

Abstract

In this paper, we present a video-based emotion recognition system submitted to the EmotiW 2016 Challenge. The core module of this system is a hybrid network that combines a recurrent neural network (RNN) and a 3D convolutional network (C3D) in a late-fusion fashion. The RNN and C3D encode appearance and motion information in different ways. Specifically, the RNN takes appearance features extracted by a convolutional neural network (CNN) from individual video frames as input and encodes motion afterwards, while C3D models the appearance and motion of the video simultaneously. Combined with an audio module, our system achieved a recognition accuracy of 59.02% without using any additional emotion-labeled video clips in the training set,…
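The late-fusion step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each module (CNN-RNN, C3D, audio) outputs a probability vector over seven emotion classes and that the vectors are combined by a weighted average. The function names, the class list, the scores, and the fusion weights are all hypothetical and chosen for illustration only.

```python
from typing import Dict, List, Sequence

# Seven emotion classes, assumed here to match the labels used in EmotiW.
EMOTIONS = ["Angry", "Disgust", "Fear", "Happy", "Neutral", "Sad", "Surprise"]


def late_fusion(scores: Dict[str, Sequence[float]],
                weights: Dict[str, float]) -> List[float]:
    """Weighted average of per-model class-probability vectors.

    Weights are normalized to sum to 1, so the fused vector is again a
    probability distribution if every input vector is one.
    """
    total = sum(weights[name] for name in scores)
    if total <= 0:
        raise ValueError("fusion weights must sum to a positive value")
    fused = [0.0] * len(EMOTIONS)
    for name, probs in scores.items():
        if len(probs) != len(EMOTIONS):
            raise ValueError(f"{name}: expected {len(EMOTIONS)} scores")
        w = weights[name] / total
        for i, p in enumerate(probs):
            fused[i] += w * p
    return fused


def predict(fused: Sequence[float]) -> str:
    """Return the emotion label with the highest fused score."""
    best = max(range(len(fused)), key=lambda i: fused[i])
    return EMOTIONS[best]


# Made-up per-model outputs for a single video clip (illustration only).
scores = {
    "cnn_rnn": [0.10, 0.05, 0.05, 0.50, 0.20, 0.05, 0.05],
    "c3d":     [0.05, 0.05, 0.10, 0.30, 0.40, 0.05, 0.05],
    "audio":   [0.10, 0.10, 0.10, 0.40, 0.10, 0.10, 0.10],
}
# Hypothetical fusion weights; in practice these would be tuned on validation data.
weights = {"cnn_rnn": 0.4, "c3d": 0.4, "audio": 0.2}

fused = late_fusion(scores, weights)
label = predict(fused)
```

In this sketch each module is trained and run independently, and only the final class scores are merged, which is what distinguishes late fusion from combining intermediate features.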

Citation impact

Total citations: 558

FWCI: 27.03
Percentile: 100%
References: 30

Authors: 4

Topics & keywords

Keywords

Recurrent neural network
Computer science
Artificial intelligence
Convolutional neural network
Emotion recognition
ENCODE
Set (abstract data type)
Computer vision
