Article · Oct 31, 2016 · Closed access

Video-based emotion recognition using CNN-RNN and C3D hybrid networks

iQIYI (China)

Indexed in Crossref

Abstract

In this paper, we present a video-based emotion recognition system submitted to the EmotiW 2016 Challenge. The core module of this system is a hybrid network that combines a recurrent neural network (RNN) and a 3D convolutional network (C3D) in a late-fusion fashion. RNN and C3D encode appearance and motion information in different ways. Specifically, the RNN takes appearance features extracted by a convolutional neural network (CNN) over individual video frames as input and encodes motion later, while C3D models the appearance and motion of video simultaneously. Combined with an audio module, our system achieved a recognition accuracy of 59.02% without using any additional emotion-labeled video clips in the training set,…
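The late-fusion step mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each module (CNN-RNN, C3D, audio) outputs one score per emotion class and that the scores are combined by a weighted average. The abstract does not state the fusion rule, so the function names, weights, and score values below are hypothetical and are not taken from the paper. The seven class labels are those of the AFEW dataset used in EmotiW.

```python
from typing import Dict, List, Sequence

# The seven emotion classes of the AFEW dataset used in EmotiW.
EMOTIONS = ["Angry", "Disgust", "Fear", "Happy", "Neutral", "Sad", "Surprise"]


def late_fusion(scores: Dict[str, Sequence[float]],
                weights: Dict[str, float]) -> List[float]:
    """Return the weighted average of per-class scores from several models.

    Assumption: weighted averaging is used only for illustration; the
    abstract does not say how the authors fuse their modules.
    """
    total = sum(weights[name] for name in scores)
    if total <= 0:
        raise ValueError("fusion weights must sum to a positive value")
    fused = [0.0] * len(EMOTIONS)
    for name, model_scores in scores.items():
        if len(model_scores) != len(EMOTIONS):
            raise ValueError(f"{name}: expected {len(EMOTIONS)} scores")
        w = weights[name] / total  # normalise so the weights sum to 1
        for i, value in enumerate(model_scores):
            fused[i] += w * value
    return fused


def predict(scores: Dict[str, Sequence[float]],
            weights: Dict[str, float]) -> str:
    """Return the emotion label with the highest fused score."""
    fused = late_fusion(scores, weights)
    best = max(range(len(fused)), key=fused.__getitem__)
    return EMOTIONS[best]


# Hypothetical per-class probabilities for a single video clip.
example_scores = {
    "cnn_rnn": [0.10, 0.05, 0.05, 0.50, 0.20, 0.05, 0.05],
    "c3d":     [0.05, 0.05, 0.10, 0.30, 0.40, 0.05, 0.05],
    "audio":   [0.10, 0.10, 0.10, 0.20, 0.30, 0.10, 0.10],
}
# Hypothetical fusion weights; in practice they would be tuned on validation data.
example_weights = {"cnn_rnn": 0.4, "c3d": 0.4, "audio": 0.2}

print(predict(example_scores, example_weights))  # prints "Happy"
```

In this example C3D and audio both favour "Neutral" on their own, but the strong "Happy" score from the CNN-RNN branch wins after weighting, which is the kind of complementary behaviour late fusion is meant to exploit.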

Citation impact

Total citations: 558
FWCI: 27.03
Percentile: 100%
References: 30

Authors

4

Topics & keywords

Keywords
  • Recurrent neural network
  • Computer science
  • Artificial intelligence
  • Convolutional neural network
  • Emotion recognition
  • ENCODE
  • Set (abstract data type)
  • Computer vision