Adieu features? End-to-end speech emotion recognition using a deep convolutional recurrent network

Trigeorgis, George; Ringeval, Fabien; Brueckner, Raymond; Marchi, Erik; Nicolaou, Mihalis A.; Schuller, Björn W.; Zafeiriou, Stefanos

doi:10.1109/icassp.2016.7472669

Article · Mar 1, 2016 · Closed access

Imperial College London · Technical University of Munich · +2 more institutions

Indexed in Crossref

Abstract

The automatic recognition of spontaneous emotions from speech is a challenging task. On the one hand, acoustic features need to be robust enough to capture the emotional content for various styles of speaking, while on the other, machine learning algorithms need to be insensitive to outliers while being able to model the context. Whereas the latter has been tackled by the use of Long Short-Term Memory (LSTM) networks, the former is still under very active investigation, even though more than a decade of research has provided a large set of acoustic descriptors. In this paper, we propose a solution to the problem of ‘context-aware’ emotionally relevant feature extraction, by combining Convolutional Neural…

Citation impact

Total citations: 858

FWCI: 109.70
Percentile: 100%
References: 42

Authors: 7

Topics & keywords

Keywords

End-to-end principle
Computer science
Deep learning
Speech recognition
Artificial intelligence
