articleIEEE Signal Processing LettersJan 23, 2017GREEN OA

Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification

JSJustin SalamonJPJuan Pablo Bello

New York University

Indexed inarxivcrossref

Abstract

The ability of deep convolutional neural networks (CNNs) to learn discriminative spectro-temporal patterns makes them well suited to environmental sound classification. However, the relative scarcity of labeled data has impeded the exploitation of this family of high-capacity models. This study has two primary contributions: first, we propose a deep CNN architecture for environmental sound classification. Second, we propose the use of audio data augmentation for overcoming the problem of data scarcity and explore the influence of different augmentations on the performance of the proposed CNN architecture. Combined with data augmentation, the proposed model produces state-of-the-art results for environmental…

Citation impact

1,292
total citations
FWCI
78.97
Percentile
100%
References
29
Citations per year

Authors

2
  • JS
    Justin SalamonCorresponding

    New York University

  • JP
    Juan Pablo Bello

    New York University

Topics & keywords

Keywords
  • Discriminative model
  • Convolutional neural network
  • Deep learning
  • Scarcity
  • Pattern recognition (psychology)
  • Data modeling
  • Artificial neural network
  • Class (philosophy)
No related works found for this paper.

Funding