articleOct 13, 2015Closed access

ESC

Warsaw University of Technology

Indexed incrossref

Abstract

One of the obstacles in research activities concentrating on environmental sound classification is the scarcity of suitable and publicly available datasets. This paper tries to address that issue by presenting a new annotated collection of 2000 short clips comprising 50 classes of various common sound events, and an abundant unified compilation of 250000 unlabeled auditory excerpts extracted from recordings available through the Freesound project. The paper also provides an evaluation of human accuracy in classifying environmental sounds and compares it to the performance of selected baseline classifiers using features derived from mel-frequency cepstral coefficients and zero-crossing rate.

Citation impact

1,325
total citations
FWCI
17.97
Percentile
100%
References
17
Citations per year

Authors

1

Topics & keywords

Keywords
  • Computer science
  • Mel-frequency cepstrum
  • Baseline (sea)
  • Speech recognition
  • Scarcity
  • Sound (geography)
  • Artificial intelligence
  • Machine learning
UN Sustainable Development Goals
  • Life in Land
No related works found for this paper.