Environmental Sound Recognition With Time–Frequency Audio Features

University of Southern California

Indexed incrossref

Abstract

The paper considers the task of recognizing environmental sounds for the understanding of a scene or context surrounding an audio sensor. A variety of features have been proposed for audio recognition, including the popular Mel-frequency cepstral coefficients (MFCCs) which describe the audio spectral shape. Environmental sounds, such as chirpings of insects and sounds of rain which are typically noise-like with a broad flat spectrum, may include strong temporal domain signatures. However, only few temporal-domain features have been developed to characterize such diverse audio signals previously. Here, we perform an empirical feature analysis for audio environment characterization and propose to use the…

Citation impact

636
total citations
FWCI
28.56
Percentile
100%
References
50
Citations per year

Authors

3

Topics & keywords

Keywords
  • Mel-frequency cepstrum
  • Computer science
  • Speech recognition
  • Environmental noise
  • Feature (linguistics)
  • Context (archaeology)
  • Artificial intelligence
  • Active listening
UN Sustainable Development Goals
  • Life in Land
No related works found for this paper.