Environmental Sound Recognition With Time–Frequency Audio Features
University of Southern California
Indexed incrossref
Abstract
The paper considers the task of recognizing environmental sounds for the understanding of a scene or context surrounding an audio sensor. A variety of features have been proposed for audio recognition, including the popular Mel-frequency cepstral coefficients (MFCCs) which describe the audio spectral shape. Environmental sounds, such as chirpings of insects and sounds of rain which are typically noise-like with a broad flat spectrum, may include strong temporal domain signatures. However, only few temporal-domain features have been developed to characterize such diverse audio signals previously. Here, we perform an empirical feature analysis for audio environment characterization and propose to use the…
Citation impact
636
total citations
- FWCI
- 28.56
- Percentile
- 100%
- References
- 50
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Mel-frequency cepstrum
- Computer science
- Speech recognition
- Environmental noise
- Feature (linguistics)
- Context (archaeology)
- Artificial intelligence
- Active listening
UN Sustainable Development Goals
- Life in Land
No related works found for this paper.