Article · Nov 22, 2002 · Closed access
Construction and evaluation of a robust multifeature speech/music discriminator
Indexed in Crossref
Abstract
We report on the construction of a real-time computer system capable of distinguishing speech signals from music signals over a wide range of digital audio input. We have examined 13 features intended to measure conceptually distinct properties of speech and/or music signals, and combined them in several multidimensional classification frameworks. We provide extensive data on system performance and the cross-validated training/test setup used to evaluate the system. For the datasets currently in use, the best classifier classifies with 5.8% error on a frame-by-frame basis, and 1.4% error when integrating long (2.4 second) segments of sound.
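The gap between the frame-by-frame error and the segment-level error reported above can be illustrated with a toy example. The sketch below is not the authors' method: it assumes that "integrating" a segment means taking a majority vote over per-frame decisions, and the function names, labels, and the six-frame segment length are all hypothetical choices made for illustration.

```python
from collections import Counter


def integrate_segments(frame_labels, frames_per_segment):
    """Collapse consecutive blocks of frame labels into one label per block.

    Assumption: integration is a simple majority vote. On a tie, the label
    that appears first in the block wins (Counter preserves insertion order).
    """
    segments = []
    for start in range(0, len(frame_labels), frames_per_segment):
        block = frame_labels[start:start + frames_per_segment]
        segments.append(Counter(block).most_common(1)[0][0])
    return segments


def error_rate(predicted, truth):
    """Fraction of positions where the predicted label differs from the truth."""
    wrong = sum(p != t for p, t in zip(predicted, truth))
    return wrong / len(truth)


# Hypothetical data: six speech frames followed by six music frames,
# with one misclassified frame in each half.
frame_truth = ["speech"] * 6 + ["music"] * 6
frame_pred = ["speech", "music", "speech", "speech", "speech", "speech",
              "music", "music", "speech", "music", "music", "music"]

segment_pred = integrate_segments(frame_pred, frames_per_segment=6)
segment_truth = integrate_segments(frame_truth, frames_per_segment=6)

frame_error = error_rate(frame_pred, frame_truth)
segment_error = error_rate(segment_pred, segment_truth)
```

In this toy case, isolated frame errors are outvoted by their neighbours, so the segment-level error is lower than the frame-level error. This is the same qualitative effect the abstract reports, though the real system's integration scheme and frame rate may differ.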
Citation impact
- Total citations: 874
- FWCI: 51.21
- Percentile: 100%
- References: 10
Authors: 2
Topics & keywords
Topics
Keywords
- Discriminator
- Computer science
- Speech recognition
- Frame (networking)
- Classifier (UML)
- Pitch detection algorithm
- Speech processing
- Artificial intelligence
UN Sustainable Development Goals
- Reduced inequalities