Construction and evaluation of a robust multifeature speech/music discriminator

Scheirer, Eric D.; Slaney, Malcolm

doi:10.1109/icassp.1997.596192

articleNov 22, 2002Closed access

Construction and evaluation of a robust multifeature speech/music discriminator

EDEric D. Scheirer MSMalcolm Slaney

MIT Lincoln Laboratory

Indexed incrossref

Abstract

We report on the construction of a real-time computer system capable of distinguishing speech signals from music signals over a wide range of digital audio input. We have examined 13 features intended to measure conceptually distinct properties of speech and/or music signals, and combined them in several multidimensional classification frameworks. We provide extensive data on system performance and the cross-validated training/test setup used to evaluate the system. For the datasets currently in use, the best classifier classifies with 5.8% error on a frame-by-frame basis, and 1.4% error when integrating long (2.4 second) segments of sound.

Citation impact

874

total citations

FWCI: 51.21
Percentile: 100%
References: 10

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Discriminator
Computer science
Speech recognition
Frame (networking)
Classifier (UML)
Pitch detection algorithm
Speech processing
Artificial intelligence

UN Sustainable Development Goals

Reduced inequalities

No related works found for this paper.