Article | Nov 22, 2002 | Closed access

Construction and evaluation of a robust multifeature speech/music discriminator

MIT Lincoln Laboratory

Indexed in Crossref

Abstract

We report on the construction of a real-time computer system capable of distinguishing speech signals from music signals over a wide range of digital audio input. We have examined 13 features intended to measure conceptually distinct properties of speech and/or music signals, and combined them in several multidimensional classification frameworks. We provide extensive data on system performance and the cross-validated training/test setup used to evaluate the system. For the datasets currently in use, the best classifier achieves 5.8% error on a frame-by-frame basis, and 1.4% error when integrating long (2.4 second) segments of sound.
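
The abstract does not say how frame-level decisions are integrated over a 2.4 second segment. As a rough illustration of why integration can lower the error rate, the sketch below assumes a simple majority vote over the frames in each segment. The function name `integrate_segments`, the 20 ms frame length, and the label sequence are all hypothetical and not taken from the paper.

```python
from collections import Counter
from typing import List, Sequence


def integrate_segments(frame_labels: Sequence[str], frames_per_segment: int) -> List[str]:
    """Collapse per-frame labels into one majority-vote label per segment."""
    if frames_per_segment <= 0:
        raise ValueError("frames_per_segment must be positive")
    segment_labels = []
    for start in range(0, len(frame_labels), frames_per_segment):
        window = frame_labels[start:start + frames_per_segment]
        # most_common(1) returns the label with the highest count; ties go
        # to the label seen first in the window.
        segment_labels.append(Counter(window).most_common(1)[0][0])
    return segment_labels


# Assumed framing (not from the paper): 20 ms frames, so 2.4 s spans 120 frames.
FRAME_SECONDS = 0.02
SEGMENT_SECONDS = 2.4
FRAMES_PER_SEGMENT = round(SEGMENT_SECONDS / FRAME_SECONDS)

# Hypothetical frame decisions: a speech segment with three isolated
# frame-level errors, followed by a clean music segment.
frames = ["speech"] * 120
for i in (5, 40, 77):
    frames[i] = "music"
frames += ["music"] * 120

segments = integrate_segments(frames, FRAMES_PER_SEGMENT)
print(segments)
```

In this toy input the three misclassified frames are outvoted, so the segment-level labels contain no errors. That is the general effect the abstract reports when moving from frame-by-frame to segment-level evaluation.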

Citation impact

Total citations: 874
FWCI: 51.21
Percentile: 100%
References: 10

Authors

2

Topics & keywords

Keywords
  • Discriminator
  • Computer science
  • Speech recognition
  • Frame (networking)
  • Classifier (UML)
  • Pitch detection algorithm
  • Speech processing
  • Artificial intelligence
UN Sustainable Development Goals
  • Reduced inequalities