preprintMar 1, 2016Closed access

Deep clustering: Discriminative embeddings for segmentation and separation

Mitsubishi Electric (United States) · Columbia University

Indexed incrossref

Abstract

We address the problem of "cocktail-party" source separation in a deep learning framework called deep clustering. Previous deep network approaches to separation have shown promising performance in scenarios with a fixed number of sources, each belonging to a distinct signal class, such as speech and noise. However, for arbitrary source classes and number, "class-based" methods are not suitable. Instead, we train a deep network to assign contrastive embedding vectors to each time-frequency region of the spectrogram in order to implicitly predict the segmentation labels of the target spectrogram from the input mixtures. This yields a deep network-based analogue to spectral clustering, in that the embeddings form…

Citation impact

1,427
total citations
FWCI
93.92
Percentile
100%
References
47
Citations per year

Authors

4

Topics & keywords

Keywords
  • Spectrogram
  • Cluster analysis
  • Computer science
  • Spectral clustering
  • Artificial intelligence
  • Deep learning
  • Pattern recognition (psychology)
  • Embedding
UN Sustainable Development Goals
  • Reduced inequalities
No related works found for this paper.