Deep clustering: Discriminative embeddings for segmentation and separation
Mitsubishi Electric (United States) · Columbia University
Abstract
We address the problem of "cocktail-party" source separation in a deep learning framework called deep clustering. Previous deep network approaches to separation have shown promising performance in scenarios with a fixed number of sources, each belonging to a distinct signal class, such as speech and noise. However, for arbitrary source classes and number, "class-based" methods are not suitable. Instead, we train a deep network to assign contrastive embedding vectors to each time-frequency region of the spectrogram in order to implicitly predict the segmentation labels of the target spectrogram from the input mixtures. This yields a deep network-based analogue to spectral clustering, in that the embeddings form…
Citation impact
- FWCI
- 93.92
- Percentile
- 100%
- References
- 47
Authors
4Topics & keywords
- Spectrogram
- Cluster analysis
- Computer science
- Spectral clustering
- Artificial intelligence
- Deep learning
- Pattern recognition (psychology)
- Embedding
- Reduced inequalities