Blind Separation of Speech Mixtures via Time-Frequency Masking
University of Maryland, College Park · University College Dublin
Abstract
Binary time-frequency masks are powerful tools for the separation of sources from a single mixture. Perfect demixing via binary time-frequency masks is possible provided the time-frequency representations of the sources do not overlap: a condition we call W-disjoint orthogonality. We introduce here the concept of approximate W-disjoint orthogonality and present experimental results demonstrating the level of approximate W-disjoint orthogonality of speech in mixtures of various orders. The results demonstrate that there exist ideal binary time-frequency masks that can separate several speech signals from one mixture. While determining these masks blindly from just one mixture is an open problem, we show that we…
Citation impact
- FWCI
- 51.74
- Percentile
- 100%
- References
- 29
Authors
2Topics & keywords
- Disjoint sets
- Binary number
- Orthogonality
- Estimator
- Histogram
- Mixing (physics)
- Mathematics
- Mixture model