Blind Separation of Speech Mixtures via Time-Frequency Masking

Yılmaz, Özgür; Rickard, Scott

doi:10.1109/tsp.2004.828896

articleIEEE Transactions on Signal ProcessingJun 21, 2004Closed access

Blind Separation of Speech Mixtures via Time-Frequency Masking

ÖYÖzgür Yılmaz SRScott Rickard

University of Maryland, College Park · University College Dublin

Indexed incrossref

Abstract

Binary time-frequency masks are powerful tools for the separation of sources from a single mixture. Perfect demixing via binary time-frequency masks is possible provided the time-frequency representations of the sources do not overlap: a condition we call W-disjoint orthogonality. We introduce here the concept of approximate W-disjoint orthogonality and present experimental results demonstrating the level of approximate W-disjoint orthogonality of speech in mixtures of various orders. The results demonstrate that there exist ideal binary time-frequency masks that can separate several speech signals from one mixture. While determining these masks blindly from just one mixture is an open problem, we show that we…

Citation impact

1,470

total citations

FWCI: 51.74
Percentile: 100%
References: 29

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Disjoint sets
Binary number
Orthogonality
Estimator
Histogram
Mixing (physics)
Mathematics
Mixture model

No related works found for this paper.