A Regression Approach to Speech Enhancement Based on Deep Neural Networks
University of Science and Technology of China · Georgia Institute of Technology
Abstract
In contrast to the conventional minimum mean square error (MMSE)-based noise reduction techniques, we propose a supervised method to enhance speech by means of finding a mapping function between noisy and clean speech signals based on deep neural networks (DNNs). In order to be able to handle a wide range of additive noises in real-world situations, a large training set that encompasses many possible combinations of speech and noise types, is first designed. A DNN architecture is then employed as a nonlinear regression function to ensure a powerful modeling capability. Several techniques have also been proposed to improve the DNN-based speech enhancement system, including global variance equalization to…
Citation impact
- FWCI
- 53.05
- Percentile
- 100%
- References
- 63
Authors
4Topics & keywords
- Computer science
- Speech enhancement
- Smoothing
- Noise (video)
- Dropout (neural networks)
- Artificial neural network
- Speech recognition
- Minimum mean square error
- Peace, Justice and strong institutions