Deep Learning Enabled Semantic Communications With Speech Recognition and Synthesis
Queen Mary University of London · Tsinghua University · +2 more institutions
Abstract
In this paper, we develop a deep learning based semantic communication system for speech transmission, named DeepSC-ST. We take the speech recognition and speech synthesis as the transmission tasks of the communication system, respectively. First, the speech recognition-related semantic features are extracted for transmission by a joint semantic-channel encoder and the text is recovered at the receiver based on the received semantic features, which significantly reduces the required amount of data transmission without performance degradation. Then, we perform speech synthesis at the receiver, which dedicates to re-generate the speech signals by feeding the recognized text and the speaker information into a…
Citation impact
- FWCI
- 40.73
- Percentile
- 100%
- References
- 71
Authors
6Topics & keywords
- Computer science
- Speech recognition
- Channel (broadcasting)
- Voice activity detection
- Transmission (telecommunications)
- Encoder
- Artificial neural network
- Communications system
- Peace, Justice and strong institutions