articleIEEE Transactions on Wireless CommunicationsFeb 6, 2023HYBRID OA

Deep Learning Enabled Semantic Communications With Speech Recognition and Synthesis

Queen Mary University of London · Tsinghua University · +2 more institutions

Indexed incrossref

Abstract

In this paper, we develop a deep learning based semantic communication system for speech transmission, named DeepSC-ST. We take the speech recognition and speech synthesis as the transmission tasks of the communication system, respectively. First, the speech recognition-related semantic features are extracted for transmission by a joint semantic-channel encoder and the text is recovered at the receiver based on the received semantic features, which significantly reduces the required amount of data transmission without performance degradation. Then, we perform speech synthesis at the receiver, which dedicates to re-generate the speech signals by feeding the recognized text and the speaker information into a…

Citation impact

248
total citations
FWCI
40.73
Percentile
100%
References
71
Citations per year

Authors

6

Topics & keywords

Keywords
  • Computer science
  • Speech recognition
  • Channel (broadcasting)
  • Voice activity detection
  • Transmission (telecommunications)
  • Encoder
  • Artificial neural network
  • Communications system
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.

Funding