Deep Learning Enabled Semantic Communications With Speech Recognition and Synthesis

Weng, Zhenzi; Qin, Zhijin; Tao, Xiaoming; Pan, Chengkang; Liu, Guangyi; Li, Geoffrey Ye

doi:10.1109/twc.2023.3240969

Article · IEEE Transactions on Wireless Communications · Feb 6, 2023 · Hybrid OA

Queen Mary University of London · Tsinghua University · +2 more institutions

Indexed in Crossref

Abstract

In this paper, we develop a deep learning based semantic communication system for speech transmission, named DeepSC-ST. We take speech recognition and speech synthesis as the transmission tasks of the communication system. First, the speech recognition-related semantic features are extracted for transmission by a joint semantic-channel encoder, and the text is recovered at the receiver from the received semantic features, which significantly reduces the amount of data that must be transmitted without degrading performance. Then, we perform speech synthesis at the receiver, which regenerates the speech signals by feeding the recognized text and the speaker information into a…

Citation impact

248 total citations

FWCI: 40.73
Percentile: 100%
References: 71

Authors: 6

Topics & keywords

Keywords

Computer science
Speech recognition
Channel (broadcasting)
Voice activity detection
Transmission (telecommunications)
Encoder
Artificial neural network
Communications system

UN Sustainable Development Goals

Peace, Justice and Strong Institutions
