articleApr 9, 2020Closed access

Parallel Wavegan: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram

Line Corporation (Japan) · Naver (South Korea)

Indexed incrossref

Abstract

We propose Parallel WaveGAN, a distillation-free, fast, and small-footprint waveform generation method using a generative adversarial network. In the proposed method, a non-autoregressive WaveNet is trained by jointly optimizing multi-resolution spectrogram and adversarial loss functions, which can effectively capture the time-frequency distribution of the realistic speech waveform. As our method does not require density distillation used in the conventional teacher-student framework, the entire model can be easily trained. Furthermore, our model is able to generate high-fidelity speech even with its compact architecture. In particular, the proposed Parallel WaveGAN has only 1.44 M parameters and can generate…

Citation impact

836
total citations
FWCI
84.03
Percentile
100%
References
50
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Spectrogram
  • Waveform
  • Autoregressive model
  • Generative adversarial network
  • High fidelity
  • Adversarial system
  • Distillation
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.