Association for Computing Machinery (ACM)

SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation

Feiyang Chen ¹

Chenye Cui ¹

Chen feiyang ²

Yi Ren ¹

Jinglin Liu ¹

Zhou Zhao ¹

Baoxing Huai ²

Zhefeng Wang ²

Hide authors affiliations Show authors affiliations: 2 affiliations

Zhejiang University, HangZhou, China |

Huawei Cloud, Hangzhou, China |

Publication type: Proceedings Article

Publication date: 2022-10-10

Association for Computing Machinery (ACM)

DOI: 10.1145/3503161.3547854

Copy DOI

Abstract

Deep generative models have achieved significant progress in speech synthesis to date, while high-fidelity singing voice synthesis is still an open problem for its long continuous pronunciation, rich high-frequency parts, and strong expressiveness. Existing neural vocoders designed for text-to-speech cannot directly be applied to singing voice synthesis because they result in glitches and poor high-frequency reconstruction. In this work, we propose SingGAN, a generative adversarial network designed for high-fidelity singing voice synthesis. Specifically, 1) to alleviate the glitch problem in the generated samples, we propose source excitation with the adaptive feature learning filters to expand the receptive field patterns and stabilize long continuous signal generation; and 2) SingGAN introduces global and local discriminators at different scales to enrich low-frequency details and promote high-frequency reconstruction; and 3) To improve the training efficiency, SingGAN includes auxiliary spectrogram losses and sub-band feature matching penalty loss. To the best of our knowledge, SingGAN is the first work designed toward high-fidelity singing voice vocoding. Our evaluation of SingGAN demonstrates the state-of-the-art results with higher-quality (MOS 4.05) samples. Also, SingGAN enables a sample speed of 50x faster than real-time on a single NVIDIA 2080Ti GPU. We further show that SingGAN generalizes well to the mel-spectrogram inversion of unseen singers, and the end-to-end singing voice synthesis system SingGAN-SVS enjoys a two-stage pipeline to transform the music scores into expressive singing voices. Audio samples are available at \url{https://SingGAN.github.io/}

Found

Top-30

Journals

	1 2
Expert Systems with Applications	Expert Systems with Applications, 2, 8% Expert Systems with Applications 2 publications, 8%
International Journal of Web Information Systems	International Journal of Web Information Systems, 1, 4% International Journal of Web Information Systems 1 publication, 4%
ACM Transactions on Multimedia Computing, Communications and Applications	ACM Transactions on Multimedia Computing, Communications and Applications, 1, 4% ACM Transactions on Multimedia Computing, Communications and Applications 1 publication, 4%
Neurocomputing	Neurocomputing, 1, 4% Neurocomputing 1 publication, 4%
Neural Networks	Neural Networks, 1, 4% Neural Networks 1 publication, 4%
Lecture Notes in Computer Science	Lecture Notes in Computer Science, 1, 4% Lecture Notes in Computer Science 1 publication, 4%
IEEE/ACM Transactions on Audio Speech and Language Processing	IEEE/ACM Transactions on Audio Speech and Language Processing, 1, 4% IEEE/ACM Transactions on Audio Speech and Language Processing 1 publication, 4%
Industrial Management and Data Systems	Industrial Management and Data Systems, 1, 4% Industrial Management and Data Systems 1 publication, 4%
	1 2

Publishers

	2 4 6 8 10 12
Institute of Electrical and Electronics Engineers (IEEE)	Institute of Electrical and Electronics Engineers (IEEE), 11, 44% Institute of Electrical and Electronics Engineers (IEEE) 11 publications, 44%
Elsevier	Elsevier, 4, 16% Elsevier 4 publications, 16%
Association for Computing Machinery (ACM)	Association for Computing Machinery (ACM), 3, 12% Association for Computing Machinery (ACM) 3 publications, 12%
Emerald	Emerald, 2, 8% Emerald 2 publications, 8%
Springer Nature	Springer Nature, 1, 4% Springer Nature 1 publication, 4%
	2 4 6 8 10 12

We do not take into account publications without a DOI.
Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.

Metrics

Publisher

Association for Computing Machinery (ACM)