WU, Zhizheng
Associate Professor
Ph.D., Nanyang Technological University
Professor Wu Zhizheng is currently an Associate Professor at The Chinese University of Hong Kong, Shenzhen. He holds the position of Deputy Director at the Shenzhen Key Laboratory of Cross-Modal Cognitive Computing. Professor Wu has been consistently listed in Stanford University’s “World’s Top 2% Scientists” and has received multiple Best Paper Awards.
He earned his Ph.D. from Nanyang Technological University and has held research and leadership roles at internationally renowned institutions, including Meta (formerly Facebook), Apple, the University of Edinburgh, and Microsoft Research Asia.
Professor Wu has initiated several influential open-source projects, such as Merlin, Amphion, and Emilia, which have been adopted by over 700 organizations worldwide, including OpenAI. Notably, Amphion has topped GitHub’s trending list multiple times, while Emilia has become the most popular audio dataset (Most Liked) on HuggingFace. He also initiated and organized the first ASVspoof Challenge and the first Voice Conversion Challenge and served as the organizer of the Blizzard Challenge 2019, a prestigious international speech synthesis competition.
Currently, Professor Wu serves on the editorial boards of IEEE/ACM Transactions on Audio, Speech and Language Processing and IEEE Signal Processing Letters, and is the General Chair of the IEEE Spoken Language Technology Workshop 2024.
Yuancheng Wang, Haoyue Zhan, Liwei Liu, Ruihong Zeng, Haotian Guo, Jiachen Zheng, Qiang Zhang, Xueyao Zhang, Shunsi Zhang, Zhizheng Wu, MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer, ICLR 2025
Yicheng Gu, Xueyao Zhang, Liumeng Xue, Haizhou Li, Zhizheng Wu, An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoders, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
Junyi Ao*, Yuancheng Wang*, Xiaohai Tian, Dekun Chen, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu, SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words, NeurIPS 2024 (Data and Benchmark Track)
Yuancheng Wang, Zeqian Ju, Xu Tan, Lei He, Zhizheng Wu, Jiang Bian, Sheng Zhao, AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models, NeurIPS 2023
Yi Zhou, Zhizheng Wu, Xiaohai Tian, Haizhou Li, Optimization of Cross-Lingual Voice Conversion With Linguistics Losses to Reduce Foreign Accents, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
Zhizheng Wu, Junichi Yamagishi, Tomi Kinnunen, Cemal Hanilçi, Mohammed Sahidullah, Aleksandr Sizov, Nicholas Evans, Massimiliano Todisco, Hector Delgado, Asvspoof: The automatic speaker verification spoofing and countermeasures challenge, IEEE Journal of Selected Topics in Signal Processing, Vol.11, 588-604, 2017.
Zhizheng Wu, Oliver Watts, Simon King, Merlin: An Open Source Neural Network Speech Synthesis System, SSW, 202-207, 2016.
Zhizheng Wu, Cassia Valentini-Botinhao, Oliver Watts, Simon King, Deep Neural Networks Employing Multi-task Learning and Stacked Bottleneck Features for Speech Synthesis, ICASSP, 2015.
Zhizheng Wu, Nicholas Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, Haizhou Li, Spoofing and countermeasures for speaker verification: a survey, Speech Communication Vol. 66, 130-153, 2015.
Zhizheng Wu, Tuomas Virtanen, Eng Siong Chng, Haizhou Li, Exemplar-based sparse representation with residual compensation for voice conversion, IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 22, 1506-1521, 2014.
For more academic works, please click on https://scholar.google.com/citations?user=K6zhweAAAAAJ&hl=en to view.