Zhizheng Wu

Associate Professor

Education

Ph.D., Nanyang Technological University
Master's degree, Nankai University
Bachelor's degree, Hangzhou Dianzi University

 

Research Areas
Speech interaction, speech generation, audio deepfake detection
Personal Website
Email
wuzhizheng@cuhk.edu.cn
Office
Room C715, Comprehensive Building (综合楼)
Biography

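Dr. Zhizheng Wu is an Associate Professor at The Chinese University of Hong Kong, Shenzhen, and Deputy Director of the Shenzhen Key Laboratory of Cross-Modal Cognitive Computing. He has been selected for a national-level young talent program, is a recipient of the Huawei Spark Award, has been named for multiple consecutive years to Stanford University's "World's Top 2% Scientists" list, and has received best paper awards. Prof. Wu received his Ph.D. from Nanyang Technological University and has held academic research and technical leadership positions at internationally renowned institutions, including Meta (formerly Facebook), Apple, the University of Edinburgh, and Microsoft Research Asia.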

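Prof. Wu has initiated several open-source projects, including Merlin, Amphion, and Emilia, which have been adopted by more than 700 organizations worldwide (including OpenAI). Among them, Amphion has topped the GitHub Trending list multiple times, and Emilia has become the most-liked audio dataset on Hugging Face. Prof. Wu also initiated the international challenges on speech anti-spoofing and voice conversion, and served as an organizer of the Blizzard Challenge 2019, the international speech synthesis evaluation.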

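Prof. Wu serves on the editorial boards of leading journals in speech and artificial intelligence, including IEEE/ACM TASLP and SPL, and is General Chair of the IEEE Spoken Language Technology Workshop 2024.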

Publications

Yuancheng Wang, Haoyue Zhan, Liwei Liu, Ruihong Zeng, Haotian Guo, Jiachen Zheng, Qiang Zhang, Xueyao Zhang, Shunsi Zhang, Zhizheng Wu, MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer, ICLR 2025

Yicheng Gu, Xueyao Zhang, Liumeng Xue, Haizhou Li, Zhizheng Wu, An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoders, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024

Junyi Ao*, Yuancheng Wang*, Xiaohai Tian, Dekun Chen, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu, SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words, NeurIPS 2024 (Data and Benchmark Track)

Yuancheng Wang, Zeqian Ju, Xu Tan, Lei He, Zhizheng Wu, Jiang Bian, Sheng Zhao, AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models, NeurIPS 2023

Yi Zhou, Zhizheng Wu, Xiaohai Tian, Haizhou Li, Optimization of Cross-Lingual Voice Conversion With Linguistics Losses to Reduce Foreign Accents, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023

Zhizheng Wu, Junichi Yamagishi, Tomi Kinnunen, Cemal Hanilçi, Mohammed Sahidullah, Aleksandr Sizov, Nicholas Evans, Massimiliano Todisco, Hector Delgado, ASVspoof: The Automatic Speaker Verification Spoofing and Countermeasures Challenge, IEEE Journal of Selected Topics in Signal Processing, Vol. 11, 588-604, 2017.

Zhizheng Wu, Oliver Watts, Simon King, Merlin: An Open Source Neural Network Speech Synthesis System, SSW, 202-207, 2016.

Zhizheng Wu, Cassia Valentini-Botinhao, Oliver Watts, Simon King, Deep Neural Networks Employing Multi-task Learning and Stacked Bottleneck Features for Speech Synthesis, ICASSP, 2015.

Zhizheng Wu, Nicholas Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, Haizhou Li, Spoofing and Countermeasures for Speaker Verification: A Survey, Speech Communication, Vol. 66, 130-153, 2015.

Zhizheng Wu, Tuomas Virtanen, Eng Siong Chng, Haizhou Li, Exemplar-Based Sparse Representation with Residual Compensation for Voice Conversion, IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 22, 1506-1521, 2014.

For more publications, see https://scholar.google.com/citations?user=K6zhweAAAAAJ&hl=en