李丹

教授
副院长(本科)

教育背景

香港中文大学博士
香港中文大学硕士
香港中文大学学士

学术领域
计算机科学, 机器学习与人工智能, 语音与自然语言处理
研究领域
语音信号处理、言语技术、语言及副语言分析、沟通障碍辅助技术
电子邮箱
tanlee@cuhk.edu.cn
办公室
道远楼 413
个人简介

李丹,香港中文大学电子工程博士,长期从事语音和语言相关的研究。曾领导开发针对粤语的口语语言技术,在工业界获得广泛应用。李丹最近的工作着重跨学科的深入实质性合作,涉及领域包括语言学、教育学、心理学和医学。他致力于将信号处理和机器学习技术应用于人类交流的各种现实场景中。李丹参与开发的ACEHearing产品曾获得2011年亚洲创新奖铜奖。他于2023年创办 Vocofy AI,这是一家用人工智能技术帮助有言语障碍的人进行言语交流的社会企业。李丹曾担任 IEEE/ACM 音频、语音和语言处理学报和 EURASIP 信号处理期刊的副主编。他曾任ISCA中文口语处理专业小组副主席,INTERSPEECH 2014、2016和2018技术项目委员会子领域主席。

李丹曾担任香港中文大学工程学院专管教育的副院长(2021-2024),以及专管学生事务的副院长(2011-2014)。他曾获得香港中文大学校长模范教学奖,并连续多年获得工程学院模范教学奖。李丹曾任清华大学计算机系姚班客席教授,北京大学 Globex 暑期课程客席教授。李丹曾负责香港资优教育学院优秀中学生的教学和指导工作。他于2022年至2025年期间担任香港中文大学善衡书院学生辅导长。

学术著作

1.    Si Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang, and Tan Lee, “Automatic detection of speech sound disorder in Cantonese-speaking pre-school children,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 4355-4368, 2024.

2.    Yusheng Tian, Jingyu Li, and Tan Lee, “Creating personalized synthetic voices from articulation impaired speech using augmented reconstruction loss”, Proceedings of ICASSP 2024, pp.11501-11505.

3.    Dehua Tao, Tan Lee, Harold Chui, and Sarah Luk, “Modelling intrapersonal and interpersonal influences for automatic estimation of therapist empathy in counselling conversation,” Proceedings of ICASSP 2024, pp.12692-12696.

4.    Guangyan Zhang, Ying Qin, Wenjie Zhang, Jialun Wu, Mei Li, Yutao Gai, Feijun Jiang, and Tan Lee, “iEmoTTS: Toward robust cross-speaker emotion transfer and control for speech synthesis based on disentanglement between prosody and timbre,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.31, pp.1693-1705, 2023.

5.    Jonathan Him Nok Lee, Eddie S. K. Chong, Harold Chui, Tan Lee, Sarah Luk, Dehua Tao, and Nicolette Wing Tung Lee, “A curvilinear association between therapists’ use of discourse particles and therapist empathy in psychotherapy,” Journal of Counselling Psychology, 70(5), 562-570, July 2023.

6.    Si-Ioi Ng, Rui-Si Ma, Tan Lee and Raymond Kim-Wai Sum, “Acoustical analysis of speech under physical stress in relation to physical activities and physical literacy,” Proceedings of Speech Prosody 2022, pp.200-204, Lisbon, Portugal, May 23-26, 2022.

7.    Shuiyang Mao, P. C. Ching and Tan Lee, “Enhancing segment-based speech emotion recognition by iterative self-learning,” IEEE/ACM Trans. on Audio, Speech, and Language Processing, vol. 30, pp.123-134, 2022.

8.    Matthew King-Hang Ma, Manson Cheuk-Man Fong, Chenwei Xie, Tan Lee, Guanrong Chen and William Shiyuan Wang, “Regularity and randomness in ageing: Differences in resting-state EEG complexity measured by largest Lyapunov exponent,” Neuroimage: Reports, December 2021.

9.    Xurong Xie, Xunying Liu, Tan Lee and Lan Wang, “Bayesian learning for deep neural network adaptation,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 29, pp. 2096-2110, 2021.

10.  Daxin Tan and Tan Lee, “Fine-grained style modeling, transfer and prediction in text-to-speech synthesis via phone-level content-style disentanglement,” Proceedings of INTERSPEECH 2021, pp.4683-4687.

11.  Y. Qin, Tan Lee and Anthony P. H. Kong, “Automatic assessment of speech impairment in Cantonese-speaking people with aphasia,” IEEE Journal of Selected Topic in Signal Processing, Vol. 14, No. 2, pp. 331-345, February 2020.

12.  Si-Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang, Tan Lee, Kathy Yuet-Sheung Lee, and Michael Chi-Fai Tong, “CUCHILD: A large-scale Cantonese corpus of child speech for phonology and articulation assessment,” Proceedings of INTERSPEECH 2020, pp.424-428, Shanghai, October 2020.

13.  Yuzhong Wu and Tan Lee, “Enhancing sound texture in CNN-based acoustic scene classification,” Proceedings of ICASSP 2019, pp.815-819, Brighton, May 2019.

14.  Xurong Xie, Xunying Liu, Tan Lee, Shoukang Hu, and Lan Wang, “BLHUC: Bayesian learning of hidden unit contributions for deep neural network speaker adaptation,” Proceedings of ICASSP 2019, pp.5711-5715, Brighton, May 2019. [Best Student Paper]