王趵翔

助理教授

教育背景

博士(香港中文大学)

学士(上海交通大学)

研究领域
强化学习,在线学习和学习理论
电子邮件
bxiangwang@cuhk.edu.cn
个人简介

王趵翔教授于2020年9月加入香港中文大学(深圳)数据科学学院任助理教授一职。 王教授于2014年在上海交通大学获信息安全专业工程学士学位;其后赴香港中文大学计算机科学与工程系深造,并于2020年获博士学位。

王趵翔教授的研究方向包括强化学习,在线学习和学习理论等,他的研究成果曾发表在国际学习表征会议、国际人工智能联合会议等知名国际会议上。就读博士期间,他曾在阿尔伯塔大学、纽约Cubist Systematic Strategies公司、普林斯顿西门子研究所、香港应用科技研究院等机构参与研究项目。

学术著作

1. Baoxiang Wang, Shuai Li, Jiajin Li, Siu On Chan (2020). The Gambler's Problem and Beyond, International Conference on Learning Representations. 

2. Andrej Bogdanov, Baoxiang Wang (2020).  Learning and Testing Variable Partitions
Innovations in Theoretical Computer Science. 

3. Baoxiang Wang, Nidhi Hegde (2019). Privacy-preserving Q-Learning with Functional Noise in Continuous Spaces, Advances in Neural Information Processing Systems. 
Nidhi has a blogpost on its implications to the bank. 

4. Baoxiang Wang (2019). Recurrent Existence Determination Through Policy Optimization, International Joint Conference on Artificial Intelligence. 

5. Kenny Young, Baoxiang Wang, Matthew E. Taylor (2019). Metatrace Actor-Critic: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control, International Joint Conference on Artificial Intelligence. 

6. Baoxiang Wang, Tongfang Sun, Xianjun Sam Zheng (2019). Beyond Winning and Losing: Modeling Human Motivations and Behaviors Using Inverse Reinforcement Learning, Artificial Intelligence and Interactive Digital Entertainment.

7. Jiajin Li, Baoxiang Wang (2018). Policy Optimization with Second-Order Advantage Information, International Joint Conference on Artificial Intelligence.

8. Shuai Li, Baoxiang Wang, Shengyu Zhang, Wei Chen (2016). Contextual Combinatorial Cascading Bandits, International Conference on Machine Learning.

9. Cuiyun Gao, Baoxiang Wang, Pinjia He, Jieming Zhu, Yangfan Zhou, Michael R. Lyu (2015). PAID: Prioritizing App Issues for Developers by Tracking User Reviews Over Versions, International Symposium on Software Reliability Engineering.