【数据科学名家讲坛】On the Value Functions in Reinforcement Learning for Ridesharing - With an Introduction to Operations Research in Industry (Zhiwei (Tony) QIN, Principal Scientist, Lyft)
主题:On the Value Functions in Reinforcement Learning for Ridesharing - With an Introduction to Operations Research in Industry
报告人:Zhiwei (Tony) QIN, Principal Scientist, Lyft
主持人:Jim DAI, Dean, School of Data Science, CUHK-Shenzhen
日期:August 29 (Tuesday), 2023
时间:16:00 - 17:00, Beijing Time
形式:Offline
地点:205 Meeting Room, ABE
语言:English
摘要:
We will first introduce the applications of operations research in a few key industry domains (e.g., ridesharing, supply chain, delivery), describing the real-world problems and potential OR solutions. We will then turn our attention to sequential decision problems, to which reinforcement learning is a core family of methodologies.
A focal point of reinforcement learning for ridesharing is the value function learning. In this talk, we will recapitulate the RL methods based on offline learning, and then talk about recently deployed works in the domain that shift toward online on-policy updates. We will discuss reward representation in this two-sided marketplace and demonstrate how shared value functions can be adopted to coordinate multiple rideshare levers. Finally, we will discuss learning the value functions for individual market participating units (both supply and demand) while making sure that they collectively approximate the system values well.
简介:
Zhiwei (Tony) Qin is Principal Scientist at Lyft, working on core problems in ridesharing marketplace optimization. Previously, he was Principal Research Scientist and Director of the Decision Intelligence group at DiDi AI Labs and Staff Scientist in supply chain and inventory optimization at Walmart Global E-commerce. Tony received his Ph.D. in Operations Research from Columbia University. His research interests span optimization and machine learning, with a particular focus in reinforcement learning and its applications in operational optimization, digital marketing, and smart transportation. He is Associate Editor of the ACM Journal on Autonomous Transportation Systems. He has published more than 40 papers in top-tier conferences and journals in machine learning and optimization. He has served as Area Chair/Senior PC of KDD, AAAI, and ECML-PKDD, and a referee of top journals. He is an INFORMS Franz Edelman Award Finalist and Laureate in 2023, received the INFORMS Daniel H. Wagner Prize for Excellence in Operations Research Practice in 2019 and was selected for the NeurIPS 2018 Best Demo Awards. Tony holds more than 10 US patents in intelligent transportation, supply chain, and recommendation systems.