【数据科学名家讲坛】New Advances in Genetic Risk Prediction: Model Fine-tuning, Benchmarking, and Portability (Qiongshi Lu, Associate Professor, Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison)
主题:New Advances in Genetic Risk Prediction: Model Fine-tuning, Benchmarking, and Portability
报告人:Qiongshi Lu, Associate Professor, Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison
主持人:Jin LIU, Associate Professor, School of Data Science, CUHK-Shenzhen
日期:31 May (Wednesday), 2023
时间:11:00 AM to 12:00 PM, Beijing Time
形式:Hybrid
线下地点:103 Meeting Room, Daoyuan Building
语言:Chinese
摘要:
Genetic risk prediction is a major focus in human genetics research. Continued success in genome-wide association studies (GWAS) in the past decade has facilitated the development of polygenic risk scores (PRS) that aggregate the effects of millions of genetic variants for many complex traits. Compared to earlier methods requiring individual-level data for model training, PRS which only relies on marginal association data from GWAS is much more generally applicable due to the wide availability of summary statistics. However, several lingering challenges create a significant gap between PRS methodology and applications. In this talk, I will mainly discuss two issues involving model fine-tuning and cross-ancestry portability. First, most PRS models include tuning parameters which improve predictive performance when properly selected. However, existing model-tuning methods require individual-level genetic data independent from both training and testing samples. These data almost never exist in practice. We introduced an innovative statistical framework named PUMAS to conduct classic modeling tuning procedures (e.g., cross validation) using GWS summary statistics alone as input. Built on this framework, I will also introduce our recent effort to benchmark PRS models and perform ensemble learning without requiring external data for model fitting. Second, PRS trained from existing GWAS are known to have substantially reduced predictive accuracy in non-European populations, limiting its clinical utility and raising concerns about health disparities across ancestral populations. I will introduce our new approach named X-Wing designed to tackle this problem. X-Wing quantifies genetic effect correlation between populations, employs annotation-dependent statistical regularization, and combines population-specific PRS through ensemble learning, which leads to substantially improved performance compared to existing methods. I will give many examples to demonstrate the superior performance of these approaches. Taken together, we believe these new advances resolve several fundamental problems without a current solution and will shed important light on the future application of genetic prediction for human complex traits.
简介:
Dr. Qiongshi Lu received his B.S. in mathematics from Tsinghua University in 2012 and Ph.D. in biostatistics from Yale University in 2017. In 2017, he was appointed in the Department of Biostatistics and Medical Informatics at University of Wisconsin-Madison. Dr. Lu's research focuses on developing statistical and computational methods to study complex trait genetics. In particular, he is interested in noncoding genome annotation, genetic risk prediction, genetic correlation estimation, and gene-environment interaction.
