“统计大讲堂”第205讲预告:无监督多任务中的高斯混合模型迁移学习
2023-01-02
报告时间:2023年1月5日
上午10:00-11:00
报告地点:腾讯会议
(会议ID:770-749-297)
报告嘉宾:Yang Feng
报告主题:Unsupervised Multi-task and Transfer Learning on Gaussian Mixture Models
报告摘要
Unsupervised Multi-task and Transfer Learning on Gaussian Mixture Models
Unsupervised learning has been widely used in many real-world applications. One of the simplest and most important unsupervised learning models is the Gaussian mixture model (GMM). In this work, we study the multi-task learning problem on GMMs, which aims to leverage potentially similar GMM parameter structures among tasks to obtain improved learning performance compared to single-task learning. We propose a multi-task GMM learning procedure based on the EM algorithm that not only can effectively utilize unknown similarities between related tasks but is also robust against a fraction of outlier tasks from arbitrary sources. The proposed procedure is shown to achieve the minimax optimal rate of convergence for both parameter estimation error and the excess mis-clustering error, in a wide range of regimes. Moreover, we generalize our approach to tackle the problem of transfer learning for GMMs, where similar theoretical results are derived. Finally, we implement the algorithms in a new R package mtlgmm and demonstrate the effectiveness of our methods through simulations and real data analysis. To the best of our knowledge, this is the first work studying multi-task and transfer learning on GMMs with theoretical guarantees.
个人简介
Yang Feng is an Associate Professor of Biostatistics at New York University. He got his Ph.D. in Operations Research from Princeton University in 2010. His research interests include machine learning, high-dimensional statistics, social networks, nonparametric statistics, and bioinformatics. Dr. Feng has published over 50 articles in leading statistics and machine learning journals. He is currently an associate editor of the Journal of the American Statistical Association, Journal of Business and Economic Statistics, Statistica Sinica, and Stat. He is a fellow of the American Statistical Association and an elected member of the International Statistical Institute. His research has been supported by multiple NSF and NIH grants.