News and Announcements

"Statistics Lecture Hall" Seminar Series, Lecture 141

2021-01-05

Time: Thursday, January 7, 2021, 9:00 a.m.

Format: Tencent Meeting

(Meeting ID: 292 707 250)

Speaker: HaiYing Wang

Title: Maximum sampled conditional likelihood estimation for informative subsample


Subsampling is an effective approach to extracting useful information from massive data sets when computing resources are limited. Existing investigations focus on developing better sampling procedures and deriving sampling probabilities with higher estimation efficiency. After a subsample is taken from the full data, most available methods use an inverse probability weighted target function to define the estimator. This type of weighted estimator reduces the contributions of the more informative data points, and thus does not fully utilize the information in the selected subsample. This paper focuses on parameter estimation with a selected subsample and proposes the maximum sampled conditional likelihood estimator (MSCLE) based on the sampled data. We establish the asymptotic normality of the MSCLE and prove that its variance-covariance matrix attains the lower bound for asymptotically unbiased estimators. In particular, the MSCLE has a higher estimation efficiency than the weighted estimator. We further discuss the asymptotic results under the L-optimal subsampling probabilities and illustrate the estimation procedure with generalized linear models. Numerical experiments are provided to evaluate the practical performance of the proposed method.
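To make the contrast between the two estimators concrete, the following is a minimal sketch for logistic regression, not the speaker's implementation. It assumes Poisson subsampling with known inclusion probabilities pi(x, y) that do not depend on the unknown parameter. Under that assumption the sampled conditional probability of y = 1 is proportional to pi(x, 1) p(x; theta), so the sampled conditional likelihood is a logistic likelihood with the offset log(pi(x, 1) / pi(x, 0)). The sampling rule, the function names `sampling_prob` and `fit_logistic`, and all numerical settings are hypothetical choices made for illustration; in particular, the rule below is not the L-optimal probabilities mentioned in the abstract.

```python
import numpy as np

def fit_logistic(X, y, weights=None, offset=None, n_iter=50):
    """Newton-Raphson for logistic regression with optional weights and offset."""
    n, d = X.shape
    w = np.ones(n) if weights is None else weights
    off = np.zeros(n) if offset is None else offset
    theta = np.zeros(d)
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(X @ theta + off)))
        grad = X.T @ (w * (y - p))                      # weighted score
        hess = (X * (w * p * (1.0 - p))[:, None]).T @ X  # weighted information
        step = np.linalg.solve(hess, grad)
        theta = theta + step
        if np.max(np.abs(step)) < 1e-10:
            break
    return theta

def sampling_prob(X, y):
    """Hypothetical informative sampling rule: keep cases (y = 1) more often
    than controls, with the control rate also depending on the covariate."""
    base = np.where(y == 1, 0.4, 0.04 * (1.0 + np.abs(X[:, 1])))
    return np.minimum(base, 1.0)

# Simulate a full data set from a logistic model (illustrative settings).
rng = np.random.default_rng(0)
N = 100_000
theta_true = np.array([-2.0, 1.0, -1.0])
X = np.column_stack([np.ones(N), rng.standard_normal((N, 2))])
y = (rng.random(N) < 1.0 / (1.0 + np.exp(-X @ theta_true))).astype(float)

# Poisson subsampling: each point is kept independently with probability pi.
pi = sampling_prob(X, y)
keep = rng.random(N) < pi
Xs, ys, pis = X[keep], y[keep], pi[keep]

# Inverse probability weighted estimator: weight each sampled point by 1 / pi.
theta_ipw = fit_logistic(Xs, ys, weights=1.0 / pis)

# Sampled conditional likelihood: unweighted fit with offset log(pi1 / pi0).
pi1 = sampling_prob(Xs, np.ones(len(ys)))
pi0 = sampling_prob(Xs, np.zeros(len(ys)))
theta_mscle = fit_logistic(Xs, ys, offset=np.log(pi1 / pi0))

print("subsample size:", len(ys))
print("IPW estimate:  ", theta_ipw)
print("MSCLE estimate:", theta_mscle)
```

Both fits target the same full-data parameter. The difference is that the weighted fit corrects for the sampling through the weights, while the conditional-likelihood fit corrects for it inside the model, so the more informative points are not down-weighted. A single run like this only shows that both estimates land near the true value; comparing their efficiency would require repeating the simulation many times.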

HaiYing Wang is an Assistant Professor in the Department of Statistics at the University of Connecticut. He was an Assistant Professor in the Department of Mathematics and Statistics at the University of New Hampshire from 2013 to 2017. He obtained his Ph.D. from the Department of Statistics at the University of Missouri in 2013, and his M.S. from the Academy of Mathematics and Systems Science, Chinese Academy of Sciences in 2006. His research interests include informative subdata selection for big data, model selection, model averaging, measurement error models, and semi-parametric regression.


Key Research Base of Humanities and Social Sciences, Ministry of Education
