新闻公告

“统计大讲堂”第211讲预告：过宽神经网络在一维数据上的泛化能力

2023-03-29

报告时间：2023年3月31日

上午10:00-11:00

报告地点：中国人民大学明德主楼1037

（腾讯会议ID：549 125 631）

报告嘉宾：林乾

报告主题：Generalization ability of wide neural networks on R

报告摘要

Generalization ability of wide neural networks on R

We perform a study on the generalization ability of the wide two-layer ReLU neural network on R. We first establish some spectral properties of the neural tangent kernel (NTK): a) , the NTK defined on Rd , is positive definite; b) λi (K1) , the i-th largest eigenvalue of K1, is proportional to i-2. We then show that: i) when the width m→∞, the neural network kernel (NNK) uniformly converges to the NTK; ii) the minimax rate of regression over the RKHS associated to K1 is n-2/3; iii) if one adopts the early stopping strategy in training a wide neural network, the resulting neural network achieves the minimax rate; iv) if one trains the neural network till it overfits the data, the resulting neural network can not generalize well. Finally, we provide an explanation to reconcile our theory and the widely observed “benign overfitting phenomenon”.

个人简介

林乾，清华大学统计学研究中心副教授, 2010年在麻省理工数学系获得博士学位。2017年8月至今在清华大学任教。从事高维充分性降维，深度学习的数理基础等问题的研究。

“统计大讲堂”第209讲预告：Causal Effects and Posterior Causal Effects

“统计大讲堂”第212讲预告：多重门限变平面模型：估计理论及其在子族识别中的应用

教育部人文社会科学重点研究基地

新闻公告

新闻公告

“统计大讲堂”第211讲预告：过宽神经网络在一维数据上的泛化能力

2023-03-29

上一篇

下一篇