To understand double descent, we need to understand VC theory

Cited by: 3
Authors
Cherkassky, Vladimir [1 ]
Lee, Eng Hock [1 ]
Affiliation
[1] Univ Minnesota Twin Cities, Dept Elect & Comp Engn, Minneapolis, MN 55455 USA
Funding
U.S. National Science Foundation
Keywords
Double descent; Deep learning; Complexity control; Structural risk minimization; VC-dimension; VC-generalization bounds; Neural networks; Bounds; Dimension
DOI
10.1016/j.neunet.2023.10.014
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
We analyze the generalization performance of over-parameterized learning methods for classification under the VC theoretical framework. Recently, practitioners in Deep Learning discovered the 'double descent' phenomenon, in which large networks fit the available training data perfectly and, at the same time, achieve good generalization on future (test) data. The current consensus view is that VC-theoretical results cannot account for the good generalization performance of Deep Learning networks. In contrast, this paper shows that double descent can be explained by VC-theoretical concepts, such as VC-dimension and Structural Risk Minimization. We also present empirical results showing that double descent generalization curves can be accurately modeled using classical VC-generalization bounds. The proposed VC-theoretical analysis enables a better understanding of generalization curves for data sets with different statistical characteristics, such as low- vs. high-dimensional data and noisy data. In addition, we analyze the generalization performance of transfer learning using pre-trained Deep Learning networks.
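The abstract refers to modeling double descent curves with "classical VC-generalization bounds". The sketch below is only a minimal illustration of the standard Vapnik-style bound for classification, not the authors' fitting procedure: with probability at least 1 - eta, the expected risk is bounded by the empirical risk plus a confidence term that depends on the VC-dimension h and the sample size n. The helper names (vc_confidence_term, vc_bound) and the toy training-error profile are hypothetical.

import numpy as np

# Classical Vapnik-style VC bound for classification (0/1 loss): with
# probability at least 1 - eta,
#   R <= R_emp + sqrt( (h*(ln(2n/h) + 1) - ln(eta/4)) / n )
# where h is the VC-dimension and n is the number of training samples.

def vc_confidence_term(h, n, eta=0.05):
    # Confidence interval of the classical VC bound (illustrative helper).
    return np.sqrt((h * (np.log(2.0 * n / h) + 1.0) - np.log(eta / 4.0)) / n)

def vc_bound(emp_risk, h, n, eta=0.05):
    # Upper bound on expected risk = empirical risk + VC confidence term.
    return emp_risk + vc_confidence_term(h, n, eta)

# Toy generalization curve: a made-up training-error profile that reaches
# zero as complexity h grows, after which the bound is driven entirely by
# the confidence term.
n = 1000
for h in (10, 50, 200, 800):
    emp_risk = max(0.0, 0.2 - 0.0005 * h)   # hypothetical training error
    print(f"h={h:4d}  R_emp={emp_risk:.3f}  VC bound={vc_bound(emp_risk, h, n):.3f}")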
Pages: 242-256 (15 pages)