共 50 条
- [31] Stochastic gradient descent tricks Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2012, 7700 LECTURE NO : 421 - 436
- [33] Byzantine Stochastic Gradient Descent ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [34] NOISE REDUCTION BY GRADIENT DESCENT INTERNATIONAL JOURNAL OF BIFURCATION AND CHAOS, 1993, 3 (01): : 113 - 118
- [35] Gradient Descent with Early Stopping is Provably Robust to Label Noise for Overparameterized Neural Networks INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 4313 - 4324
- [36] First Exit Time Analysis of Stochastic Gradient Descent Under Heavy-Tailed Gradient Noise ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [37] Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [40] Convergence of Stochastic Gradient Descent for PCA INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48