DeepMoM: Robust Deep Learning With Median-of-Means

被引:1
|
作者
Huang, Shih-Ting [1 ]
Lederer, Johannes [1 ]
机构
[1] Ruhr Univ, Dept Math, Bochum, Germany
关键词
Deep learning; Median-of-means; Robust estimator;
D O I
10.1080/10618600.2022.2090947
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Data used in deep learning is notoriously problematic. For example, data are usually combined from diverse sources, rarely cleaned and vetted thoroughly, and sometimes corrupted on purpose. Intentional corruption that targets the weak spots of algorithms has been studied extensively under the label of "adversarial attacks." In contrast, the arguably much more common case of corruption that reflects the limited quality of data has been studied much less. Such "random" corruptions are due to measurement errors, unreliable sources, convenience sampling, and so forth. These kinds of corruption are common in deep learning, because data are rarely collected according to strict protocols-in strong contrast to the formalized data collection in some parts of classical statistics. This article concerns such corruption. We introduce an approach motivated by very recent insights into median-of-means and Le Cam's principle, we show that the approach can be readily implemented, and we demonstrate that it performs very well in practice. In conclusion, we believe that our approach is a very promising alternative to standard parameter training based on least-squares and cross-entropy loss.
引用
收藏
页码:181 / 195
页数:15
相关论文
共 50 条
  • [1] ROBUST MACHINE LEARNING BY MEDIAN-OF-MEANS: THEORY AND PRACTICE
    Lecue, Guillaume
    Lerasle, Matthieu
    ANNALS OF STATISTICS, 2020, 48 (02): : 906 - 931
  • [2] Robust Clustered Federated Learning with Bootstrap Median-of-Means
    Xie, Ming
    Ma, Jie
    Long, Guodong
    Zhang, Chengqi
    WEB AND BIG DATA, PT I, APWEB-WAIM 2022, 2023, 13421 : 237 - 250
  • [3] Robust Kernel Density Estimation with Median-of-Means principle
    Humbert, Pierre
    Le Bars, Batiste
    Minvielle, Ludovic
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [4] Efficient and Robust Median-of-Means Algorithms for Location and Regression
    Kogler, Alexander
    Traxler, Patrick
    PROCEEDINGS OF 2016 18TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC), 2016, : 206 - 213
  • [5] MONK - Outlier-Robust Mean Embedding Estimation by Median-of-Means
    Lerasle, Matthieu
    Szabo, Zoltan
    Mathieu, Timothee
    Lecue, Guillaume
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [6] Risk minimization by median-of-means tournaments
    Lugosi, Gabor
    Mendelson, Shahar
    JOURNAL OF THE EUROPEAN MATHEMATICAL SOCIETY, 2020, 22 (03) : 925 - 965
  • [7] Variance Reduced Median-of-Means Estimator for Byzantine-Robust Distributed Inference
    Tu, Jiyuan
    Liu, Weidong
    Mao, Xiaojun
    Chen, Xi
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [8] Variance reduced median-of-means estimator for byzantine-robust distributed inference
    Tu, Jiyuan
    Liu, Weidong
    Mao, Xiaojun
    Chen, Xi
    Journal of Machine Learning Research, 2021, 22
  • [9] Regularization, sparse recovery, and median-of-means tournaments
    Lugosi, Gabor
    Mendelson, Shahar
    BERNOULLI, 2019, 25 (03) : 2075 - 2106
  • [10] Median-of-means approach for repeated measures data
    Zhang, Yangchun
    Liu, Pengfei
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2021, 50 (17) : 3903 - 3912