MONK - Outlier-Robust Mean Embedding Estimation by Median-of-Means

被引:0
|
作者
Lerasle, Matthieu [1 ,2 ]
Szabo, Zoltan [3 ]
Mathieu, Timothee [1 ]
Lecue, Guillaume [4 ]
机构
[1] Univ Paris Sud, Lab Math Orsay, Paris, France
[2] Univ Paris Saclay, CNRS, Paris, France
[3] Ecole Polytech, CMAP, Palaiseau, France
[4] CREST ENSAE ParisTech, Paris, France
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97 | 2019年 / 97卷
关键词
KERNELS; METRICS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mean embeddings provide an extremely flexible and powerful tool in machine learning and statistics to represent probability distributions and define a semi-metric (MMD, maximum mean discrepancy; also called N-distance or energy distance), with numerous successful applications. The representation is constructed as the expectation of the feature map defined by a kernel. As a mean, its classical empirical estimator, however, can be arbitrary severely affected even by a single outlier in case of unbounded features. To the best of our knowledge, unfortunately even the consistency of the existing few techniques trying to alleviate this serious sensitivity bottleneck is unknown. In this paper, we show how the recently emerged principle of median-of-means can be used to design estimators for kernel mean embedding and MMD with excessive resistance properties to outliers, and optimal sub-Gaussian deviation bounds under mild assumptions.
引用
收藏
页数:12
相关论文
共 41 条
  • [31] Enhancing microseismic/acoustic emission source localization accuracy with an outlier-robust kernel density estimation approach
    Chen, Jie
    Huang, Huiqiong
    Rui, Yichao
    Pu, Yuanyuan
    Zhang, Sheng
    Li, Zheng
    Wang, Wenzhong
    INTERNATIONAL JOURNAL OF MINING SCIENCE AND TECHNOLOGY, 2024, 34 (07) : 943 - 956
  • [32] Outlier-Robust Truncated Maximum Likelihood Parameter Estimation of Compound-Gaussian Clutter with Inverse Gaussian Texture
    Tian, Chao
    Shui, Peng-Lang
    REMOTE SENSING, 2022, 14 (16)
  • [33] Outlier-robust tri-percentile parameter estimation of compound-Gaussian clutter with lognormal distributed texture
    Feng, Tian
    Shui, Peng-Lang
    DIGITAL SIGNAL PROCESSING, 2022, 120
  • [34] Quantum Entropy Scoring for Fast Robust Mean Estimation and Improved Outlier Detection
    Dong, Yihe
    Hopkins, Samuel B.
    Li, Jerry
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [35] Outlier-robust parameters estimation for compound-Gaussian Clutter using inverse gamma texture based on truncated moments
    Xu, Shuwen
    Wang, Le
    Zhang, Fuquan
    Shui, Penglang
    REMOTE SENSING LETTERS, 2019, 10 (03) : 274 - 282
  • [36] Iterative Maximum Likelihood and Outlier-robust Bipercentile Estimation of Parameters of Compound-Gaussian Clutter With Inverse Gaussian Texture
    Shui, Peng-Lang
    Shi, Li-Xiang
    Yu, Han
    Huang, Yu-Ting
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (11) : 1572 - 1576
  • [37] Robust Mean Estimation in High Dimensions: An Outlier-Fraction Agnostic and Efficient Algorithm
    Deshmukh, Aditya
    Liu, Jing
    Veeravalli, Venugopal V.
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, 69 (07) : 4675 - 4690
  • [38] Outlier-robust Tri-percentile Parameter Estimation Method of Compound-Gaussian Clutter with Inverse Gaussian Textures br
    Shui, Penglang
    Tian, Chao
    Feng, Tian
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (02) : 542 - 549
  • [39] GRNN-Based Outlier-Robust Parameter Estimation of Compound-Gaussian Sea Clutter With Generalized Inverse Gaussian Textures
    Zou, Peng-Jia
    Zhao, Zi-Jian
    He, Zhen-Ge
    Shui, Peng-Lang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [40] Outlier-robust estimation of a sparse linear mode using l1-penalized Huber's M-estimator
    Dalalyan, Arnak S.
    Thompson, Philip
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32