Bayesian Kernel Two-Sample Testing

被引:7
|
作者
Zhang, Qinyi [1 ]
Wild, Veit [1 ]
Filippi, Sarah [2 ]
Flaxman, Seth [3 ]
Sejdinovic, Dino [1 ]
机构
[1] Univ Oxford, Dept Stat, Oxford, England
[2] Imperial Coll London, Dept Math, London, England
[3] Univ Oxford, Dept Comp Sci, Oxford, England
基金
英国工程与自然科学研究理事会;
关键词
Bayes factor: Hypothesis testing; Kernel mean embeddings;
D O I
10.1080/10618600.2022.2067547
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In modern data analysis, nonparametric measures of discrepancies between random variables are particularly important. The subject is well-studied in the frequentist literature, while the development in the Bayesian setting is limited where applications are often restricted to univariate cases. Here, we propose a Bayesian kernel two-sample testing procedure based on modeling the difference between kernel mean embeddings in the reproducing kernel Hilbert space using the framework established by Flaxman et al. The use of kernel methods enables its application to random variables in generic domains beyond the multivariate Euclidean spaces. The proposed procedure results in a posterior inference scheme that allows an automatic selection of the kernel parameters relevant to the problem at hand. In a series of synthetic experiments and two real data experiments (i.e., testing network heterogeneity from high-dimensional data and six-membered monocyclic ring conformation comparison), we illustrate the advantages of our approach. Supplementary materials for this article are available online.
引用
收藏
页码:1164 / 1176
页数:13
相关论文
共 50 条
  • [21] Nonparametric Two-Sample Testing by Betting
    Shekhar, Shubhanshu
    Ramdas, Aaditya
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2024, 70 (02) : 1178 - 1203
  • [22] Two-Sample Test with Kernel Projected Wasserstein Distance
    Wang, Jie
    Gao, Rui
    Xie, Yao
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [23] A permutation-free kernel two-sample test
    Shekhar, Shubhanshu
    Kim, Ilmun
    Ramdas, Aaditya
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [24] CLASSIFICATION ACCURACY AS A PROXY FOR TWO-SAMPLE TESTING
    Kim, Ilmun
    Ramdas, Aaditya
    Singh, Aarti
    Wasserman, Larry
    ANNALS OF STATISTICS, 2021, 49 (01): : 411 - 434
  • [25] Two-sample testing with local community depth
    Evans, Ciaran
    Berenhaut, Kenneth S.
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
  • [26] Practical Methods for Graph Two-Sample Testing
    Ghoshdastidar, Debarghya
    von Luxburg, Ulrike
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [27] Two-sample Testing Using Deep Learning
    Kirchler, Matthias
    Khorasani, Shahryar
    Kloft, Marius
    Lippert, Christoph
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 1387 - 1397
  • [28] On the use of random forest for two-sample testing
    Hediger, Simon
    Michel, Loris
    Naef, Jeffrey
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2022, 170
  • [29] Sequential Predictive Two-Sample and Independence Testing
    Podkopaev, Aleksandr
    Ramdas, Aaditya
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [30] Testing Homogeneity in a Semiparametric Two-Sample Problem
    Yukun Liu
    Pengfei Li
    Yuejiao Fu
    JOURNAL OF PROBABILITY AND STATISTICS, 2012, 2012