Bayesian Kernel Two-Sample Testing

被引:7
|
作者
Zhang, Qinyi [1 ]
Wild, Veit [1 ]
Filippi, Sarah [2 ]
Flaxman, Seth [3 ]
Sejdinovic, Dino [1 ]
机构
[1] Univ Oxford, Dept Stat, Oxford, England
[2] Imperial Coll London, Dept Math, London, England
[3] Univ Oxford, Dept Comp Sci, Oxford, England
基金
英国工程与自然科学研究理事会;
关键词
Bayes factor: Hypothesis testing; Kernel mean embeddings;
D O I
10.1080/10618600.2022.2067547
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In modern data analysis, nonparametric measures of discrepancies between random variables are particularly important. The subject is well-studied in the frequentist literature, while the development in the Bayesian setting is limited where applications are often restricted to univariate cases. Here, we propose a Bayesian kernel two-sample testing procedure based on modeling the difference between kernel mean embeddings in the reproducing kernel Hilbert space using the framework established by Flaxman et al. The use of kernel methods enables its application to random variables in generic domains beyond the multivariate Euclidean spaces. The proposed procedure results in a posterior inference scheme that allows an automatic selection of the kernel parameters relevant to the problem at hand. In a series of synthetic experiments and two real data experiments (i.e., testing network heterogeneity from high-dimensional data and six-membered monocyclic ring conformation comparison), we illustrate the advantages of our approach. Supplementary materials for this article are available online.
引用
收藏
页码:1164 / 1176
页数:13
相关论文
共 50 条
  • [1] Two-sample Bayesian Nonparametric Hypothesis Testing
    Holmes, Chris C.
    Caron, Francois
    Griffin, Jim E.
    Stephens, David A.
    BAYESIAN ANALYSIS, 2015, 10 (02): : 297 - 320
  • [2] Bayesian multiple testing for two-sample multivariate endpoints
    Gönen, M
    Westfall, PH
    Johnson, WO
    BIOMETRICS, 2003, 59 (01) : 76 - 82
  • [3] A Kernel Two-Sample Test
    Gretton, Arthur
    Borgwardt, Karsten M.
    Rasch, Malte J.
    Schoelkopf, Bernhard
    Smola, Alexander
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 723 - 773
  • [4] A Bayesian Motivated Two-Sample Test Based on Kernel Density Estimates
    Merchant, Naveed
    Hart, Jeffrey D.
    ENTROPY, 2022, 24 (08)
  • [5] Generalized kernel two-sample tests
    Song, Hoseung
    Chen, Hao
    BIOMETRIKA, 2024, 111 (03) : 755 - 770
  • [6] A Simple Two-Sample Bayesian t-Test for Hypothesis Testing
    Wang, Min
    Liu, Guangying
    AMERICAN STATISTICIAN, 2016, 70 (02): : 195 - 201
  • [7] The Bayesian two-sample t test
    Gönen, M
    Johnson, WO
    Lu, YG
    Westfall, PH
    AMERICAN STATISTICIAN, 2005, 59 (03): : 252 - 257
  • [8] A Kernel Two-Sample Test for Functional Data
    Wynne, George
    Duncan, Andrew B.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [9] A Differentially Private Kernel Two-Sample Test
    Raj, Anant
    Law, Ho Chung Leon
    Sejdinovic, Dino
    Park, Mijung
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT I, 2020, 11906 : 697 - 724
  • [10] Boosting the power of kernel two-sample tests
    Chatterjee, A.
    Bhattacharya, B. B.
    BIOMETRIKA, 2024,