Bayesian Kernel Two-Sample Testing

被引:7
|
作者
Zhang, Qinyi [1 ]
Wild, Veit [1 ]
Filippi, Sarah [2 ]
Flaxman, Seth [3 ]
Sejdinovic, Dino [1 ]
机构
[1] Univ Oxford, Dept Stat, Oxford, England
[2] Imperial Coll London, Dept Math, London, England
[3] Univ Oxford, Dept Comp Sci, Oxford, England
基金
英国工程与自然科学研究理事会;
关键词
Bayes factor: Hypothesis testing; Kernel mean embeddings;
D O I
10.1080/10618600.2022.2067547
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In modern data analysis, nonparametric measures of discrepancies between random variables are particularly important. The subject is well-studied in the frequentist literature, while the development in the Bayesian setting is limited where applications are often restricted to univariate cases. Here, we propose a Bayesian kernel two-sample testing procedure based on modeling the difference between kernel mean embeddings in the reproducing kernel Hilbert space using the framework established by Flaxman et al. The use of kernel methods enables its application to random variables in generic domains beyond the multivariate Euclidean spaces. The proposed procedure results in a posterior inference scheme that allows an automatic selection of the kernel parameters relevant to the problem at hand. In a series of synthetic experiments and two real data experiments (i.e., testing network heterogeneity from high-dimensional data and six-membered monocyclic ring conformation comparison), we illustrate the advantages of our approach. Supplementary materials for this article are available online.
引用
收藏
页码:1164 / 1176
页数:13
相关论文
共 50 条
  • [41] Two-Sample Testing for Event Impacts in Time Series
    Scharwaechter, Erik
    Mueller, Emmanuel
    PROCEEDINGS OF THE 2020 SIAM INTERNATIONAL CONFERENCE ON DATA MINING (SDM), 2020, : 10 - 18
  • [42] TWO-SAMPLE HYPOTHESIS TESTING FOR INHOMOGENEOUS RANDOM GRAPHS
    Ghoshdastidar, Debarghya
    Gutzeit, Maurilio
    Carpentier, Alexandra
    von Luxburg, Ulrike
    ANNALS OF STATISTICS, 2020, 48 (04): : 2208 - 2229
  • [43] Two-Sample Statistical Testing for Weighted Data Sets
    Bour, Petr
    SPSM 2017: STOCHASTIC AND PHYSICAL MONITORING SYSTEMS, 2017, : 1 - 9
  • [44] Meta Two-Sample Testing: Learning Kernels for Testing with Limited Data
    Liu, Feng
    Xu, Wenkai
    Lu, Jie
    Sutherland, Danica J.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [45] Scalable kernel two-sample tests via empirical likelihood and jackknife
    Wen, Qian
    Yuan, Mingao
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (12) : 5975 - 5990
  • [46] Network Traffic Fingerprinting Based on Approximated Kernel Two-Sample Test
    Kohout, Jan
    Pevny, Tomas
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2018, 13 (03) : 788 - 801
  • [47] Bayesian models for two-sample time-course microarray experiments
    Angelini, Claudia
    De Canditiis, Daniela
    Pensky, Marianna
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (05) : 1547 - 1565
  • [48] Two-sample t α -test for testing hypotheses in small-sample experiments
    Tan, Yuan-De
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2023, 19 (01): : 1 - 19
  • [49] Union–intersection permutation solution for two-sample equivalence testing
    Fortunato Pesarin
    Luigi Salmaso
    Eleonora Carrozzo
    Rosa Arboretti
    Statistics and Computing, 2016, 26 : 693 - 701
  • [50] Addressing maximization bias in reinforcement learning with two-sample testing
    Waltz, Martin
    Okhrin, Ostap
    ARTIFICIAL INTELLIGENCE, 2024, 336