Bayesian Kernel Two-Sample Testing

被引：7

作者：

Zhang, Qinyi ^{[1
]}

Wild, Veit ^{[1
]}

Filippi, Sarah ^{[2
]}

Flaxman, Seth ^{[3
]}

Sejdinovic, Dino ^{[1
]}

机构：

[1] Univ Oxford, Dept Stat, Oxford, England

[2] Imperial Coll London, Dept Math, London, England

[3] Univ Oxford, Dept Comp Sci, Oxford, England

来源：

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS | 2022年 / 31卷 / 04期

基金：

英国工程与自然科学研究理事会;

关键词：

Bayes factor: Hypothesis testing; Kernel mean embeddings;

D O I：

10.1080/10618600.2022.2067547

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

In modern data analysis, nonparametric measures of discrepancies between random variables are particularly important. The subject is well-studied in the frequentist literature, while the development in the Bayesian setting is limited where applications are often restricted to univariate cases. Here, we propose a Bayesian kernel two-sample testing procedure based on modeling the difference between kernel mean embeddings in the reproducing kernel Hilbert space using the framework established by Flaxman et al. The use of kernel methods enables its application to random variables in generic domains beyond the multivariate Euclidean spaces. The proposed procedure results in a posterior inference scheme that allows an automatic selection of the kernel parameters relevant to the problem at hand. In a series of synthetic experiments and two real data experiments (i.e., testing network heterogeneity from high-dimensional data and six-membered monocyclic ring conformation comparison), we illustrate the advantages of our approach. Supplementary materials for this article are available online.

引用

页码：1164 / 1176

页数：13

共 50 条

[21] Nonparametric Two-Sample Testing by Betting
Shekhar, Shubhanshu
Ramdas, Aaditya
IEEE TRANSACTIONS ON INFORMATION THEORY, 2024, 70 (02) : 1178 - 1203
[22] Two-Sample Test with Kernel Projected Wasserstein Distance
Wang, Jie
Gao, Rui
Xie, Yao
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
[23] A permutation-free kernel two-sample test
Shekhar, Shubhanshu
Kim, Ilmun
Ramdas, Aaditya
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[24] CLASSIFICATION ACCURACY AS A PROXY FOR TWO-SAMPLE TESTING
Kim, Ilmun
Ramdas, Aaditya
Singh, Aarti
Wasserman, Larry
ANNALS OF STATISTICS, 2021, 49 (01): : 411 - 434
[25] Two-sample testing with local community depth
Evans, Ciaran
Berenhaut, Kenneth S.
INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
[26] Practical Methods for Graph Two-Sample Testing
Ghoshdastidar, Debarghya
von Luxburg, Ulrike
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[27] Two-sample Testing Using Deep Learning
Kirchler, Matthias
Khorasani, Shahryar
Kloft, Marius
Lippert, Christoph
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 1387 - 1397
[28] On the use of random forest for two-sample testing
Hediger, Simon
Michel, Loris
Naef, Jeffrey
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2022, 170
[29] Sequential Predictive Two-Sample and Independence Testing
Podkopaev, Aleksandr
Ramdas, Aaditya
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[30] Testing Homogeneity in a Semiparametric Two-Sample Problem
Yukun Liu
Pengfei Li
Yuejiao Fu
JOURNAL OF PROBABILITY AND STATISTICS, 2012, 2012

← 1 2 3 4 5 →