Alpha divergence minimization in multi-class Gaussian process classification

被引:8
|
作者
Villacampa-Calvo, Carlos [1 ]
Hernandez-Lobato, Daniel [1 ]
机构
[1] Univ Autonoma Madrid, Comp Sci Dept, Escuela Politecn Super, C Francisco Tomas & Valiente 11, E-28049 Madrid, Spain
关键词
Gaussian processes; Expectation propagation; alpha-divergences; Approximate inference; Variational inference;
D O I
10.1016/j.neucom.2019.09.090
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper analyzes the minimization of alpha-divergences in the context of multi-class Gaussian process classification. For this task, several methods are explored, including memory and computationally efficient variants of the Power Expectation Propagation algorithm, which allow for efficient training using stochastic gradients and mini-batches. When these methods are used for training, very large datasets (several millions of instances) can be considered. The proposed methods are also very general as they can interpolate between other popular approaches for approximate inference based on Expectation Propagation (EP) (alpha -> 1) and Variational Bayes (VB) (alpha -> 0) simply by varying the alpha parameter. An exhaustive empirical evaluation analyzes the generalization properties of each of the proposed methods for different values of the alpha parameter. The results obtained show that one can do better than EP and VB by considering intermediate values of alpha. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:210 / 227
页数:18
相关论文
共 50 条
  • [21] Novel approach to multi-class classification
    Fang, Y
    Qi, FH
    JOURNAL OF INFRARED AND MILLIMETER WAVES, 2004, 23 (06) : 418 - 422
  • [22] An algebraic multi-class classification method
    He, Q
    Liu, ZY
    Shi, ZZ
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 3307 - 3312
  • [23] Credal ensembling in multi-class classification
    Nguyen, Vu-Linh
    Zhang, Haifei
    Destercke, Sebastien
    MACHINE LEARNING, 2025, 114 (01)
  • [24] Parzen windows for multi-class classification
    Pan, Zhi-Wei
    Xiang, Dao-Hong
    Xiao, Quan-Wu
    Zhou, Ding-Xuan
    JOURNAL OF COMPLEXITY, 2008, 24 (5-6) : 606 - 618
  • [25] On multi-class classification by way of niching
    McIntyre, AR
    Heywood, MI
    GENETIC AND EVOLUTIONARY COMPUTATION GECCO 2004 , PT 2, PROCEEDINGS, 2004, 3103 : 581 - 592
  • [26] Multi-class support vector machine based on the minimization of class variance
    Zhiqiang Zhang
    Zeqian Xu
    Junyan Tan
    Hui Zou
    Neural Processing Letters, 2021, 53 : 517 - 533
  • [27] Multi-class support vector machine based on the minimization of class variance
    Zhang, Zhiqiang
    Xu, Zeqian
    Tan, Junyan
    Zou, Hui
    NEURAL PROCESSING LETTERS, 2021, 53 (01) : 517 - 533
  • [28] Binary classification trees for multi-class classification problems
    Lee, JS
    Oh, LS
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 770 - 774
  • [29] Graph based multi-class semi-supervised learning using Gaussian process
    Song, Yangqiu
    Zhang, Changshui
    Lee, Jianguo
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2006, 4109 : 450 - 458
  • [30] Boosting with Adaptive Sampling for Multi-class Classification
    Chen, Jianhua
    2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, : 667 - 672