Large scale multi-output multi-class classification using Gaussian processes

被引:0
|
作者
Chunchao Ma
Mauricio A. Álvarez
机构
[1] University of Sheffield,Department of Computer Science
[2] University of Manchester,Department of Computer Science
来源
Machine Learning | 2023年 / 112卷
关键词
Gaussian processes; Multi-output Gaussian processes; Image data; Classification; Transfer learning;
D O I
暂无
中图分类号
学科分类号
摘要
Multi-output Gaussian processes (MOGPs) can help to improve predictive performance for some output variables, by leveraging the correlation with other output variables. In this paper, our main motivation is to use multiple-output Gaussian processes to exploit correlations between outputs where each output is a multi-class classification problem. MOGPs have been mostly used for multi-output regression. There are some existing works that use MOGPs for other types of outputs, e.g., multi-output binary classification. However, MOGPs for multi-class classification has been less studied. The reason is twofold: 1) when using a softmax function, it is not clear how to scale it beyond the case of a few outputs; 2) most common type of data in multi-class classification problems consists of image data, and MOGPs are not specifically designed to image data. We thus propose a new MOGPs model called Multi-output Gaussian Processes with Augment & Reduce (MOGPs-AR) that can deal with large scale classification and downsized image input data. Large scale classification is achieved by subsampling both training data sets and classes in each output whereas downsized image input data is handled by incorporating a convolutional kernel into the new model. We show empirically that our proposed model outperforms single-output Gaussian processes in terms of different performance metrics and multi-output Gaussian processes in terms of scalability, both in synthetic and in real classification problems. We include an example with the Ommiglot dataset where we showcase the properties of our model.
引用
收藏
页码:1077 / 1106
页数:29
相关论文
共 50 条
  • [31] Convolved Multi-output Gaussian Processes for Semi-Supervised Learning
    Vargas Cardona, Hernan Dario
    Alvarez, Mauricio A.
    Orozco, Alvaro A.
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2015, PT I, 2015, 9279 : 109 - 118
  • [32] Nonstationary multi-output Gaussian processes via harmonizable spectral mixtures
    Altamirano, Matias
    Tobar, Felipe
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [33] Non-linear process convolutions for multi-output Gaussian processes
    Alvarez, Mauricio A.
    Ward, Wil O. C.
    Guarnizo, Cristian
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [34] Near-Optimal Active Learning of Multi-Output Gaussian Processes
    Zhang, Yehong
    Trong Nghia Hoang
    Low, Kian Hsiang
    Kankanhalli, Mohan
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2351 - 2357
  • [35] Alpha divergence minimization in multi-class Gaussian process classification
    Villacampa-Calvo, Carlos
    Hernandez-Lobato, Daniel
    NEUROCOMPUTING, 2020, 378 : 210 - 227
  • [36] Hierarchical Learning for Large Multi-class Network Classification
    Liu, Lei
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2307 - 2312
  • [37] Binary Stochastic Representations for Large Multi-class Classification
    Gerald, Thomas
    Baskiotis, Nicolas
    Denoyer, Ludovic
    NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 155 - 165
  • [38] Steganographic domain classification using multi-class
    Xu Bo
    Wang Jiazhen
    Liu Xiaqin
    Yang Sumin
    ISTM/2007: 7TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-7, CONFERENCE PROCEEDINGS, 2007, : 1270 - 1273
  • [39] Detecting steganography using multi-class classification
    Rodriguez, Benjamin
    Peterson, Gilbert
    ADVANCES IN DIGITAL FORENSIC III, 2007, 242 : 193 - +
  • [40] Multi-class classification using a signomial function
    Hwang, Kyoungmi
    Lee, Kyungsik
    Lee, Chungmok
    Park, Sungsoo
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2015, 66 (03) : 434 - 449