LEARNING GAUSSIAN PROCESSES WITH BAYESIAN POSTERIOR OPTIMIZATION

被引:0
|
作者
Chamon, Luiz F. O. [1 ]
Patemain, Santiago [1 ]
Ribeiro, Alejandro [1 ]
机构
[1] Univ Penn, Elect & Syst Engn, Philadelphia, PA 19104 USA
关键词
D O I
10.1109/ieeeconf44664.2019.9048819
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gaussian processes (GPs) are often used as prior distributions in non-parametric Bayesian methods due to their numerical and analytical tractability. GP priors are specified by choosing a covariance function (along with its hyperparameters), a choice that is not only challenging in practice, but also has a profound impact on performance. This issue is typically overcome using hierarchical models, i.e., by learning a distribution over covariance functions/hyperparameters that defines a mixture of GPs. Yet, since choosing priors for hyperparameters can be challenging, maximum likelihood is often used instead to obtain point estimates. This approach, however, involves solving a non-convex optimization problem and is thus prone to overfitting. To address these issues, this work proposes a hybrid Bayesian-optimization solution in which the hyperparameters posterior distribution is obtained not using Bayes rule, but as the solution of a mathematical program. Explicitly, we obtain the hyperparameter distribution that minimizes a risk measure induced by the GP mixture. Previous knowledge, including properties such as sparsity and maximum entropy, is incorporated through (possibly non-convex) penalties instead of a prior. We prove that despite its infinite dimensionality and potential non-convexity, this problem can be solved exactly using duality and stochastic optimization.
引用
收藏
页码:482 / 486
页数:5
相关论文
共 50 条
  • [41] Using Gaussian Processes in Bayesian Robot Programming
    Aznar, Fidel
    Pujol, Francisco A.
    Pujol, Mar
    Rizo, Ramon
    DISTRIBUTED COMPUTING, ARTIFICIAL INTELLIGENCE, BIOINFORMATICS, SOFT COMPUTING, AND AMBIENT ASSISTED LIVING, PT II, PROCEEDINGS, 2009, 5518 : 547 - +
  • [42] Scalable Nonparametric Bayesian Inference on Point Processes with Gaussian Processes
    Samo, Yves-Laurent Kom
    Roberts, Stephen
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 2227 - 2236
  • [43] BAYESIAN DECONVOLUTION OF BERNOULLI-GAUSSIAN PROCESSES
    LAVIELLE, M
    SIGNAL PROCESSING, 1993, 33 (01) : 67 - 79
  • [44] Bayesian Gaussian Processes for Identifying the Deteriorating Patient
    Colopy, Glen Wright
    Pimentel, Marco A. F.
    Roberts, Stephen J.
    Clifton, David A.
    2016 38TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2016, : 5311 - 5314
  • [45] Robustness Guarantees for Bayesian Inference with Gaussian Processes
    Cardelli, Luca
    Kwiatkowska, Marta
    Laurenti, Luca
    Patane, Andrea
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 7759 - 7768
  • [46] Bayesian Hyperparameter Estimation using Gaussian Process and Bayesian Optimization
    Katakami, Shun
    Sakamoto, Hirotaka
    Okada, Masato
    JOURNAL OF THE PHYSICAL SOCIETY OF JAPAN, 2019, 88 (07)
  • [47] Optimization of nonlinear, non-Gaussian Bayesian filtering for diagnosis and prognosis of monotonic degradation processes
    Corbetta, Matteo
    Sbarufatti, Claudio
    Giglio, Marco
    Todd, Michael D.
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2018, 104 : 305 - 322
  • [48] Bayesian Active Learning for Posterior Estimation
    Kandasamy, Kirthevasan
    Schneider, Jeff
    Poczos, Barnabas
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3605 - 3611
  • [49] Physics makes the difference: Bayesian optimization and active learning via augmented Gaussian process
    Ziatdinov, Maxim A.
    Ghosh, Ayana
    Kalinin, Sergei, V
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (01):
  • [50] Learning curves for Gaussian processes
    Sollich, P
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 344 - 350