Projective Integral Updates for High-Dimensional Variational Inference

被引:0
|
作者
Duersch, Jed A. [1 ]
机构
[1] Sandia Natl Labs, Livermore, CA 94550 USA
来源
关键词
Key words. variational inference; Gaussian mean-field; Hessian approximation; quasi-Newton; spike-and-slab; quadrature; cubature; Hadamard basis; CUBATURE; QUADRATURE;
D O I
10.1137/22M1529919
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Variational inference is an approximation framework for Bayesian inference that seeks to improve quantified uncertainty in predictions by optimizing a simplified distribution over parameters to stand in for the full posterior. Capturing model variations that remain consistent with training data enables more robust predictions by reducing parameter sensitivity. This work introduces a fixedpoint optimization for variational inference that is applicable when every feasible log density can be expressed as a linear combination of functions from a given basis. In such cases, the optimizer becomes a fixed-point of projective integral updates. When the basis spans univariate quadratics in each parameter, the feasible distributions are Gaussian mean-fields and the projective integral updates yield quasi-Newton variational Bayes (QNVB). Other bases and updates are also possible. Since these updates require high-dimensional integration, this work begins by proposing an efficient quasirandom sequence of quadratures for mean-field distributions. Each iterate of the sequence contains two evaluation points that combine to correctly integrate all univariate quadratic functions and, if the mean-field factors are symmetric, all univariate cubics. More importantly, averaging results over short subsequences achieves periodic exactness on a much larger space of multivariate polynomials of quadratic total degree. The corresponding variational updates require four loss evaluations with standard (not second-order) backpropagation to eliminate error terms from over half of all multivariate quadratic basis functions. This integration technique is motivated by first proposing stochastic blocked mean-field quadratures, which may be useful in other contexts. A PyTorch implementation of QNVB allows for better control over model uncertainty during training than competing methods. Experiments demonstrate superior generalizability for multiple learning problems and architectures.
引用
收藏
页码:69 / 100
页数:32
相关论文
共 50 条
  • [11] Stratified Stochastic Variational Inference for High-Dimensional Network Factor Model
    Aliverti, Emanule
    Russo, Massimiliano
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2022, 31 (02) : 502 - 511
  • [12] Correction to : Variational inference and sparsity in high-dimensional deep Gaussian mixture models
    Lucas Kock
    Nadja Klein
    David J.Nott
    Statistics and Computing, 2023, 33
  • [13] On inference in high-dimensional regression
    Battey, Heather S.
    Reid, Nancy
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2023, 85 (01) : 149 - 175
  • [14] High-dimensional kNN joins with incremental updates
    Cui Yu
    Rui Zhang
    Yaochun Huang
    Hui Xiong
    GeoInformatica, 2010, 14 : 55 - 82
  • [15] High-dimensional kNN joins with incremental updates
    Yu, Cui
    Zhang, Rui
    Huang, Yaochun
    Xiong, Hui
    GEOINFORMATICA, 2010, 14 (01) : 55 - 82
  • [16] Inference in High-Dimensional Parameter Space
    O'Hare, Anthony
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2015, 22 (11) : 997 - 1004
  • [17] High-dimensional simultaneous inference with the bootstrap
    Dezeure, Ruben
    Buhlmann, Peter
    Zhang, Cun-Hui
    TEST, 2017, 26 (04) : 685 - 719
  • [18] ASYMPTOTIC INFERENCE FOR HIGH-DIMENSIONAL DATA
    Kuelbs, Jim
    Vidyashankar, Anand N.
    ANNALS OF STATISTICS, 2010, 38 (02): : 836 - 869
  • [19] High-dimensional Simultaneous Inference of Quantiles
    Lou, Zhipeng
    Wu, Wei Biao
    SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2025,
  • [20] High-dimensional simultaneous inference with the bootstrap
    Ruben Dezeure
    Peter Bühlmann
    Cun-Hui Zhang
    TEST, 2017, 26 : 685 - 719