Challenges and Opportunities in High-dimensional Variational Inference

被引:0
|
作者
Dhaka, Akash Kumar [1 ,2 ]
Catalina, Alejandro [1 ]
Welandawe, Manushi [3 ]
Andersen, Michael Riis [4 ]
Huggins, Jonathan H. [3 ]
Vehtari, Aki [1 ]
机构
[1] Aalto Univ, Espoo, Finland
[2] Silo AI, Helsinki, Finland
[3] Boston Univ, Boston, MA 02215 USA
[4] Tech Univ Denmark, Lyngby, Denmark
基金
芬兰科学院;
关键词
APPROXIMATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current black-box variational inference (BBVI) methods require the user to make numerous design choices-such as the selection of variational objective and approximating family-yet there is little principled guidance on how to do so. We develop a conceptual framework and set of experimental tools to understand the effects of these choices, which we leverage to propose best practices for maximizing posterior approximation accuracy. Our approach is based on studying the pre-asymptotic tail behavior of the density ratios between the joint distribution and the variational approximation, then exploiting insights and tools from the importance sampling literature. Our framework and supporting experiments help to distinguish between the behavior of BBVI methods for approximating low-dimensional versus moderate-to-high-dimensional posteriors. In the latter case, we show that mass-covering variational objectives are difficult to optimize and do not improve accuracy, but flexible variational families can improve accuracy and the effectiveness of importance sampling-at the cost of additional optimization challenges. Therefore, for moderate-to-high-dimensional posteriors we recommend using the (mode-seeking) exclusive KL divergence since it is the easiest to optimize, and improving the variational family or using model parameter transformations to make the posterior and optimal variational approximation more similar. On the other hand, in low-dimensional settings, we show that heavy-tailed variational families and mass-covering divergences are effective and can increase the chances that the approximation can be improved by importance sampling.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] High-dimensional rank-based inference
    Kong, Xiaoli
    Harrar, Solomon W.
    JOURNAL OF NONPARAMETRIC STATISTICS, 2020, 32 (02) : 294 - 322
  • [42] On statistical inference with high-dimensional sparse CCA
    Laha, Nilanjana
    Huey, Nathan
    Coull, Brent
    Mukherjee, Rajarshi
    INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2023, 12 (04)
  • [43] Inference for high-dimensional instrumental variables regression
    Gold, David
    Lederer, Johannes
    Tao, Jing
    JOURNAL OF ECONOMETRICS, 2020, 217 (01) : 79 - 111
  • [44] Lasso inference for high-dimensional time series
    Adamek, Robert
    Smeekes, Stephan
    Wilms, Ines
    JOURNAL OF ECONOMETRICS, 2023, 235 (02) : 1114 - 1143
  • [45] Universal Features for High-Dimensional Learning and Inference
    Huang, Shao-Lun
    Makur, Anuran
    Wornell, Gregory W.
    Zheng, Lizhong
    FOUNDATIONS AND TRENDS IN COMMUNICATIONS AND INFORMATION THEORY, 2024, 21 (1-2): : 1 - 299
  • [46] Simultaneous inference for high-dimensional time series
    Shumway, RH
    DIMENSION REDUCTION, COMPUTATIONAL COMPLEXITY AND INFORMATION, 1998, 30 : 110 - 110
  • [47] Markov Neighborhood Regression for High-Dimensional Inference
    Liang, Faming
    Xue, Jingnan
    Jia, Bochao
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539) : 1200 - 1214
  • [48] Inference in High-Dimensional Online Changepoint Detection
    Chen, Yudong
    Wang, Tengyao
    Samworth, Richard J.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (546) : 1461 - 1472
  • [49] High-dimensional IV cointegration estimation and inference☆
    Phillips, Peter C. B.
    Kheifets, Igor L.
    JOURNAL OF ECONOMETRICS, 2024, 238 (02)
  • [50] Rejoinder on: High-dimensional simultaneous inference with the bootstrap
    Ruben Dezeure
    Peter Bühlmann
    Cun-Hui Zhang
    TEST, 2017, 26 : 751 - 758