FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation

被引:0
|
作者
Turkoglu, Mehmet Ozgur [1 ]
Becker, Alexander [1 ]
Guenduez, Hueseyin Anil [2 ]
Rezaei, Mina [2 ]
Bischl, Bernd [2 ]
Daudt, Rodrigo Caye [1 ]
D'Aronco, Stefano [1 ]
Wegner, Jan Dirk [1 ,3 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] Ludwig Maximilians Univ Munchen, Munich, Germany
[3] Univ Zurich, Zurich, Switzerland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ability to estimate epistemic uncertainty is often crucial when deploying machine learning in the real world, but modern methods often produce overconfident, uncalibrated uncertainty predictions. A common approach to quantify epistemic uncertainty, usable across a wide class of prediction models, is to train a model ensemble. In a naive implementation, the ensemble approach has high computational cost and high memory demand. This challenges in particular modern deep learning, where even a single deep network is already demanding in terms of compute and memory, and has given rise to a number of attempts to emulate the model ensemble without actually instantiating separate ensemble members. We introduce FiLM-Ensemble, a deep, implicit ensemble method based on the concept of Feature-wise Linear Modulation (FiLM). That technique was originally developed for multi-task learning, with the aim of decoupling different tasks. We show that the idea can be extended to uncertainty quantification: by modulating the network activations of a single deep network with FiLM, one obtains a model ensemble with high diversity, and consequently well-calibrated estimates of epistemic uncertainty, with low computational overhead in comparison. Empirically, FiLM-Ensemble outperforms other implicit ensemble methods, and it comes very close to the upper bound of an explicit ensemble of networks (sometimes even beating it), at a fraction of the memory cost.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] GNN-FiLM: Graph Neural Networks with Feature-wise Linear Modulation
    Brockschmidt, Marc
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [2] GNN-FiLM: Graph Neural Networks with Feature-wise Linear Modulation
    Brockschmidt, Marc
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [3] Temporal FiLM: Capturing Long-Range Sequence Dependencies with Feature-Wise Modulation
    Birnbaum, Sawyer
    Kuleshov, Volodymyr
    Enam, S. Zayd
    Koh, Pang Wei
    Ermon, Stefano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [4] Communication-Efficient Split Learning via Adaptive Feature-Wise Compression
    Oh, Yongjeong
    Lee, Jaeho
    Brinton, Christopher G.
    Jeon, Yo-Seb
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025,
  • [5] Improved Birthweight Prediction With Feature-Wise Linear Modulation, GRU, and Attention Mechanism in Ultrasound Data
    Priya, G. Mohana
    Sangeetha, S. K. B.
    JOURNAL OF ULTRASOUND IN MEDICINE, 2025, 44 (04) : 711 - 725
  • [6] Efficient Deweather Mixture-of-Experts with Uncertainty-Aware Feature-Wise Linear Modulation
    Zhang, Rongyu
    Luo, Yulin
    Liu, Jiaming
    Yang, Huanrui
    Dong, Zhen
    Gudovskiy, Denis
    Okuno, Tomoyuki
    Nakata, Yohei
    Keutzer, Kurt
    Du, Yuan
    Zhang, Shanghang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16812 - 16820
  • [7] Feature-wise attention based boosting ensemble method for fraud detection
    Cao, Ruihao
    Wang, Junli
    Mao, Mingze
    Liu, Guanjun
    Jiang, Changjun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [8] IMPROVING DEEP LEARNING SOUND EVENTS CLASSIFIERS USING GRAM MATRIX FEATURE-WISE CORRELATIONS
    Neto, Antonio Joia
    Pacheco, Andre G. C.
    Luvizon, Diogo Carbonera
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3780 - 3784
  • [9] Large capacity generative image steganography via image style transfer and feature-wise deep fusion
    Youqiang Sun
    Jianyi Liu
    Ru Zhang
    Applied Intelligence, 2023, 53 : 28675 - 28693
  • [10] Large capacity generative image steganography via image style transfer and feature-wise deep fusion
    Sun, Youqiang
    Liu, Jianyi
    Zhang, Ru
    APPLIED INTELLIGENCE, 2023, 53 (23) : 28675 - 28693