FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation

被引:0
|
作者
Turkoglu, Mehmet Ozgur [1 ]
Becker, Alexander [1 ]
Guenduez, Hueseyin Anil [2 ]
Rezaei, Mina [2 ]
Bischl, Bernd [2 ]
Daudt, Rodrigo Caye [1 ]
D'Aronco, Stefano [1 ]
Wegner, Jan Dirk [1 ,3 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] Ludwig Maximilians Univ Munchen, Munich, Germany
[3] Univ Zurich, Zurich, Switzerland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ability to estimate epistemic uncertainty is often crucial when deploying machine learning in the real world, but modern methods often produce overconfident, uncalibrated uncertainty predictions. A common approach to quantify epistemic uncertainty, usable across a wide class of prediction models, is to train a model ensemble. In a naive implementation, the ensemble approach has high computational cost and high memory demand. This challenges in particular modern deep learning, where even a single deep network is already demanding in terms of compute and memory, and has given rise to a number of attempts to emulate the model ensemble without actually instantiating separate ensemble members. We introduce FiLM-Ensemble, a deep, implicit ensemble method based on the concept of Feature-wise Linear Modulation (FiLM). That technique was originally developed for multi-task learning, with the aim of decoupling different tasks. We show that the idea can be extended to uncertainty quantification: by modulating the network activations of a single deep network with FiLM, one obtains a model ensemble with high diversity, and consequently well-calibrated estimates of epistemic uncertainty, with low computational overhead in comparison. Empirically, FiLM-Ensemble outperforms other implicit ensemble methods, and it comes very close to the upper bound of an explicit ensemble of networks (sometimes even beating it), at a fraction of the memory cost.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Real-Time Style Modelling of Human Locomotion via Feature-Wise Transformations and Local Motion Phases
    Mason, Ian
    Starke, Sebastian
    Komura, Taku
    PROCEEDINGS OF THE ACM ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES, 2022, 5 (01)
  • [22] An Ensemble of Deep Learning Architectures for Automatic Feature Extraction
    Shaheen, Fatma
    Verma, Brijesh
    PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [23] Clustering Ensemble via Cluster-wise Optimization Graph Learning
    Zhang, Huan
    Du, Liang
    IEEE INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN SYSTEMS SCIENCE AND ENGINEERING (IEEE RASSE 2021), 2021,
  • [24] LEARNING IMAGE SIMILARITIES VIA PROBABILISTIC FEATURE MATCHING
    Zhang, Ziming
    Li, Ze-Nian
    Drew, Mark S.
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 1857 - 1860
  • [25] Learning region-wise deep feature representation for image analysis
    Zhu X.
    Wang Q.
    Li P.
    Zhang X.-Y.
    Wang L.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (11) : 14775 - 14784
  • [26] Boosting Verification of Deep Reinforcement Learning via Piece-wise Linear Decision Neural Networks
    Tian, Jiaxu
    Zhi, Dapeng
    Liu, Si
    Wang, Peixin
    Chen, Cheng
    Zhang, Min
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [27] Integration of deep feature extraction and ensemble learning for outlier detection
    Chakraborty, Debasrita
    Narayanan, Vaasudev
    Ghosh, Ashish
    PATTERN RECOGNITION, 2019, 89 : 161 - 171
  • [28] Adversarial defence by learning differentiated feature representation in deep ensemble
    Chen, Xi
    Wei, Huang
    Guo, Wei
    Zhang, Fan
    Du, Jiayu
    Zhou, Zhizhong
    MACHINE VISION AND APPLICATIONS, 2024, 35 (04)
  • [29] Joint Structural Learning to Rank with Deep Linear Feature Learning
    Zhao, Xueyi
    Li, Xi
    Zhang, Zhongfei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (10) : 2756 - 2769
  • [30] Deep ensemble learning based probabilistic load forecasting in smart grids
    Yang, Yandong
    Hong, Weijun
    Li, Shufang
    ENERGY, 2019, 189