FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation

被引:0
|
作者
Turkoglu, Mehmet Ozgur [1 ]
Becker, Alexander [1 ]
Guenduez, Hueseyin Anil [2 ]
Rezaei, Mina [2 ]
Bischl, Bernd [2 ]
Daudt, Rodrigo Caye [1 ]
D'Aronco, Stefano [1 ]
Wegner, Jan Dirk [1 ,3 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] Ludwig Maximilians Univ Munchen, Munich, Germany
[3] Univ Zurich, Zurich, Switzerland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ability to estimate epistemic uncertainty is often crucial when deploying machine learning in the real world, but modern methods often produce overconfident, uncalibrated uncertainty predictions. A common approach to quantify epistemic uncertainty, usable across a wide class of prediction models, is to train a model ensemble. In a naive implementation, the ensemble approach has high computational cost and high memory demand. This challenges in particular modern deep learning, where even a single deep network is already demanding in terms of compute and memory, and has given rise to a number of attempts to emulate the model ensemble without actually instantiating separate ensemble members. We introduce FiLM-Ensemble, a deep, implicit ensemble method based on the concept of Feature-wise Linear Modulation (FiLM). That technique was originally developed for multi-task learning, with the aim of decoupling different tasks. We show that the idea can be extended to uncertainty quantification: by modulating the network activations of a single deep network with FiLM, one obtains a model ensemble with high diversity, and consequently well-calibrated estimates of epistemic uncertainty, with low computational overhead in comparison. Empirically, FiLM-Ensemble outperforms other implicit ensemble methods, and it comes very close to the upper bound of an explicit ensemble of networks (sometimes even beating it), at a fraction of the memory cost.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Introducing Technical Indicators to Electricity Price Forecasting: A Feature Engineering Study for Linear, Ensemble, and Deep Machine Learning Models
    Demir, Sumeyra
    Mincev, Krystof
    Kok, Koen
    Paterakis, Nikolaos G.
    APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [42] Understanding Deep Contrastive Learning via Coordinate-wise Optimization
    Tian, Yuandong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [43] Personalized Federated Learning with Layer-Wise Feature Transformation via Meta-Learning
    Tu, Jingke
    Huang, Jiaming
    Yang, Lei
    Lin, Wanyu
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (04)
  • [44] Unsupervised feature learning and automatic modulation classification using deep learning model
    Ali, Afan
    Fan Yangyu
    PHYSICAL COMMUNICATION, 2017, 25 : 75 - 84
  • [45] Efficient Deep Feature Learning and Extraction via StochasticNets
    Shafiee, Mohammad Javad
    Siva, Parthipan
    Fieguth, Paul
    Wong, Alexander
    PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 1101 - 1109
  • [46] Early Fault Detection via Multiple Feature Fusion and Ensemble Learning
    Song, Wenbin
    Wu, Di
    Shen, Weiming
    Boulet, Benoit
    IEEE SENSORS JOURNAL, 2024, 24 (05) : 7196 - 7204
  • [47] Feature Extraction for Hyperspectral Imagery via Ensemble Localized Manifold Learning
    Li, Fan
    Xu, Linlin
    Wong, Alexander
    Clausi, David A.
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2015, 12 (12) : 2486 - 2490
  • [48] Characterization of Residential Electricity Customers via Deep Ensemble Learning
    Lin, Weixuan
    Wu, Di
    ARTIFICIAL INTELLIGENCE FOR KNOWLEDGE MANAGEMENT, ENERGY, AND SUSTAINABILITY, 2022, 637 : 75 - 86
  • [49] Probabilistic medical image imputation via deep adversarial learning
    Ragheb Raad
    Dhruv Patel
    Chiao-Chih Hsu
    Vijay Kothapalli
    Deep Ray
    Bino Varghese
    Darryl Hwang
    Inderbir Gill
    Vinay Duddalwar
    Assad A. Oberai
    Engineering with Computers, 2022, 38 : 3975 - 3986
  • [50] Active Learning for Deep Object Detection via Probabilistic Modeling
    Choi, Jiwoong
    Elezi, Ismail
    Lee, Hyuk-Jae
    Farabet, Clement
    Alvarez, Jose M.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10244 - 10253