An Analysis of the Interaction Between Transfer Learning Protocols in Deep Neural Networks

被引:1
|
作者
Plested, Jo [1 ]
Gedeon, Tom [1 ]
机构
[1] Australian Natl Univ, Res Sch Comp Sci, Canberra, ACT, Australia
关键词
Transfer learning; Convolutional neural networks;
D O I
10.1007/978-3-030-36708-4_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We extend work on the transferability of features in deep neural networks to explore the interaction between training hyperparameters, optimal number of layers to transfer and the size of a target dataset. We show that using the commonly adopted transfer learning protocols results in increased overfitting and significantly decreased accuracy compared to optimal protocols, particularly for very small target datasets. We demonstrate that there is a relationship between fine-tuning hyperparameters used and the optimal number of layers to transfer. Our research shows that if this relationship is not taken into account, the optimal number of layers to transfer to the target dataset will likely be estimated incorrectly. Best practice transfer learning protocols cannot be predicted from existing research that has analysed transfer learning under very specific conditions that are not universally applicable. Extrapolating transfer learning training settings from previous findings can in fact be counterintuitive, particularly in the case of smaller datasets. We present optimal transfer learning protocols for various target dataset sizes from very small to large when source and target datasets and tasks are similar. Our results show that using these settings results in a large increase in accuracy when compared to commonly used transfer learning protocols. These results are most significant with very small target datasets. We observed an increase in accuracy of 47.8% on our smallest dataset which comprised of only 10 training examples per class. These findings are important as they are likely to improve outcomes from past, current and future research in transfer learning. We expect that researchers will want to re-examine their experiments to incorporate our findings and to check the robustness of their existing results.
引用
收藏
页码:312 / 323
页数:12
相关论文
共 50 条
  • [1] Deep Convolutional Neural Networks with Transfer Learning for Visual Sentiment Analysis
    Devi, K. Usha Kingsly
    Gomathi, V
    NEURAL PROCESSING LETTERS, 2023, 55 (04) : 5087 - 5120
  • [2] Deep Convolutional Neural Networks with Transfer Learning for Visual Sentiment Analysis
    K. Usha Kingsly Devi
    V. Gomathi
    Neural Processing Letters, 2023, 55 : 5087 - 5120
  • [3] A Deep Learning Framework for Automated Transfer Learning of Neural Networks
    Balaiah, Thanasekhar
    Jeyadoss, Timothy Jones Thomas
    Thirumurugan, Sainee
    Ravi, Rahul Chander
    2019 11TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC 2019), 2019, : 428 - 432
  • [4] DSNNs:learning transfer from deep neural networks to spiking neural networks
    张磊
    Du Zidong
    Li Ling
    Chen Yunji
    HighTechnologyLetters, 2020, 26 (02) : 136 - 144
  • [5] DSNNs: learning transfer from deep neural networks to spiking neural networks
    Zhang L.
    Du Z.
    Li L.
    Chen Y.
    High Technology Letters, 2020, 26 (02): : 136 - 144
  • [6] Transfer Learning for Clinical Time Series Analysis Using Deep Neural Networks
    Gupta, Priyanka
    Malhotra, Pankaj
    Narwariya, Jyoti
    Vig, Lovekesh
    Shroff, Gautam
    JOURNAL OF HEALTHCARE INFORMATICS RESEARCH, 2020, 4 (02) : 112 - 137
  • [7] Transfer Learning for Clinical Time Series Analysis Using Deep Neural Networks
    Priyanka Gupta
    Pankaj Malhotra
    Jyoti Narwariya
    Lovekesh Vig
    Gautam Shroff
    Journal of Healthcare Informatics Research, 2020, 4 : 112 - 137
  • [8] Transfer Learning on Deep Neural Networks to Detect Pornography
    Albahli, Saleh
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 43 (02): : 701 - 717
  • [9] Deep representation-based transfer learning for deep neural networks
    Yang, Tao
    Yu, Xia
    Ma, Ning
    Zhang, Yifu
    Li, Hongru
    KNOWLEDGE-BASED SYSTEMS, 2022, 253
  • [10] Deep Learning Neural Networks and Bayesian Neural Networks in Data Analysis
    Chernoded, Andrey
    Dudko, Lev
    Myagkov, Igor
    Volkov, Petr
    XXIII INTERNATIONAL WORKSHOP HIGH ENERGY PHYSICS AND QUANTUM FIELD THEORY (QFTHEP 2017), 2017, 158