An Analysis of the Interaction Between Transfer Learning Protocols in Deep Neural Networks

被引:1
|
作者
Plested, Jo [1 ]
Gedeon, Tom [1 ]
机构
[1] Australian Natl Univ, Res Sch Comp Sci, Canberra, ACT, Australia
关键词
Transfer learning; Convolutional neural networks;
D O I
10.1007/978-3-030-36708-4_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We extend work on the transferability of features in deep neural networks to explore the interaction between training hyperparameters, optimal number of layers to transfer and the size of a target dataset. We show that using the commonly adopted transfer learning protocols results in increased overfitting and significantly decreased accuracy compared to optimal protocols, particularly for very small target datasets. We demonstrate that there is a relationship between fine-tuning hyperparameters used and the optimal number of layers to transfer. Our research shows that if this relationship is not taken into account, the optimal number of layers to transfer to the target dataset will likely be estimated incorrectly. Best practice transfer learning protocols cannot be predicted from existing research that has analysed transfer learning under very specific conditions that are not universally applicable. Extrapolating transfer learning training settings from previous findings can in fact be counterintuitive, particularly in the case of smaller datasets. We present optimal transfer learning protocols for various target dataset sizes from very small to large when source and target datasets and tasks are similar. Our results show that using these settings results in a large increase in accuracy when compared to commonly used transfer learning protocols. These results are most significant with very small target datasets. We observed an increase in accuracy of 47.8% on our smallest dataset which comprised of only 10 training examples per class. These findings are important as they are likely to improve outcomes from past, current and future research in transfer learning. We expect that researchers will want to re-examine their experiments to incorporate our findings and to check the robustness of their existing results.
引用
收藏
页码:312 / 323
页数:12
相关论文
共 50 条
  • [41] Federated Learning for Medical Image Analysis with Deep Neural Networks
    Nazir, Sajid
    Kaleem, Mohammad
    DIAGNOSTICS, 2023, 13 (09)
  • [42] Generalization Analysis of Pairwise Learning for Ranking With Deep Neural Networks
    Huang, Shuo
    Zhou, Junyu
    Feng, Han
    Zhou, Ding-Xuan
    NEURAL COMPUTATION, 2023, 35 (06) : 1135 - 1158
  • [43] Online Deep Learning: Learning Deep Neural Networks on the Fly
    Sahoo, Doyen
    Pham, Quang
    Lu, Jing
    Hoi, Steven C. H.
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2660 - 2666
  • [44] Deep convolutional neural networks with transfer learning for automated brain image classification
    Kaur, Taranjit
    Gandhi, Tapan Kumar
    MACHINE VISION AND APPLICATIONS, 2020, 31 (03)
  • [45] Auto-compression transfer learning methodology for deep convolutional neural networks
    Camacho, J. D.
    Villasenor, Carlos
    Gomez-Avila, Javier
    Lopez-Franco, Carlos
    Arana-Daniel, Nancy
    NEUROCOMPUTING, 2025, 630
  • [46] Rice leaf diseases prediction using deep neural networks with transfer learning
    Krishnamoorthy, N.
    Prasad, L. V. Narasimha
    Kumar, C. S. Pavan
    Subedi, Bharat
    Abraha, Haftom Baraki
    Sathishkumar, V. E.
    ENVIRONMENTAL RESEARCH, 2021, 198
  • [47] Transfer Learning with Deep Recurrent Neural Networks for Remaining Useful Life Estimation
    Zhang, Ansi
    Wang, Honglei
    Li, Shaobo
    Cui, Yuxin
    Liu, Zhonghao
    Yang, Guanci
    Hu, Jianjun
    APPLIED SCIENCES-BASEL, 2018, 8 (12):
  • [48] Decision support from financial disclosures with deep neural networks and transfer learning
    Kraus, Mathias
    Feuerriegel, Stefan
    DECISION SUPPORT SYSTEMS, 2017, 104 : 38 - 48
  • [49] Diabetic Retinopathy Recognition and Classification Using Transfer Learning Deep Neural Networks
    Mane, Deepak
    Ashtagi, Rashmi
    Suryawanshi, Ranjeetsingh
    Kaulage, Anant N.
    Hedaoo, Anushka N.
    Kulkarni, Prathamesh V.
    Gandhi, Yatin
    TRAITEMENT DU SIGNAL, 2024, 41 (05) : 2683 - 2691
  • [50] Sparse coding of pathology slides compared to transfer learning with deep neural networks
    Will Fischer
    Sanketh S. Moudgalya
    Judith D. Cohn
    Nga T. T. Nguyen
    Garrett T. Kenyon
    BMC Bioinformatics, 19