An Analysis of the Interaction Between Transfer Learning Protocols in Deep Neural Networks

被引：1

作者：

Plested, Jo ^{[1
]}

Gedeon, Tom ^{[1
]}

机构：

[1] Australian Natl Univ, Res Sch Comp Sci, Canberra, ACT, Australia

来源：

NEURAL INFORMATION PROCESSING (ICONIP 2019), PT I | 2019年 / 11953卷

关键词：

Transfer learning; Convolutional neural networks;

D O I：

10.1007/978-3-030-36708-4_26

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We extend work on the transferability of features in deep neural networks to explore the interaction between training hyperparameters, optimal number of layers to transfer and the size of a target dataset. We show that using the commonly adopted transfer learning protocols results in increased overfitting and significantly decreased accuracy compared to optimal protocols, particularly for very small target datasets. We demonstrate that there is a relationship between fine-tuning hyperparameters used and the optimal number of layers to transfer. Our research shows that if this relationship is not taken into account, the optimal number of layers to transfer to the target dataset will likely be estimated incorrectly. Best practice transfer learning protocols cannot be predicted from existing research that has analysed transfer learning under very specific conditions that are not universally applicable. Extrapolating transfer learning training settings from previous findings can in fact be counterintuitive, particularly in the case of smaller datasets. We present optimal transfer learning protocols for various target dataset sizes from very small to large when source and target datasets and tasks are similar. Our results show that using these settings results in a large increase in accuracy when compared to commonly used transfer learning protocols. These results are most significant with very small target datasets. We observed an increase in accuracy of 47.8% on our smallest dataset which comprised of only 10 training examples per class. These findings are important as they are likely to improve outcomes from past, current and future research in transfer learning. We expect that researchers will want to re-examine their experiments to incorporate our findings and to check the robustness of their existing results.

引用

页码：312 / 323

页数：12

共 50 条

[41] Federated Learning for Medical Image Analysis with Deep Neural Networks
Nazir, Sajid
Kaleem, Mohammad
DIAGNOSTICS, 2023, 13 (09)
[42] Generalization Analysis of Pairwise Learning for Ranking With Deep Neural Networks
Huang, Shuo
Zhou, Junyu
Feng, Han
Zhou, Ding-Xuan
NEURAL COMPUTATION, 2023, 35 (06) : 1135 - 1158
[43] Online Deep Learning: Learning Deep Neural Networks on the Fly
Sahoo, Doyen
Pham, Quang
Lu, Jing
Hoi, Steven C. H.
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2660 - 2666
[44] Deep convolutional neural networks with transfer learning for automated brain image classification
Kaur, Taranjit
Gandhi, Tapan Kumar
MACHINE VISION AND APPLICATIONS, 2020, 31 (03)
[45] Auto-compression transfer learning methodology for deep convolutional neural networks
Camacho, J. D.
Villasenor, Carlos
Gomez-Avila, Javier
Lopez-Franco, Carlos
Arana-Daniel, Nancy
NEUROCOMPUTING, 2025, 630
[46] Rice leaf diseases prediction using deep neural networks with transfer learning
Krishnamoorthy, N.
Prasad, L. V. Narasimha
Kumar, C. S. Pavan
Subedi, Bharat
Abraha, Haftom Baraki
Sathishkumar, V. E.
ENVIRONMENTAL RESEARCH, 2021, 198
[47] Transfer Learning with Deep Recurrent Neural Networks for Remaining Useful Life Estimation
Zhang, Ansi
Wang, Honglei
Li, Shaobo
Cui, Yuxin
Liu, Zhonghao
Yang, Guanci
Hu, Jianjun
APPLIED SCIENCES-BASEL, 2018, 8 (12):
[48] Decision support from financial disclosures with deep neural networks and transfer learning
Kraus, Mathias
Feuerriegel, Stefan
DECISION SUPPORT SYSTEMS, 2017, 104 : 38 - 48
[49] Diabetic Retinopathy Recognition and Classification Using Transfer Learning Deep Neural Networks
Mane, Deepak
Ashtagi, Rashmi
Suryawanshi, Ranjeetsingh
Kaulage, Anant N.
Hedaoo, Anushka N.
Kulkarni, Prathamesh V.
Gandhi, Yatin
TRAITEMENT DU SIGNAL, 2024, 41 (05) : 2683 - 2691
[50] Sparse coding of pathology slides compared to transfer learning with deep neural networks
Will Fischer
Sanketh S. Moudgalya
Judith D. Cohn
Nga T. T. Nguyen
Garrett T. Kenyon
BMC Bioinformatics, 19

← 1 2 3 4 5 →