An Analysis of the Interaction Between Transfer Learning Protocols in Deep Neural Networks

被引:1
|
作者
Plested, Jo [1 ]
Gedeon, Tom [1 ]
机构
[1] Australian Natl Univ, Res Sch Comp Sci, Canberra, ACT, Australia
关键词
Transfer learning; Convolutional neural networks;
D O I
10.1007/978-3-030-36708-4_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We extend work on the transferability of features in deep neural networks to explore the interaction between training hyperparameters, optimal number of layers to transfer and the size of a target dataset. We show that using the commonly adopted transfer learning protocols results in increased overfitting and significantly decreased accuracy compared to optimal protocols, particularly for very small target datasets. We demonstrate that there is a relationship between fine-tuning hyperparameters used and the optimal number of layers to transfer. Our research shows that if this relationship is not taken into account, the optimal number of layers to transfer to the target dataset will likely be estimated incorrectly. Best practice transfer learning protocols cannot be predicted from existing research that has analysed transfer learning under very specific conditions that are not universally applicable. Extrapolating transfer learning training settings from previous findings can in fact be counterintuitive, particularly in the case of smaller datasets. We present optimal transfer learning protocols for various target dataset sizes from very small to large when source and target datasets and tasks are similar. Our results show that using these settings results in a large increase in accuracy when compared to commonly used transfer learning protocols. These results are most significant with very small target datasets. We observed an increase in accuracy of 47.8% on our smallest dataset which comprised of only 10 training examples per class. These findings are important as they are likely to improve outcomes from past, current and future research in transfer learning. We expect that researchers will want to re-examine their experiments to incorporate our findings and to check the robustness of their existing results.
引用
收藏
页码:312 / 323
页数:12
相关论文
共 50 条
  • [31] Transfer Learning for Maritime Vessel Detection using Deep Neural Networks
    Farahnakian, Fahimeh
    Zelioli, Luca
    Heikkonen, Jukka
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 1 - 6
  • [32] Deep Neural Networks and Transfer Learning on a Multivariate Physiological Signal Dataset
    Bizzego, Andrea
    Gabrieli, Giulio
    Esposito, Gianluca
    BIOENGINEERING-BASEL, 2021, 8 (03):
  • [33] Music Genre Recognition using Deep Neural Networks and Transfer Learning
    Ghosal, Deepanway
    Kolekar, Maheshkumar H.
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2087 - 2091
  • [34] Transfer Learning for Arabic Named Entity Recognition With Deep Neural Networks
    Al-Smadi, Mohammad
    Al-Zboon, Saad
    Jararweh, Yaser
    Juola, Patrick
    IEEE ACCESS, 2020, 8 : 37736 - 37745
  • [35] A Transfer Approach Using Graph Neural Networks in Deep Reinforcement Learning
    Yang, Tianpei
    You, Heng
    Hao, Jianye
    Zheng, Yan
    Taylor, Matthew E.
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16352 - 16360
  • [36] Transfer Learning for Molecular Cancer Classification Using Deep Neural Networks
    Sevakula, Rahul K.
    Singh, Vikas
    Verma, Nishchal K.
    Kumar, Chandan
    Cui, Yan
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (06) : 2089 - 2100
  • [37] Power Control in massive MIMO Networks using Transfer Learning with Deep Neural Networks
    Ahmadi, Neda
    Mporas, Iosif
    Papazafeiropoulos, Anastasios
    Kourtessis, Pandelis
    Senior, John
    2022 IEEE 27TH INTERNATIONAL WORKSHOP ON COMPUTER AIDED MODELING AND DESIGN OF COMMUNICATION LINKS AND NETWORKS (CAMAD), 2022, : 89 - 93
  • [38] Transfer Entropy in Deep Neural Networks
    Andonie, R.
    Cataron, A.
    Moldovan, A.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2025, 20 (01)
  • [39] Convergence Analysis for Learning Orthonormal Deep Linear Neural Networks
    Qin, Zhen
    Tan, Xuwei
    Zhu, Zhihui
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 795 - 799
  • [40] Deep Learning with Convolutional Neural Networks for Histopathology Image Analysis
    Bosnacki, Dragan
    van Riel, Natal
    Veta, Mitko
    AUTOMATED REASONING FOR SYSTEMS BIOLOGY AND MEDICINE, 2019, 30 : 453 - 469