On the Importance of Data Size in Probing Fine-tuned Models

被引:0
|
作者
Mehrafarin, Houman [1 ]
Rajaee, Sara [1 ]
Pilehvar, Mohammad Taher [2 ]
机构
[1] Iran Univ Sci & Technol, Tehran, Iran
[2] Khatam Univ, Tehran Inst Adv Studies, Tehran, Iran
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several studies have investigated the reasons behind the effectiveness of fine-tuning, usually through the lens of probing. However, these studies often neglect the role of the size of the dataset on which the model is fine-tuned. In this paper, we highlight the importance of this factor and its undeniable role in probing performance. We show that the extent of encoded linguistic knowledge depends on the number of fine-tuning samples. The analysis also reveals that larger training data mainly affects higher layers, and that the extent of this change is a factor of the number of iterations updating the model during fine-tuning rather than the diversity of the training samples. Finally, we show through a set of experiments that fine-tuning data size affects the recoverability of the changes made to the model's linguistic knowledge.(1)
引用
收藏
页码:228 / 238
页数:11
相关论文
共 50 条
  • [21] On the Generalization Abilities of Fine-Tuned Commonsense Language Representation Models
    Shen, Ke
    Kejriwal, Mayank
    ARTIFICIAL INTELLIGENCE XXXVIII, 2021, 13101 : 3 - 16
  • [22] THE JUGGERNAUT GETS FINE-TUNED
    KUSUNOKI, S
    ELECTRONICS, 1990, 63 (01): : 88 - 90
  • [23] Evil in the Fine-Tuned World
    Azadegan, Ebrahim
    HEYTHROP JOURNAL, 2019, 60 (05): : 795 - 804
  • [24] Fine-tuned nanoparticles synthesis for controlled pore size and condensation degree
    Hjelvik, Elizabeth
    Noureddine, Achraff
    Agola, Jacob
    Croissant, Jonas
    Brinker, C. Jeffrey
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [25] Fine-tuned allosteric regulation of neuronal NMDA receptors with potential therapeutic importance
    Stanley, Nathaniel
    Sinitskiy, Anton
    Sellers, Benjamin
    Pande, Vijay
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253
  • [26] Performance Assessment of Fine-Tuned Barrier Recognition Models in Varying Conditions
    Thoma, Marios
    Partaourides, Harris
    Sreedharan, Ieswaria
    Theodosiou, Zenonas
    Michael, Loizos
    Lanitis, Andreas
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2023, PT II, 2023, 14185 : 172 - 181
  • [27] Comparative Study of Model Optimization Techniques in Fine-Tuned CNN Models
    Poojary, Ramaprasad
    Pai, Akul
    2019 INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTING TECHNOLOGIES AND APPLICATIONS (ICECTA), 2019,
  • [28] LogFiT: Log Anomaly Detection Using Fine-Tuned Language Models
    Almodovar, Crispin
    Sabrina, Fariza
    Karimi, Sarvnaz
    Azad, Salahuddin
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (02): : 1715 - 1723
  • [29] Improving Fine-Tuned Question Answering Models for Electronic Health Records
    Mairittha, Tittaya
    Mairittha, Nattaya
    Inoue, Sozo
    UBICOMP/ISWC '20 ADJUNCT: PROCEEDINGS OF THE 2020 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2020 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2020, : 688 - 691
  • [30] Vision-based Human Detection by Fine-Tuned SSD Models
    Cheng, Tang Jin
    Ab Nasir, Ahmad Fakhri
    Razman, Mohd Azraai Mohd
    Majeed, Anwar P. P. Abdul
    Li Lim, Thai
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (11) : 386 - 390