Lessons from the NLBSE 2024 Competition: Towards Building Efficient Models for GitHub Issue Classification

被引:0
|
作者
Gomez-Barrera, Daniel Fernando [1 ]
Becerra, Luccas Rojas [1 ]
Roncancio, Juan Pinzon [1 ]
Almanza, David Ortiz [1 ]
Arboleda, Juan [1 ]
Linares-Vasquez, Mario [1 ]
Manrique, Ruben Francisco [1 ]
机构
[1] Univ Los Andes, Bogota, Colombia
来源
PROCEEDINGS 2024 ACM/IEEE INTERNATIONAL WORKSHOP ON NL-BASED SOFTWARE ENGINEERING, NLBSE 2024 | 2024年
关键词
GitHub Issue Classification; Embedding; Few-shot learning; NLBSE; 2024; Competition;
D O I
10.1145/3643787.3648040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents the findings of our team's efforts during the "NLBSE 2024" competition, which centered on the multi-class classification of GitHub Issues. The challenge required models with strong few-shot learning capabilities to distinguish between 300 issues from five different repositories. Our primary strategy involved improving embeddings by developing the Classification Few Fit Sentence Transformer (CFFitST), a strategy that fine-tunes embeddings from a base sentence transformer to suit the dataset. We also explored various hypotheses concerning the optimal combination of information input and classification models. As a result, we managed to achieve an average improvement of 2.44% over the SetFit baseline.
引用
收藏
页码:45 / 48
页数:4
相关论文
共 50 条
  • [1] GitHub Issue Classification Using BERT-Style Models
    Bharadwaj, Shikhar
    Kadam, Tushar
    2022 IEEE/ACM 1ST INTERNATIONAL WORKSHOP ON NATURAL LANGUAGE-BASED SOFTWARE ENGINEERING (NLBSE 2022), 2022, : 40 - 43
  • [2] Building regional competition policy in ASEAN: lessons from the European Competition Network
    Alexandr Svetlicinii
    Asia Europe Journal, 2017, 15 : 341 - 357
  • [4] Towards A Directors' Competition Disqualification Regime in Nigeria: Lessons from the UK
    Subai, Pereowei
    Caliskan, Samet
    JOURNAL OF AFRICAN LAW, 2021, 65 (02) : 245 - 266
  • [5] Building better swarms through competition: Lessons learned from the AAAI/RoboCup rescue robot competition
    Stormont, DP
    Bhatt, A
    Boldt, B
    Skousen, S
    Berkemeier, MD
    IROS 2003: PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2003, : 2870 - 2875
  • [6] LESSONS FROM BUILDING ACOUSTIC MODELS WITH A MILLION HOURS OF SPEECH
    Parthasarathi, Hari Krishnan
    Strom, Nikko
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6670 - 6674
  • [7] Building Social Business Models: Lessons from the Grameen Experience
    Yunus, Muhammad
    Moingeon, Bertrand
    Lehmann-Ortega, Laurence
    LONG RANGE PLANNING, 2010, 43 (2-3) : 308 - 325
  • [8] Towards Efficient Edge Computing: Optimizing Lightweight Models for Weather Classification
    Seo, Kyeongeun
    Yang, Chang Mo
    Kim, Dongchil
    2024 9TH INTERNATIONAL CONFERENCE ON FOG AND MOBILE EDGE COMPUTING, FMEC 2024, 2024, : 183 - 184
  • [9] Towards Green Building Implementation in Indonesia: Lessons Learned from Singapore
    Hatmoko, Jati Utomo Dwi
    Sucipto, Taufiq Lilo Adi
    Prasetyo, Stefanus Catur Adi
    Setiawati, Apsari
    ADVANCED SCIENCE LETTERS, 2017, 23 (03) : 2548 - 2551
  • [10] Experience and Lessons Learned from the Army RCO Blind Signal Classification Competition
    Wang, Peng
    Vindiola, Manuel
    Hyatt, John S.
    Lee, Michael S.
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS, 2019, 11006