Mining Software Repositories with a Collaborative Heuristic Repository

被引:2
|
作者
Babii, Hlib [1 ]
Prenner, Julian Aron [1 ]
Stricker, Laurin [1 ]
Karmakar, Anjan [1 ]
Janes, Andrea [1 ]
Robbes, Romain [1 ]
机构
[1] Free Univ Bozen Bolzano, Bolzano, Italy
关键词
D O I
10.1109/ICSE-NIER52604.2021.00030
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many software engineering studies or tasks rely on categorizing software engineering artifacts. In practice, this is done either by defining simple but often imprecise heuristics, or by manual labelling of the artifacts. Unfortunately, errors in these categorizations impact the tasks that rely on them. To improve the precision of these categorizations, we propose to gather heuristics in a collaborative heuristic repository, to which researchers can contribute a large amount of diverse heuristics for a variety of tasks on a variety of SE artifacts. These heuristics are then leveraged by state-of-the-art weak supervision techniques to train high-quality classifiers, thus improving the categorizations. We present an initial version of the heuristic repository, which we applied to the concrete task of commit classification.
引用
收藏
页码:106 / 110
页数:5
相关论文
共 50 条
  • [31] Guest editorial: Mining software repositories 2018
    Kamei, Yasutaka
    Zaidman, Andy
    EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (03) : 2055 - 2057
  • [32] Guest editorial: Mining software repositories 2018
    Yasutaka Kamei
    Andy Zaidman
    Empirical Software Engineering, 2020, 25 : 2055 - 2057
  • [33] Mining software repositories for comprehensible software fault prediction models
    Vandecruys, Olivier
    Martens, David
    Baesens, Bart
    Mues, Christophe
    De Backer, Manu
    Haesen, Raf
    JOURNAL OF SYSTEMS AND SOFTWARE, 2008, 81 (05) : 823 - 839
  • [34] Mining Software Repositories to Identify Library Experts
    Santos, Adriano
    Souza, Mauricio
    Oliveira, Johnatan
    Figueiredo, Eduardo
    XII BRAZILIAN SYMPOSIUM ON SOFTWARE COMPONENTS, ARCHITECTURES, AND REUSE (SBCARS), 2018, : 83 - 91
  • [35] MetricMiner: Supporting Researchers in Mining Software Repositories
    Sokol, Francisco Zigmund
    Aniche, Mauricio Finavaro
    Gerosa, Marco Aurelio
    2013 IEEE 13TH INTERNATIONAL WORKING CONFERENCE ON SOURCE CODE ANALYSIS AND MANIPULATION (SCAM), 2013, : 142 - 146
  • [36] Mining Software Repositories for Automatic Interface Recommendation
    Sun, Xiaobing
    Li, Bin
    Duan, Yucong
    Shi, Wei
    Liu, Xiangyue
    SCIENTIFIC PROGRAMMING, 2016, 2016
  • [37] Research on mining software repositories to facilitate refactoring
    Nyamawe, Ally S.
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2023, 13 (05)
  • [38] Visual data mining and analysis of software repositories
    Voinea, Lucian
    Telea, Alexandru
    COMPUTERS & GRAPHICS-UK, 2007, 31 (03): : 410 - 428
  • [39] Manas: Mining Software Repositories to Assist AutoML
    Nguyen, Giang
    Islam, Md Johirul
    Pan, Rangeet
    Rajan, Hridesh
    2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2022), 2022, : 1368 - 1380
  • [40] Mining software repositories for software architecture - A systematic mapping study
    Soliman, Mohamed
    Albonico, Michel
    Malavolta, Ivano
    Wortmann, Andreas
    INFORMATION AND SOFTWARE TECHNOLOGY, 2025, 181