Mining Software Repositories with a Collaborative Heuristic Repository

被引:2
|
作者
Babii, Hlib [1 ]
Prenner, Julian Aron [1 ]
Stricker, Laurin [1 ]
Karmakar, Anjan [1 ]
Janes, Andrea [1 ]
Robbes, Romain [1 ]
机构
[1] Free Univ Bozen Bolzano, Bolzano, Italy
关键词
D O I
10.1109/ICSE-NIER52604.2021.00030
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many software engineering studies or tasks rely on categorizing software engineering artifacts. In practice, this is done either by defining simple but often imprecise heuristics, or by manual labelling of the artifacts. Unfortunately, errors in these categorizations impact the tasks that rely on them. To improve the precision of these categorizations, we propose to gather heuristics in a collaborative heuristic repository, to which researchers can contribute a large amount of diverse heuristics for a variety of tasks on a variety of SE artifacts. These heuristics are then leveraged by state-of-the-art weak supervision techniques to train high-quality classifiers, thus improving the categorizations. We present an initial version of the heuristic repository, which we applied to the concrete task of commit classification.
引用
收藏
页码:106 / 110
页数:5
相关论文
共 50 条
  • [41] Polyglot and Distributed Software Repository Mining with Crossflow
    Barmpis, Konstantinos
    Neubauer, Patrick
    Co, Jonathan
    Kolovos, Dimitris
    Matragkas, Nicholas
    Paige, Richard F.
    2020 IEEE/ACM 17TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR, 2020, : 374 - 384
  • [42] Text Mining Studies of Software Repository Contents
    Dobrzynski, Bartosz
    Sosnowski, Janusz
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING, ENASE 2023, 2023, : 562 - 569
  • [43] A survey and taxonomy of approaches for mining software repositories in the context of software evolution
    Kagdi, Huzefa
    Collard, Michael L.
    Maletic, Jonathan I.
    JOURNAL OF SOFTWARE MAINTENANCE AND EVOLUTION-RESEARCH AND PRACTICE, 2007, 19 (02): : 77 - 131
  • [44] Guest editorial: special section on mining software repositories
    Di Penta, Massimiliano
    Xie, Tao
    EMPIRICAL SOFTWARE ENGINEERING, 2015, 20 (02) : 291 - 293
  • [45] Introduction to the Special Issue on Mining Software Repositories in 2010
    Whitehead, Jim
    Zimmermann, Thomas
    EMPIRICAL SOFTWARE ENGINEERING, 2012, 17 (4-5) : 500 - 502
  • [46] Guest Editorial: Special Section on Mining Software Repositories
    Tan, Lin
    Hindle, Abram
    EMPIRICAL SOFTWARE ENGINEERING, 2019, 24 (03) : 1458 - 1460
  • [47] Guest Editorial: Special section on mining software repositories
    Romain Robbes
    Emily Hill
    Christian Bird
    Empirical Software Engineering, 2018, 23 : 833 - 834
  • [48] MSR 2004 - International Workshop on Mining Software Repositories
    Hassan, AE
    Holt, RC
    Mockus, A
    ICSE 2004: 26TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, PROCEEDINGS, 2004, : 770 - 771
  • [49] Guest Editorial: Special section on mining software repositories
    Robbes, Romain
    Hill, Emily
    Bird, Christian
    EMPIRICAL SOFTWARE ENGINEERING, 2018, 23 (02) : 833 - 834
  • [50] Guest Editorial: Special Section on Mining Software Repositories
    Lin Tan
    Abram Hindle
    Empirical Software Engineering, 2019, 24 : 1458 - 1460