Mining Software Repositories with a Collaborative Heuristic Repository

被引:2
|
作者
Babii, Hlib [1 ]
Prenner, Julian Aron [1 ]
Stricker, Laurin [1 ]
Karmakar, Anjan [1 ]
Janes, Andrea [1 ]
Robbes, Romain [1 ]
机构
[1] Free Univ Bozen Bolzano, Bolzano, Italy
关键词
D O I
10.1109/ICSE-NIER52604.2021.00030
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many software engineering studies or tasks rely on categorizing software engineering artifacts. In practice, this is done either by defining simple but often imprecise heuristics, or by manual labelling of the artifacts. Unfortunately, errors in these categorizations impact the tasks that rely on them. To improve the precision of these categorizations, we propose to gather heuristics in a collaborative heuristic repository, to which researchers can contribute a large amount of diverse heuristics for a variety of tasks on a variety of SE artifacts. These heuristics are then leveraged by state-of-the-art weak supervision techniques to train high-quality classifiers, thus improving the categorizations. We present an initial version of the heuristic repository, which we applied to the concrete task of commit classification.
引用
收藏
页码:106 / 110
页数:5
相关论文
共 50 条
  • [21] Mining Software Repositories for Accurate Authorship
    Meng, Xiaozhu
    Miller, Barton P.
    Williams, William R.
    Bernat, Andrew R.
    2013 29TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE (ICSM), 2013, : 250 - 259
  • [22] The challenges & case for mining software repositories
    Razzaq, Saad
    Maqbool, Fahad
    Anjum, Bilal
    Zafar, Samreen
    Laila, Umme
    Noor, Faiza
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 734 - +
  • [23] Guest Editorial: Mining software repositories
    Robbes, Romain
    Kamei, Yasutaka
    Pinzger, Martin
    EMPIRICAL SOFTWARE ENGINEERING, 2017, 22 (03) : 1143 - 1145
  • [24] Mining Software Repositories for Social Norms
    Dam, Hoa Khanh
    Savarimuthu, Bastin Tony Roy
    Avery, Daniel
    Ghose, Aditya
    2015 IEEE/ACM 37th IEEE International Conference on Software Engineering, Vol 2, 2015, : 627 - 630
  • [25] Software Process Simulation based on Mining Software Repositories
    Honsel, Verena
    Honsel, Daniel
    Grabowski, Jens
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 828 - 831
  • [26] The SmartSHARK Ecosystem for Software Repository Mining
    Trautsch, Alexander
    Trautsch, Fabian
    Herbold, Steffen
    Ledel, Benjamin
    Grabowski, Jens
    2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS (ICSE-COMPANION 2020), 2020, : 25 - 28
  • [27] Introduction to the special issue on mining software repositories
    Tao Xie
    Thomas Zimmermann
    Arie van Deursen
    Empirical Software Engineering, 2013, 18 : 1043 - 1046
  • [28] Mining Software Repositories Using Topic Models
    Thomas, Stephen W.
    2011 33RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2011, : 1138 - 1139
  • [29] Mining expertise of developers from software repositories
    Hammad, Maen
    Hijazi, Haneen
    Hammad, Mustafa
    Otoom, Ahmed Fawzi
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2020, 62 (03) : 227 - 239
  • [30] Introduction to the special issue on mining software repositories
    Xie, Tao
    Zimmermann, Thomas
    van Deursen, Arie
    EMPIRICAL SOFTWARE ENGINEERING, 2013, 18 (06) : 1043 - 1046