Improving the detection of community smells through socio-technical and sentiment analysis

被引:5
|
作者
Almarimi, Nuri [1 ]
Ouni, Ali [1 ]
Chouchen, Moataz [1 ]
Mkaouer, Mohamed Wiem [2 ]
机构
[1] Univ Quebec, Ecole Technol Super, Montreal, PQ, Canada
[2] Rochester Inst Technol, Rochester, NY 14623 USA
基金
加拿大自然科学与工程研究理事会;
关键词
human factors in software engineering; software organization and properties; STRENGTH DETECTION; SOFTWARE; INSIGHTS; QUALITY;
D O I
10.1002/smr.2505
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Open source software development is regarded as a collaborative activity in which developers interact to build a software product. Such a human collaboration is described as an organized effort of the "social" activity of organizations, individuals, and stakeholders, which can affect the development community and the open source project health. Negative effects of the development community manifest typically in the form of community smells, which represent symptoms of organizational and social issues within the open source software development community that often lead to additional project costs and reduced software quality. Recognizing the advantages of the early detection of potential community smells in a software project, we introduce a novel approach that learns from various community organizational, social, and emotional aspects to provide an automated support for detecting community smells. In particular, our approach learns from a set of interleaving organizational-social and emotional symptoms that characterize the existence of community smell instances in a software project. We build a multi-label learning model to detect 10 common types of community smells. We use the ensemble classifier chain (ECC) model that transforms multi-label problems into several single-label problems, which are solved using genetic programming (GP) to find the optimal detection rules for each smell type. To evaluate the performance of our approach, we conducted an empirical study on a benchmark of 143 open source projects. The statistical tests of our results show that our approach can detect community smells with an average F-measure of 93%, achieving a better performance compared to different state-of-the-art techniques. Furthermore, we investigate the most influential community-related metrics to identify each community smell type.
引用
收藏
页数:26
相关论文
共 50 条