Semantically-enhanced topic recommendation systems for software projects

被引:6
|
作者
Izadi, Maliheh [1 ]
Nejati, Mahtab [2 ]
Heydarnoori, Abbas [3 ]
机构
[1] Delft Univ Technol, Delft, Netherlands
[2] Univ Waterloo, Waterloo, ON, Canada
[3] Bowling Green State Univ, Bowling Green, OH 43403 USA
关键词
Recommender system; Topics; Tags; Semantic relationships; Knowledge graph; Software projects; GitHub; TAG RECOMMENDATION; KNOWLEDGE;
D O I
10.1007/s10664-022-10272-w
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Software-related platforms such as GitHub and Stack Overflow, have enabled their users to collaboratively label software entities with a form of metadata called topics. Tagging software repositories with relevant topics can be exploited for facilitating various downstream tasks. For instance, a correct and complete set of topics assigned to a repository can increase its visibility. Consequently, this improves the outcome of tasks such as browsing, searching, navigation, and organization of repositories. Unfortunately, assigned topics are usually highly noisy, and some repositories do not have well-assigned topics. Thus, there have been efforts on recommending topics for software projects, however, the semantic relationships among these topics have not been exploited so far. In this work, we propose two recommender models for tagging software projects that incorporate the semantic relationship among topics. Our approach has two main phases; (1) we first take a collaborative approach to curate a dataset of quality topics specifically for the domain of software engineering and development. We also enrich this data with the semantic relationships among these topics and encapsulate them in a knowledge graph we call SED-KGraph. Then, (2) we build two recommender systems; The first one operates only based on the list of original topics assigned to a repository and the relationships specified in our knowledge graph. The second predictive model, however, assumes there are no topics available for a repository, hence it proceeds to predict the relevant topics based on both textual information of a software project (such as its README file), and SED-KGraph. We built SED-KGraph in a crowd-sourced project with 170 contributors from both academia and industry. Through their contributions, we constructed SED-KGraph with 2,234 carefully evaluated relationships among 863 community-curated topics. Regarding the recommenders' performance, the experiment results indicate that our solutions outperform baselines that neglect the semantic relationships among topics by at least 25% and 23% in terms of Average Success Rate and Mean Average Precision metrics, respectively. We share SED-KGraph, as a rich form of knowledge for the community to re-use and build upon. We also release the source code of our two recommender models, KGRec and KGRec+.
引用
收藏
页数:36
相关论文
共 50 条
  • [1] Semantically-enhanced topic recommendation systems for software projects
    Maliheh Izadi
    Mahtab Nejati
    Abbas Heydarnoori
    Empirical Software Engineering, 2023, 28
  • [2] Semantically-Enhanced Topic Modeling
    Viegas, Felipe
    Luiz, Washington
    Gomes, Christian
    Khatibi, Amir
    Canuto, Sergio
    Mourao, Fernando
    Salles, Thiago
    Rocha, Leonardo
    Goncalves, Marcos Andre
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 893 - 902
  • [3] Verification of Semantically-Enhanced Artifact Systems
    Hariri, Babak Bagheri
    Calvanese, Diego
    Montali, Marco
    Santoso, Ario
    Solomakhin, Dmitry
    SERVICE-ORIENTED COMPUTING, ICSOC 2013, 2013, 8274 : 600 - 607
  • [4] Semantically-Enhanced Recommenders
    Codina, Victor
    Ceccaroni, Luigi
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2012, 248 : 69 - 78
  • [5] Semantically-Enhanced Ubiquitous User Modeling
    Plumbaum, Till
    USER MODELING, ADAPTATION, AND PERSONALIZATION, PROCEEDINGS, 2010, 6075 : 407 - 410
  • [6] Semantically-enhanced Configurability in State Estimation Structures of Power Systems
    Milis, Georgios M.
    Asprou, Markos
    Kyriakides, Elias
    Panayiotou, Christos G.
    Polycarpou, Marios M.
    2015 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2015, : 679 - 686
  • [7] SePeRe: Semantically-Enhanced System for Pest Recognition
    Garceran-Saez, Jesus
    Garcia-Sanchez, Francisco
    ICT FOR AGRICULTURE AND ENVIRONMENT, 2019, 901 : 3 - 11
  • [8] A Semantically-Enhanced Modelling Environment for Business Process as a Service
    Hinkelmann, Knut
    Kurjakovic, Sabrina
    Lammel, Benjamin
    Laurenzi, Emanuele
    Woitsch, Robert
    2016 4TH INTERNATIONAL CONFERENCE ON ENTERPRISE SYSTEMS (ES) PROCEEDINGS, 2016, : 143 - 152
  • [9] Semantically-enhanced extension of the discussion analysis algorithm in SAKE
    Lukac, G.
    Butka, P.
    Mach, M.
    2008 6TH INTERNATIONAL SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS, 2008, : 224 - +
  • [10] Sequential visual place recognition using semantically-enhanced features
    Paturkar, Varun
    Yadav, Rohit
    Kala, Rahul
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) : 50477 - 50491