VisRepo: A Visual Retrieval Tool for Large-Scale Open-Source Projects

被引:0
|
作者
Yue, Xiaoqi [1 ]
Liu, Chao [1 ]
Zhang, Neng [2 ]
Hu, Haibo [1 ]
Zhang, Xiaohong [1 ]
机构
[1] Chongqing Univ, Chongqing, Peoples R China
[2] Sun Yat Sen Univ, Zhuhai, Peoples R China
基金
中国博士后科学基金;
关键词
Open-Source Project Retrieval; Visualization; Software Data Mining;
D O I
10.1145/3671016.3671409
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To improve software development productivity, developers frequently search for projects on open-source communities such as GitHub. However, it is challenging for users to quickly find suitable projects from numerous results due to the overload of project information. Although many tools have been proposed to rank the relevancy of searched results, manually inspecting them one by one is irreplaceable and time-consuming. To fill this gap, we propose a visual retrieval tool named VisRepo for open-source software projects. Firstly, it mines software project data from four perspectives including topic, technology, usability, and comprehensibility, and connects projects based on the same owners/contributors and similar topics. Then, visualization technique is employed to present complex software data intuitively. VisRepo provides users an interactive retrieval paradigm of Search-Explore-Check-Recommend with in-depth insights and better exploration experience. We evaluate VisRepo on 7w+ open-source JavaScript projects. Experimental results showed that VisRepo outperforms GitHub search engine in terms of time consumption and accuracy, meanwhile enabling a more interactive and useful user experience. Demo Source Code: https://github.com/YUEchn/visrepo Demo Video: https://youtu.be/-fqL8ngSmwQ
引用
收藏
页码:499 / 502
页数:4
相关论文
共 50 条
  • [31] BeeGround - An Open-Source Simulation Platform for Large-Scale Swarm Robotics Applications
    Lim, Sean
    Wang, Shiyi
    Lennox, Barry
    Arvin, Farshad
    2021 7TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA 2021), 2021, : 75 - 79
  • [32] An open-source framework for large-scale transient topology optimization using PETSc
    Hansotto Kristiansen
    Niels Aage
    Structural and Multidisciplinary Optimization, 2022, 65
  • [33] A large-scale empirical exploration on refactoring activities in open source software projects
    Vassallo, Carmine
    Grano, Giovanni
    Palomba, Fabio
    Gall, Harald C.
    Bacchelli, Alberto
    SCIENCE OF COMPUTER PROGRAMMING, 2019, 180 : 1 - 15
  • [34] Code Coverage and Postrelease Defects: A Large-Scale Study on Open Source Projects
    Kochhar, Pavneet Singh
    Lo, David
    Lawall, Julia
    Nagappan, Nachiappan
    IEEE TRANSACTIONS ON RELIABILITY, 2017, 66 (04) : 1213 - 1228
  • [35] A bug finder refined by a large set of open-source projects
    Nam, Jaechang
    Wang, Song
    Xi, Yuan
    Tan, Lin
    INFORMATION AND SOFTWARE TECHNOLOGY, 2019, 112 : 164 - 175
  • [36] TypeScript: An Open-Source Programming Language with Options for Robust Development and Large-Scale Applications
    Acropolis Institute of Technology and Research, Dept. of Computer Science and Information Technology, Indore, India
    Int. Conf. Adv. Comput. Res. Sci. Eng. Technol., ACROSET, 2024,
  • [37] QuoVidi: An open-source web application for the organization of large-scale biological treasure hunts
    Lobet, Guillaume
    Descamps, Charlotte
    Leveau, Lola
    Guillet, Alain
    Rees, Jean-Francois
    ECOLOGY AND EVOLUTION, 2021, 11 (08): : 3516 - 3526
  • [38] GATECloud.net: a platform for large-scale, open-source text processing on the cloud
    Tablan, Valentin
    Roberts, Ian
    Cunningham, Hamish
    Bontcheva, Kalina
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2013, 371 (1983):
  • [39] SGL: A domain-specific language for large-scale analysis of open-source code
    Foo, Darius
    Yi, Ang Ming
    Yeo, Jason
    Sharma, Asankhaya
    2018 IEEE CYBERSECURITY DEVELOPMENT CONFERENCE (SECDEV 2018), 2018, : 61 - 68
  • [40] Empowering OCL research: a large-scale corpus of open-source data from GitHub
    Josh G. M. Mengerink
    Jeroen Noten
    Alexander Serebrenik
    Empirical Software Engineering, 2019, 24 : 1574 - 1609