Code Understanding Linter to Detect Variable Misuse

被引:0
|
作者
Ryou, Yeonhee [1 ]
Joh, Sangwoo [1 ]
Yang, Joonmo [1 ]
Kim, Sujin [1 ]
Kim, Youil [1 ]
机构
[1] Samsung Res, Seoul, South Korea
关键词
language models of code; automated code review; bug detection; variable misuse;
D O I
10.1145/3551349.3559497
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We share our experience in developing Code Understanding Linter, an automated code review tool based on language models of code. We introduce several ideas to make the tool be more practical, including combining two different language models, filtering meaningless outputs from the model, and generating developer-friendly diagnosis messages by interpreting the outputs from the model. On top of those ideas, we describe the design and implementation of an automated code review tool to detect variable-misuse defects in Python codes and suggest how to fix them. We evaluated the tool with a set of code repositories in Samsung Electronics, which contains real-world Python codes. Our experiment proves that our tool can discover hidden defects in the real-world codes, but the false positive rate is far higher than we expected. After manually investigating every false positives, we discuss the limitations of the language models and possible solutions.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Understanding Misunderstandings in Source Code
    Gopstein, Dan
    Iannacone, Jake
    Yan, Yu
    DeLong, Lois
    Zhuang, Yanyan
    Yeh, Martin K. -C.
    Cappos, Justin
    ESEC/FSE 2017: PROCEEDINGS OF THE 2017 11TH JOINT MEETING ON FOUNDATIONS OF SOFTWARE ENGINEERING, 2017, : 129 - 139
  • [32] THE DEVELOPMENT OF AN UNDERSTANDING OF THE PAULINE CODE
    WILCKENS, U
    NEW TESTAMENT STUDIES, 1982, 28 (02) : 154 - 190
  • [33] Understanding and Supporting Code Performances
    Manesh, Daniel
    PROCEEDINGS OF THE 16TH CONFERENCE ON CREATIVITY AND COGNITION, C&C 2024, 2024, : 21 - 24
  • [34] The forgotten variable of understanding
    Smedslund, J
    CAHIERS DE PSYCHOLOGIE COGNITIVE-CURRENT PSYCHOLOGY OF COGNITION, 1997, 16 (1-2): : 217 - 221
  • [35] Understanding the MSJC Masonry Code
    Constr Specifier, 3 (64):
  • [36] THE MISUSE OF ANALYSIS OF VARIANCE TO DETECT SYNERGY IN COMBINATION-DRUG STUDIES
    CAUDLE, RM
    WILLIAMS, GM
    PAIN, 1993, 55 (03) : 313 - 317
  • [37] Understanding and Detecting Harmful Code
    Lima, Rodrigo
    Souza, Jairo
    Fonseca, Baldoino
    Teixeira, Leopoldo
    Gheyi, Rohit
    Ribeiro, Marcio
    Garcia, Alessandro
    de Mello, Rafael
    34TH BRAZILIAN SYMPOSIUM ON SOFTWARE ENGINEERING, SBES 2020, 2020, : 223 - 232
  • [38] Understanding the chromatin remodeling code
    Ha, Misook
    PLANT SCIENCE, 2013, 211 : 137 - 145
  • [39] Code Syntax Is Understanding Response
    Kamp, Poul-Henning
    COMMUNICATIONS OF THE ACM, 2011, 54 (02) : 7 - 7
  • [40] Scalable analytics to detect DNS misuse for establishing stealthy communication channels
    Schales, D. L.
    Jang, J.
    Wang, T.
    Hu, X.
    Kirat, D.
    Wuest, B.
    Stoecklin, M. Ph.
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2016, 60 (04)