Evaluating Human-AI Partnership for LLM-based Code Migration

被引:0
|
作者
Omidvar-Tehrani, Behrooz [1 ]
Ishaani, M. [2 ]
Anubhai, Anmol [2 ]
机构
[1] AWS AI Labs, Santa Clara, CA 95054 USA
[2] Amazon Web Serv, Seattle, WA USA
来源
EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024 | 2024年
关键词
Application Modernization; Code Migration; Human-AI Partnership; Human-in-the-Loop Techniques; Trust Framework;
D O I
10.1145/3613905.3650896
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The potential of Generative AI, especially Large Language Models (LLMs), to transform software development is remarkable. In this paper, we focus on one area in software development called "code migration". We define code migration as the process of transitioning the language version of a code repository by converting both the source code and its dependencies. Carefully designing an effective human-AI partnership is essential for boosting developer productivity and faster migrations when performing code migrations. Though human-AI partnerships have been generally explored in the literature, their application to code migrations remains largely unexamined. In this work, we leverage an LLM-based code migration tool called Amazon Q Code Transformation to conduct semi-structured interviews with 11 participants undertaking code migrations. We discuss human's role in the human-AI partnership ( human as a director and a reviewer) and define a trust framework based on various model outcomes to earn trust with LLMs. The guidelines presented in this paper offer a vital starting point for designing human-AI partnerships that effectively augment and complement human capabilities in software development with Generative AI.
引用
收藏
页数:8
相关论文
共 50 条
  • [11] The GitHub Recent Bugs Dataset for Evaluating LLM-based Debugging Applications
    Lee, Jae Yong
    Kang, Sungmin
    Yoon, Juyeon
    Yoo, Shin
    2024 IEEE CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION, ICST 2024, 2024, : 442 - 444
  • [12] Towards LLM-Based System Migration in Language-Driven Engineering
    Busch, Daniel
    Bainczyk, Alexander
    Steffen, Bernhard
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2024, 14390 LNCS : 191 - 200
  • [13] Perfection Not Required? Human-AI Partnerships in Code Translation
    Weisz, Justin D.
    Muller, Michael
    Houde, Stephanie
    Richards, John
    Ross, Steven, I
    Martinez, Fernando
    Agarwal, Mayank
    Talamadupula, Kartik
    IUI '21 - 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, 2021, : 402 - 412
  • [14] Human-AI Collaboration for Remote Sighted Assistance: Perspectives from the LLM Era
    Yu, Rui
    Lee, Sooyeon
    Xie, Jingyi
    Billah, Syed Masum
    Carroll, John M.
    FUTURE INTERNET, 2024, 16 (07)
  • [15] Human-AI Collaboration in Cooperative Games: A Study of Playing Codenames with an LLM Assistant
    Sidji, Matthew
    Smith, Wally
    Rogerson, Melissa J.
    Proceedings of the ACM on Human-Computer Interaction, 2024, 8 (CHI PLAY)
  • [16] Escalating LLM-based Code Translation Benchmarking into the Class-level Era
    Xue, Pengyu
    Wu, Linhao
    Wang, Chengyi
    Li, Xiang
    Yang, Zhen
    Jin, Ruikai
    Zhang, Yuxiang
    Li, Jia
    Pei, Yifei
    Shen, Zhaoyan
    Lyu, Xiran
    arXiv,
  • [17] CodeSift: An LLM-Based Reference-Less Framework for Automatic Code Validation
    Aggarwal, Pooja
    Chatterjee, Oishik
    Dai, Ting
    Mohapatra, Prateeti
    Paulovicks, Brent
    Blancett, Brad
    De Magalhaes, Arthur
    2024 IEEE 17TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD 2024, 2024, : 404 - 410
  • [18] Behavior Alignment: A New Perspective of Evaluating LLM-based Conversational Recommendation Systems
    Yang, Dayu
    Chen, Fumian
    Fang, Hui
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2286 - 2290
  • [19] Evaluating LLM-based generative AI tools in emergency triage: A comparative study of ChatGPT Plus, Copilot Pro, and triage nurses
    Arslan, B.
    Nuhoglu, C.
    Satici, M. O.
    Altinbilek, E.
    AMERICAN JOURNAL OF EMERGENCY MEDICINE, 2025, 89 : 174 - 181
  • [20] Exploring the application of LLM-based AI in UX design: an empirical case study of ChatGPT
    Zhou, Zhibin
    Li, Yaoqi
    Yu, Junnan
    HUMAN-COMPUTER INTERACTION, 2024,