Towards Trustworthy AI Software Development Assistance

被引:0
|
作者
Maninger, Daniel [1 ,3 ]
Narasimhan, Krishna [1 ,2 ]
Mezini, Mira [1 ,3 ,4 ]
机构
[1] Tech Univ Darmstadt, Darmstadt, Germany
[2] AI Qual & Testing Hub, Frankfurt, Germany
[3] Hessian Ctr Artificial Intelligence Hessian AI, Darmstadt, Germany
[4] Natl Res Ctr Appl Cybersecur ATHENE, Darmstadt, Germany
关键词
D O I
10.1145/3639476.3639770
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is expected that in the near future, AI software development assistants will play an important role in the software industry. However, current software development assistants tend to be unreliable, often producing incorrect, unsafe, or low-quality code. We seek to resolve these issues by introducing a holistic architecture for constructing, training, and using trustworthy AI software development assistants. In the center of the architecture, there is a foundational LLM trained on datasets representative of real-world coding scenarios and complex software architectures, and fine-tuned on code quality criteria beyond correctness. The LLM will make use of graph-based code representations for advanced semantic comprehension. We envision a knowledge graph integrated into the system to provide up-to-date background knowledge and to enable the assistant to provide appropriate explanations. Finally, a modular framework for constrained decoding will ensure that certain guarantees (e.g., for correctness and security) hold for the generated code.
引用
收藏
页码:112 / 116
页数:5
相关论文
共 50 条
  • [41] Towards AI-Driven Software Development: Challenges and Lessons from the Field (Keynote)
    Yahav, Eran
    PROCEEDINGS OF THE 31ST ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2023, 2023, : 1 - 1
  • [42] An AI Harms and Governance Framework for Trustworthy AI
    Peckham, Jeremy B.
    COMPUTER, 2024, 57 (03) : 59 - 68
  • [43] Trustworthy AI: AI made in Germany and Europe?
    Hirsch-Kreinsen, Hartmut
    Krokowski, Thorben
    AI & SOCIETY, 2024, 39 (06) : 2921 - 2931
  • [44] AI-Assisted Security: A Step towards Reimagining Software Development for a Safer Future
    Shi, Yong
    Sakib, Nazmus
    Shahriar, Hossain
    Lo, Dan
    Chi, Hongmei
    Qian, Kai
    2023 IEEE 47TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC, 2023, : 991 - 992
  • [45] The EU AI Act and the Wager on Trustworthy AI
    Bellogin, Alejandro
    Grau, Oliver
    Larsson, Stefan
    Schimpf, Gerhard
    Sengupta, Biswa
    Solmaz, Guerkan
    COMMUNICATIONS OF THE ACM, 2024, 67 (12) : 58 - 65
  • [46] Editorial: Trustworthy AI for healthcare
    Agafonov, Oleg
    Babic, Aleksandar
    Sousa, Sonia
    Alagaratnam, Sharmini
    FRONTIERS IN DIGITAL HEALTH, 2024, 6
  • [47] MLOps as Enabler of Trustworthy AI
    Billeter, Yann
    Denzel, Philipp
    Chavarriaga, Ricardo
    Forster, Oliver
    Schilling, Frank-Peter
    Brunner, Stefan
    Frischknecht-Gruber, Carmen
    Reif, Monika
    Weng, Joanna
    2024 11TH IEEE SWISS CONFERENCE ON DATA SCIENCE, SDS 2024, 2024, : 37 - 40
  • [48] Building trustworthy software
    Hogan, Hank
    CONTROL ENGINEERING, 2007, 54 (07) : 78 - 81
  • [49] Towards trustworthy ai-enabled decision support systems: Validation of the multisource ai scorecard table (MAST)
    Salehi, Pouria
    Ba, Yang
    Kim, Nayoung
    Mosallanezhad, Ahmadreza
    Pan, Anna
    Cohen, Myke C.
    Wang, Yixuan
    Zhao, Jieqiong
    Bhatti, Shawaiz
    Sung, James
    Blasch, Erik
    Mancenido, Michelle V.
    Chiou, Erin K.
    Journal of Artificial Intelligence Research, 2024, 80 : 1311 - 1341
  • [50] Towards Trustworthy AI-Enabled Decision Support Systems: Validation of the Multisource AI Scorecard Table (MAST)
    Salehi, Pouria
    Ba, Yang
    Kim, Nayoung
    Mosallanezhad, Ahmadreza
    Pan, Anna
    Cohen, Myke C.
    Wang, Yixuan
    Zhao, Jieqiong
    Bhatti, Shawaiz
    Sung, James
    Blasch, Erik
    Mancenido, Michelle, V
    Chiou, Erin K.
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 80 : 1311 - 1341