MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization

被引:0
|
作者
Yang, Zhiyu [4 ]
Zhou, Zihan [5 ]
Wang, Shuo [1 ]
ConG, Xin [1 ,2 ,3 ]
Han, Xu [1 ,2 ,3 ]
Yan, Yukun [1 ]
Liu, Zhenghao [6 ]
Tan, Zhixing [7 ]
Liu, Pengyuan [4 ]
Yu, Dong [4 ]
Liu, Zhiyuan [1 ,2 ,3 ]
Shi, Xiaodong [5 ]
Sun, Maosong [1 ,2 ,3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Tech, Beijing, Peoples R China
[2] Tsinghua Univ, Inst AI, Beijing, Peoples R China
[3] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
[4] Beijing Language & Culture Univ, Beijing, Peoples R China
[5] Xiamen Univ, Xiamen, Peoples R China
[6] Northeastern Univ, Shenyang, Peoples R China
[7] Zhongguancun Lab, Beijing, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Scientific data visualization plays a crucial role in research by enabling the direct display of complex information and assisting researchers in identifying implicit patterns. Despite its importance, the use of Large Language Models (LLMs) for scientific data visualization remains rather unexplored. In this study, we introduce MatPlotAgent, an efficient modelagnostic LLM agent framework designed to automate scientific data visualization tasks. Leveraging the capabilities of both code LLMs and multi-modal LLMs, MatPlotAgent consists of three core modules: query understanding, code generation with iterative debugging, and a visual feedback mechanism for error correction. To address the lack of benchmarks in this field, we present MatPlotBench, a high-quality benchmark consisting of 100 human-verified test cases. Additionally, we introduce a scoring approach that utilizes GPT-4V for automatic evaluation. Experimental results demonstrate that MatPlotAgent can improve the performance of various LLMs, including both commercial and open-source models. Furthermore, the proposed evaluation method shows a strong correlation with human-annotated scores.
引用
收藏
页码:11789 / 11804
页数:16
相关论文
共 50 条
  • [21] Identifying Citizen-Related Issues from Social Media Using LLM-Based Data Augmentation
    dos Santos, Vitor Gaboardi
    Santos, Guto Leoni
    Lynn, Theo
    Benatallah, Boualem
    ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2024, 2024, 14663 : 531 - 546
  • [22] ConDefects: A New Dataset to Address the Data Leakage Concern for LLM-based Fault Localization and Program Repair
    Wu, Yonghao
    Zhang, Jie M.
    Li, Zheng
    Liu, Yong
    arXiv, 2023,
  • [23] Speak From Heart: An Emotion-Guided LLM-Based Multimodal Method for Emotional Dialogue Generation
    Liu, Chenxiao
    Xie, Zheyong
    Zhao, Sirui
    Zhou, Jin
    Xu, Tong
    Li, Minglei
    Chen, Enhong
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 533 - 542
  • [24] UniDE: A multi-level and low-resource framework for automatic dialogue evaluation via LLM-based data augmentation and multitask learning
    Ye, Guanghui
    Zhao, Huan
    Zhang, Zixing
    Jiang, Zhihua
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (03)
  • [25] Usability Heuristic Evaluation of Scientific Data Analysis and Visualization Tools
    Swaid, Samar
    Maat, Mnsa
    Krishnan, Hari
    Ghoshal, Devarshi
    Ramakrishnan, Lavanya
    ADVANCES IN USABILITY AND USER EXPERIENCE, AHFE 2017, 2018, 607 : 471 - 482
  • [26] Some theoretical issues of scientific visualization as a method of data analysis
    Pilyugin, Victor
    Malikova, Eugeniya
    Adzhiev, Valery
    Pasko, Alexander
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2013, 7870 : 131 - 142
  • [27] GenG: An LLM-based Generic Time Series Data Generation Approach for Edge Intelligence via Cross-domain Collaboration
    Zhou, Xiaomao
    Jia, Qingmin
    Hu, Yujiao
    Xie, Renchao
    Huang, Tao
    Yu, E. Richard
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS, INFOCOM WKSHPS 2024, 2024,
  • [28] A Visualization Method for Scientific Research Data of University Teachers Based on Temporal Hierarchical Layout Strategy
    Yang, Qihang
    Xiao, Bin
    Fu, Lijun
    Li, Jin
    Liu, Xiaojuan
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3979 - 3985
  • [29] Evaluation Study for an ISO 13606 Archetype Based Medical Data Visualization Method
    Kopanitsa, Georgy
    JOURNAL OF MEDICAL SYSTEMS, 2015, 39 (08)
  • [30] Evaluation Study for an ISO 13606 Archetype Based Medical Data Visualization Method
    Georgy Kopanitsa
    Journal of Medical Systems, 2015, 39